<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-10-100</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>An integrated approach for the systematic identification and characterization of heart-enriched genes with unknown functions</p>
         </title>
         <aug>
            <au ce="yes" id="A1">
               <snm>Uchida</snm>
               <fnm>Shizuka</fnm>
               <insr iid="I1"/>
               <email>Shizuka.Uchida@mpi-bn.mpg.de</email>
            </au>
            <au ce="yes" id="A2">
               <snm>Schneider</snm>
               <fnm>Andr&#233;</fnm>
               <insr iid="I1"/>
               <email>Andre.Schneider@mpi-bn.mpg.de</email>
            </au>
            <au id="A3">
               <snm>Wiesnet</snm>
               <fnm>Marion</fnm>
               <insr iid="I1"/>
               <email>Marion.Wiesnet@mpi-bn.mpg.de</email>
            </au>
            <au id="A4">
               <snm>Jungblut</snm>
               <fnm>Benno</fnm>
               <insr iid="I1"/>
               <email>Benno.Jungblut@mpi-bn.mpg.de</email>
            </au>
            <au id="A5">
               <snm>Zarjitskaya</snm>
               <fnm>Polina</fnm>
               <insr iid="I1"/>
               <email>Polina.Zarjitskaya@mpi-bn.mpg.de</email>
            </au>
            <au id="A6">
               <snm>Jenniches</snm>
               <fnm>Katharina</fnm>
               <insr iid="I1"/>
               <email>Katharina.Jenniches@mpi-bn.mpg.de</email>
            </au>
            <au id="A7">
               <snm>Kreymborg</snm>
               <mnm>grosse</mnm>
               <fnm>Karsten</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>grosse.Kreymborg@mpi-bn.mpg.de</email>
            </au>
            <au id="A8">
               <snm>Seeger</snm>
               <fnm>Werner</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>Werner.Seeger@innere.med.uni-giessen.de</email>
            </au>
            <au ca="yes" id="A9">
               <snm>Braun</snm>
               <fnm>Thomas</fnm>
               <insr iid="I1"/>
               <email>Thomas.Braun@mpi-bn.mpg.de</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Max-Planck-Institute for Heart and Lung Research, Parkstr. 1, 61231 Bad Nauheim, Germany</p>
            </ins>
            <ins id="I2">
               <p>University of Giessen Lung Center (UGLC), Medical Clinic II and V, Justus-Liebig-University Giessen, Klinikstrasse 36, 35392 Giessen, Germany</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>1</issue>
         <fpage>100</fpage>
         <url>http://www.biomedcentral.com/1471-2164/10/100</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19267916</pubid>
               <pubid idtype="doi">10.1186/1471-2164-10-100</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>03</day>
               <month>11</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>06</day>
               <month>3</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>06</day>
               <month>3</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Uchida et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>High throughput techniques have generated a huge set of biological data, which are deposited in various databases. Efficient exploitation of these databases is often hampered by a lack of appropriate tools, which allow easy and reliable identification of genes that miss functional characterization but are correlated with specific biological conditions (e.g. organotypic expression).</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We have developed a simple algorithm (DGSA = <ul>D</ul>atabase-dependent <ul>G</ul>ene <ul>S</ul>election and <ul>A</ul>nalysis) to identify genes with unknown functions involved in organ development concentrating on the heart. Using our approach, we identified a large number of yet uncharacterized genes, which are expressed during heart development. An initial functional characterization of genes by loss-of-function analysis employing morpholino injections into zebrafish embryos disclosed severe developmental defects indicating a decisive function of selected genes for developmental processes.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>We conclude that DGSA is a versatile tool for database mining allowing efficient selection of uncharacterized genes for functional analysis.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>In recent years, the advent of high-throughput analytical techniques, such as microarrays and serial analysis of gene expression (SAGE), has led to a rapid accumulation of biological data. The large size of databases, which are now within the petabytes range <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> precludes manual analysis and renders unsystematic approaches obsolete. To cope with these new challenges and to facilitate efficient data analyses, numerous academic and commercial software packages and databases have been developed <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Yet, genes to which no biological function has been assigned compromise the usability of these data. In particular, the construction of linear and non-linear models has proven to be difficult if "function unknown genes" are included <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Any attempts to integrate and analyze complex biological data from various "omics" techniques (e.g. transcriptomics, proteomics, and metabolomics) to understand biological phenomena as a collection of interconnected systems will depend on the functional annotation of the majority of their components <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Thus, successful approaches to "Systems Biology" will depend on the identification and functional characterization of most if not all players in the system under study <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. An obvious, almost trivial conclusion is the systematic identification of only those genes, which lack biologically valid annotations, followed by functional characterization. Surprisingly, only few attempts were made to address this problem systematically although most databases contain large numbers of genes, which have only been identified by computational sequence analyses <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Instead, most efforts have concentrated on large-scale functional analyses such a proteome-wide protein interaction screens <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp> or genome-wide siRNA knockdowns <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>, which are much more difficult to perform and are notoriously unreliable <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. Other approaches such as ChIP-on-chip <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> and ChIP-Seq <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> are limited to specific biological properties (e.g. DNA binding).</p>
         <p>The relevance of this problem becomes apparent in experiments that use large datasets such as in DNA microarray hybridizations. Transcriptional profiling of a diseased heart, for example, will identify changes in the expression of known signaling pathways as well as genes, which do not fit into existing regulatory circuits (see reviews <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>). Although the expression profile of the latter group of genes might correlate strongly with certain physiological conditions or developmental status suggesting potential interactions with known regulatory networks, reliable data to describe the biological function are most often missing.</p>
         <p>Here, we describe a strategy for the systematic identification of genes, which are correlated with a specific biological condition (e.g. organ-specific expression) but lack functional characterization. Our approach is based on the systematic exploitation of existing databases such as the UniGene database <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, which contain information about organ-specific expression patterns, applying a number of different filters. Information gathered from expression databases were matched with publication records based on the assumption that virtually all genes with a functional characterization will also relay to a corresponding publication. Although the focus of the study was on "heart-enriched genes", our approach is applicable to virtually all organs and physiological status. The expression profiles of identified gene were validated by RT-PCR using different murine adult organs and by whole-mount <it>in situ </it>(WISH) hybridization of mouse embryos. The functional role of selected genes was also approached by morpholino knock-down experiments in zebrafish embryos</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>In Silico Screening of UniGene Database</p>
            </st>
            <p>Several public databases for "omics" data are available, which cover the expression profile of individual genes in various organisms to a different extent. For our study, we employed the UniGene database, which contains comprehensive coverage of various organisms and a collection of expression data. Each UniGene ID has an attached "Expression Profile", which can be used to search for specific expression sites. To identify genes in a systematic manner, which show a preferential expression in the heart but lack a proper functional annotation, we selected 4 different organisms for our analysis (rat, mouse, human, and chicken) (Figure <figr fid="F1">1</figr>). Applying the rule "Select genes whose rank for 'heart' is under the top 20% across homologues" resulted in the identification of 2,348 mouse genes, which fitted the rule in more than one organism. It should be pointed out that the combination of expression data from different organisms reduced the number of false-positive candidates. Since the expression profile of genes is normally conserved among evolutionary close species, any expression profile that is unique for a single species is likely to indicate faulty expression profiling rather than a biological meaningful result.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Flowchart of in silico UniGene profile screening</p>
               </caption>
               <text>
                  <p><b>Flowchart of in silico UniGene profile screening</b>. At the time of this study, 27,474 mouse GeneIDs (GI) with UniGene IDs were analyzed.</p>
               </text>
               <graphic file="1471-2164-10-100-1"/>
            </fig>
            <p>After identification of heart-enriched genes, we searched for genes in this group, which lack a proper annotation. We assumed that genes with no or very few publications are "uncharacterized genes". To avoid papers describing large screening efforts (e.g. microarrays, sequencing), which usually do not include a functional analysis, we excluded papers that contained more than 100 GeneIDs (GI). Obviously, this selection is rather strict resulting in exclusion of potentially interesting genes. To address this problem, we added another selection criterion that was based on Medical Subject Headings (MeSH). Inclusion of the MeSH term for "heart" with less than 2 publications resulted in the identification of 1,975 "uncharacterized" mouse genes, which show an expression in the heart. Figure <figr fid="F2">2</figr> depicts the numbers of heart-enriched genes, which were identified using different combinations of organisms. Selection of genes based on the concurrent expression in the heart in <it>all </it>model organisms massively decreased the number of positive genes yielding 124 heart-enriched genes of which 79 are uncharacterized. Unfortunately, the quality of the expression data is not identical in all model organisms, which resulted in exclusion of genes, which have not been properly accessed but in fact are expressed in the heart. An improvement of the quality of the expression data will reduce the number of false-negative genes in the future.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Venn diagram for heart-enriched genes</p>
               </caption>
               <text>
                  <p><b>Venn diagram for heart-enriched genes</b>. The numbers of heart-enriched genes selected for the top 20% in the ranking of UniGene tissue expression profiles are shown for the 4 organisms analyzed and displayed as combinations of these organisms. In each combination, the number of genes with less than 2 articles after MeSH term filtering for "heart" is shown in parenthesis.</p>
               </text>
               <graphic file="1471-2164-10-100-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Validation of the In Silico Screening</p>
            </st>
            <p>To demonstrate the efficiency of our method, we searched the Gene Ontology project <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> for the term "heart" and matched the results with our data set that was based on the analysis of the expression data including genes, which lack an annotation. We used 79 GO terms that contain the phrase "heart" and screened the gene names and symbols of the 4 selected model organisms. We found that 32 out of the 79 GO terms contained at least one gene in the organisms studied. The number of genes that were obtained through our method and the coverage of GO terms in percentages of the genes that matched our selection criteria are shown in Additional File <supplr sid="S1">1</supplr>. For example, about 30% of the genes that carry the GO term "heart development (GO:0007507)" were covered by our method (34% for mouse, 28% for rat, 37% for human, and 37% for chicken). The information about all 43 mouse genes that were extracted through our method using the GO term "heart development" is listed in Additional File <supplr sid="S2">2</supplr>.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>Coverage of GO terms by genes that matched the applied selection criteria.</b> For each organism, the percentage coverage was derived by dividing the number of genes obtained (numerator in the parenthesis) by the number of genes that are classified under a specific GO term (denominator in the parenthesis). Fisher's exact test was applied to each GO coverage.</p>
               </text>
               <file name="1471-2164-10-100-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b>Combinatorial GO annotations of mouse genes categorized under "heart development". </b>43 genes that are categorized under "heart development (GO:0007507)" were obtained. Gene symbols, GeneIDs, and combinations of organisms, in which they were selected, are indicated.</p>
               </text>
               <file name="1471-2164-10-100-S2.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Further Selection Rules to Eliminate Ubiquitously Expressed Genes</p>
            </st>
            <p>To validate the data obtained by our selection rules and to confirm the database records, RT-PCR experiments using mouse adult organs were performed. Surprisingly, we found that a large number of genes were expressed in multiple organs (Figure <figr fid="F3">3A</figr>). We therefore devised a set of additional rules to exclude ubiquitously expressed genes: (i) Genes must be expressed in less than 50% of total tissues; (ii) Sum of Gene EST must be less than 400; and (iii) At least 25% of Gene EST must be expressed in the target tissue (e.g. heart). Application of these rules efficiently eliminated ubiquitously expressed genes and left genes that were expressed preferentially in the heart (Figure <figr fid="F4">4B</figr>). Interestingly, most of the genes that remained after this selection were also expressed in striated muscles. This result was not unexpected given the fact that skeletal muscles and the heart are both composed of striated muscle cells, which share numerous morphological and physiological properties. Table <tblr tid="T1">1</tblr> lists the number of mouse genes that were identified by using the combination of our selection rules. Inclusion of all three additional selection rules left only 30 mouse genes while inclusion of selection rules (i) and (ii) left 920 genes. As shown in Figure <figr fid="F4">4B</figr>, inclusion of selection rules (i) and (ii) did already efficiently exclude ubiquitously expressed genes. Addition of the third selection rule did not improve massively the number of genes with a heart-enriched expression pattern but result in a general decrease of selected genes. The inclusion of the third selection criterion proved to be helpful only for some cases (data not shown).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>RT-PCR results of selected genes</p>
               </caption>
               <text>
                  <p><b>RT-PCR results of selected genes</b>. RT-PCR results of candidate genes derived without (A) and with (B) inclusion of additional selections rules described in materials and methods. The selection rules, which were applied for each of the analyzed are shown in (B) next to the images of the agarose gels. (C) <it>Gapdh </it>and <it>Actb </it>were used as loading controls. Water was used as a negative control for all reactions. With the exception of <it>Gapdh </it>and <it>Actb </it>(25 cycles), 28 amplification cycles were used.</p>
               </text>
               <graphic file="1471-2164-10-100-3"/>
            </fig>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>WISH results of selected genes</p>
               </caption>
               <text>
                  <p><b>WISH results of selected genes</b>. Expression patterns of 8 representative genes are shown. Forelimbs were removed from E11.5 embryos to reveal the expression in the heart, if necessary.</p>
               </text>
               <graphic file="1471-2164-10-100-4"/>
            </fig>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Combination of selection rules successively restricts the number of heart-enriched genes</p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Selection Category</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Number of Mouse Genes</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of heart-enriched genes</p>
                     </c>
                     <c ca="left">
                        <p>2,348</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>With less than 2 publications in heart field in all 4 organisms</p>
                     </c>
                     <c ca="left">
                        <p>1,975</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>+ Mouse Further Rules 1 and others</p>
                     </c>
                     <c ca="left">
                        <p>926</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>+ Mouse Further Rules 1,2 or 1,2,3</p>
                     </c>
                     <c ca="left">
                        <p>920</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>+ Mouse Further Rules 1,2,3</p>
                     </c>
                     <c ca="left">
                        <p>30</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The total numbers of genes that were identified using various combinations of selection tools are listed.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Identification of Developmentally Expressed Genes</p>
            </st>
            <p>The expression in the adult heart does not necessarily mean that the corresponding gene is also expressed during early heart development. We therefore chose 20 genes from the genes that emerged from the computational screening and subjected them to whole-mount in situ hybridization (WISH) using E10.5 and E11.5 mouse embryos. At this stage, most organs have either formed or undergo organogenesis. 18 out of the 20 genes showed expression in the heart while 2 genes failed to display a specific expression at these stages (Figure <figr fid="F4">4</figr> and data not shown). Based on the expression pattern, we distinguished two different groups: Group 1 showed equally strong expression in the heart both at E10.5 and E11.5 (<it>4632412N22Rik</it>, <it>A5530016L24Rik</it>, <it>Klhl31</it>, <it>Fsd2</it>, <it>Lrrc39</it>, and <it>Rnf207</it>); and Group 2 showed an increase of expression at E11.5 compared to E10.5 (<it>Fhod3 </it>and <it>Lmod3</it>).</p>
         </sec>
         <sec>
            <st>
               <p>Preliminary Functional Characterization of Selected Genes by Loss-of-Function in Zebrafish Embryos</p>
            </st>
            <p>To obtain initial functional data for the selected genes, we turned to a loss-of-function analysis. Two uncharacterized, heart-enriched genes were chosen: leucine rich repeat containing 39 (<it>Lrrc39</it>) and kelch-like 31 (Drosophila) (<it>Klhl31</it>). <it>Lrrc39 </it>is highly expressed in the adult heart based on the RT-PCR analysis, and also found in eyes, skeletal muscle, brown fat, and weakly in the stomach (Figure <figr fid="F3">3B</figr>). During development, <it>Lrrc39 </it>is expressed at E10.5 in the heart, in somites and in the eye anlagen (Figure <figr fid="F4">4</figr>). <it>Klhl31 </it>showed a slightly different expression pattern. It was expressed in the adult heart, brown fat, skeletal muscle, and testes (Figure <figr fid="F3">3B</figr>). <it>Klhl31 </it>was also detected very strongly in the heart during development at E10.5 by WISH (Figure <figr fid="F4">4</figr>).</p>
            <p>We decided to accomplish the loss-of-function analysis in zebrafish embryos because knock-down of genes is relatively easily achieved in zebrafish by injection of antisense morpholinos. In addition, the development of the cardiovascular system in zebrafish embryos can be conveniently monitored by microscopical inspections. The zebrafish homologues of <it>Lrrc39 </it>and <it>Klhl31 </it>were identified using HomoloGene <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> yielding "<it>zgc:112088</it>" and "<it>klhl31</it>" (which are indicated as "<it>zLrrc39</it>" and "<it>zKlhl31</it>" respectively). Morpholinos against "<it>zLrrc39</it>" and "<it>zKlhl31</it>" were directed at their ATG sites.</p>
            <p><it>zLrrc39 </it>morphants showed the first signs of disturbed development at 48 hour post fertilization (hpf). At 108 hpf, <it>zLrrc39 </it>morphants displayed a considerable pericardial edema, which is a characteristic sign of cardiac malfunctions (Figure <figr fid="F5">5A</figr>). All morphants died within 12 hours after this time point. To facilitate detection of developmental defects in the heart, we employed a transgenic zebrafish line "<it>Tg(myl7:EGFP-HRAS)</it><sup><it>s</it>883</sup>" <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, in which myocardial cells were labeled by GFP. Macroscopical evaluation of injected and control embryos at 48 hpf followed by sectioning revealed a disturbed arrangement of cardiomyocytes in the hearts of <it>zLrrc39 </it>morphants, but not in control embryos, indicating a role of <it>zLrrc39 </it>in morphogenetic processes controlling heart development (Figure <figr fid="F5">5B, C</figr>). In addition, <it>zLrrc39 </it>morphants were characterized by small eyes, malformations of the lower jaw, and distorted muscle structures (Figure <figr fid="F5">5A</figr>). It was interesting to note that the affected organs reflected the expression profile of <it>zLrrc39 </it>during mouse development.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Loss-of-function phenotypes after morpholino injections</p>
               </caption>
               <text>
                  <p><b>Loss-of-function phenotypes after morpholino injections</b>. (A) Effect of <it>zLrrc39 </it>at 108 hpf and (B) <it>zKlhl31 </it>morpholino injections on zebrafish development at 48 hpf, respectively. Lateral views of non-injected control (upper) and morphant (lower) embryos (Tg(myl7:EGFP-HRAS)s883) are shown. The arrow in (A) indicates the pericardial edema in injected <it>zLrrc39 </it>morphant embryos. (C-E) Fluorescent images of heart sections at 48 hpf of control (C), <it>zLrrc39 </it>(D) and <it>zKlhl31 </it>(E) morphants are shown. Cardiomyocytes of zebrafish embryos were marked by expression of EGFP to delineate cellular morphology. Scale bar: 50 &#956;m.</p>
               </text>
               <graphic file="1471-2164-10-100-5"/>
            </fig>
            <p>The knock-down of <it>zKlhl31 </it>resulted in a more severe phenotype compared to <it>zLrrc39 </it>essentially abrogating development at 72 hpf. At 48 hpf, <it>zKlhl31 </it>morphants showed cardiac and yolk sac edema as well as a distorted body axis (Figure <figr fid="F5">5D</figr>). Sectioning of the heart of <it>zKlhl31 </it>morphants revealed that ventricle morphogenesis was severely disturbed, which essentially prevented looping of the heart.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>In the current study, we have devised a new strategy to identify and characterize organ-specific gene, which lack a detailed functional annotation. We reason that the identification and characterization of such genes will facilitate future attempts to understand biological phenomena as a collection of interconnected systems. Our approach was based on a systematic exploitation of various databases such as UniGene. UniGene is a NCBI's database system for automatically partitioning GenBank sequences, including ESTs, into a non-redundant set of gene-oriented clusters <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Each UniGene cluster contains sequences that represent a unique gene or expressed pseudogene, together with related data sets including information about tissue types in which the gene is expressed, model organism protein similarities, and genomic locations. As of May 2008, 97 species and thousands of sequence libraries from various tissues were used to build UniGene clusters. In principle, UniGene contains comprehensive expression profiles based on sequencing results, which can be used to obtain information about expression patterns for a particular gene. UniGene also provides a category called "Restricted Expression". In order for a UniGene gene cluster to fall under this category, more than half of the GenBank sequences assigned to the cluster must come from the same source tissue. In Mus musculus, there are only 34 gene clusters (e.g. <it>Myh6</it>, <it>Myh7</it>, and <it>Nppa</it>) listed under "Restricted Expression" in "heart", which limits this application to genes that are not expressed in any other tissue and restricts the usefulness of this tool.</p>
         <p>To overcome the loss-of-information, which occurs when applying such strict criteria, we developed selection rules that are based on the conservation of expression profiles of homologous genes between different species. This strategy successfully enlarged the number of detected genes without compromising the specificity of the detection.</p>
         <p>Many studies have been conducted to identify tissue/organ-specific genes with known and unknown functions (reviewed extensively in <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>): endothelial <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>; epidiymis <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>; heart <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>; mammary gland <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>; pancreas <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>; preimplantation stages <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>; prostate <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>; skeletal muscle <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>; and testis <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Other studies focused on the discovery of biomarkers for diseases such as colon <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> and prostate cancer <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. Most of these studies utilized cDNA or EST sequences and libraries from dbEST <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> or UniGene to screen for tissue/organ-specific genes. Some of these studies were validated by additional computational methods while others used RT-PCR or Northern blotting experiments to confirm the initial database searches. Only two studies included functional data <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B35">35</abbr></abbrgrp>. In contrast to previous studies, which restricted the analysis to one or two species, we included four different organisms to identify species-conserved, heart-enriched expression patterns.</p>
         <p>Several databases such as dbEST or UniGene <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp> provide knowledge about tissue/organ-specific genes and give information about expression in different organisms <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> but are not particularly useful to serve as a starting point for further functional studies of uncharacterized genes. Our approach is simple and intuitive and does not require extensive programming and computational knowledge. We have demonstrated that DGSA (= <ul>D</ul>atabase-dependent <ul>G</ul>ene <ul>S</ul>election and <ul>A</ul>nalysis) provides an efficient means to select hitherto uncharacterized genes for further functional analysis. Since our selection criteria strongly relied on the conservation of expression profiles among species, it was straightforward to turn to a functional analysis of identified genes using non-amniotes model organisms such as zebrafish, which are particularly suited for rapid functional characterization using morpholino injections to achieve a loss-of-function phenotype. Selected genes might also be linked easily to databases of non-amniotes model organisms such as the Zebrafish Model Organisms Database (ZFIN) <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. As of April 6, 2008, 89 zebrafish homologs of heart-enriched genes were included in this database (data not shown). Of these, 25 (corresponds to 28% coverage) were linked to phenotypes in heart or cardiac-related structures (e.g. cardioblast differentiation, cardiac ventricle). Further efforts to characterize mutants of genes, which were identified by DGSA in amniotes, will certainly increase this coverage in the future.</p>
         <p>One might argue that a selection for conserved expression patterns might artificially restrict the number of genes, which can be detected or lead to the identification of genes that are not involved in the development, maintenance, or remodeling of the heart. To address this potential criticism, we matched the genes, which were identified by our selection criteria with GO terms for known heart-enriched genes. The fact that our algorithm provided 30% or more coverage for genes that are known to be involved in "heart development (GO:0007507)" clearly indicates that our selection rules work efficiently even without performing additional biological experiments. Although the current study focused on heart-enriched genes, we reasoned that our selection rules might be easily extended to other organs, such as brain, liver, spleen, and testis. In fact, we found that application of our selection rules to the above mentioned organs yielded the same coverage of GO terms as for the heart (Additional File <supplr sid="S3">3</supplr> for Venn diagrams and Additional File <supplr sid="S4">4</supplr> for GO coverage).</p>
         <suppl id="S3">
            <title>
               <p>Additional file 3</p>
            </title>
            <text>
               <p><b>Venn diagrams for brain-, liver-, spleen-, and testis-enriched genes.</b> The numbers of tissue-enriched genes selected for the top 20% in the ranking of UniGene tissue expression profiles are shown for the 4 organisms analyzed and displayed as combinations of these organisms. In each combination, the number of genes with less than 2 articles after MeSH term filtering for the corresponding tissue is shown in parenthesis. (A) brain-enriched genes; (B) liver-enriched genes; (C) spleen-enriched genes; and (D) testis-enriched genes.</p>
            </text>
            <file name="1471-2164-10-100-S3.jpeg">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional file 4</p>
            </title>
            <text>
               <p><b>GO coverage for brain-, liver-, and spleen-enriched genes.</b> For each organism, the percentage coverage of the GO term for "development" of the corresponding tissue was derived by dividing the number of genes obtained (numerator in the parenthesis) by the number of genes that are classified under a specific GO term (denominator in the parenthesis). Fisher's exact test was applied to each GO coverage. In the case of "testis-enriched genes", there is no GO term for "testis development".</p>
            </text>
            <file name="1471-2164-10-100-S4.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We have introduced an efficient way to screen for "heart-enriched" genes with unknown functions. The computational approach to screen for genes across 4 species is simple and intuitive and might be applied by experimental biologists without programming knowledge. The presented strategy can be easily extended by implementing expression information about developing embryos obtained through public microarray and SAGE databases. Further functional analysis of identified genes will help to fill the gaps in our knowledge, which prevents a comprehensive understanding of complex molecular interactions as intended by "Systems Biology" researchers.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <p>In the following section, all dataset names are indicated by quotation marks.</p>
         <sec>
            <st>
               <p>Selection Rules for UniGene Profiles</p>
            </st>
            <p>The data sources for all datasets used in this study are listed in Additional File <supplr sid="S5">5</supplr> including the FTP sites, dates of data retrievals, and version numbers, if applicable. As a starting point, UniGene's "Mm.data" was used to search for the number of mouse GeneIDs (GI) with UniGene IDs. Next, HomoloGene's "homologene.data", was used to search for homologs and their UniGene IDs in rat (Rattus norvegicus; Taxonomy ID 10116), human (Homo sapiens; Taxonomy ID 9606), and chicken (Gallus gallus; Taxonomy ID 9031). The UniGene tissue expression profiles ("Mm.profiles" for mouse, "Rn.profiles" for rat, "Hs.profiles" for human, and "Gga.profiles" for chicken) were screened with the selection rule: "Select genes whose rank for 'heart' is under the top 20% across homologs." At the time of the screening, the number of tissues for each organism and the definition of the top 20% were as follows (organism name, number of tissues, number of tissues that fall under the top 20% category): Mouse, 47, 9; Rat, 25, 5; Human, 45, 9; and Chicken, 18, 4. Selected genes were subjected to the following further selection rules to eliminate ubiquitously expressed genes in mouse: (i) Expressed in less than 50% of total tissues. (ii) Sum of Gene EST must be less than 400. (iii) At least 25% of Gene EST must be expressed in the target tissue. A flowchart of the UniGene in silico screening procedure is given in Figure <figr fid="F1">1</figr>.</p>
            <suppl id="S5">
               <title>
                  <p>Additional file 5</p>
               </title>
               <text>
                  <p><b>List of datasets used in this study.</b> List of datasets used in this study.</p>
               </text>
               <file name="1471-2164-10-100-S5.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Identification of Publications Corresponding to GIs</p>
            </st>
            <p>To find the number of publications related to an identified gene, Entrez Gene's "gene2pubmed" was used. First, the number of GIs that are listed under each PMID was counted and classified. Publications that include more than 100 GIs were considered as articles that report large screening results (e.g. microarrays, sequencing) and excluded to be counted for the number of publications for each gene. Next, PMIDs whose classification in Medical Subject Headings (MeSH) is under "heart" were searched and cross matched to the number of publications for each gene. If this number was less than 2, the GI (gene) was defined as being "uncharacterized".</p>
         </sec>
         <sec>
            <st>
               <p>Searching for Genes with GO Terms Related to Heart</p>
            </st>
            <p>Gene Ontology <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> terms for "heart" were searched through AmiGO <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. Next, using these GO terms, gene names and their symbols were extracted from Entrez Gene's "gene2go".</p>
            <p>To demonstrate the statistical significance of GO coverage, Fisher's exact test was applied. First, numbers of genes with at least one GO term were computed from Entrez Gene's "gene2go", which resulted in 18,225 genes for mouse, 16,437 for rat, 17,905 for human, and 9,226 for chicken. Then, a Fisher's exact test was applied to 2,348 heart-enriched genes for each GO term and its coverage.</p>
         </sec>
         <sec>
            <st>
               <p>RNA Preparation, First-Strand cDNA Synthesis and RT-PCR</p>
            </st>
            <p>To extract total RNA, adult mice were sacrificed, and washed extensively with ice-cold PBS to remove blood. Isolated organ parts were frozen in liquid nitrogen and homogenized using TRIzol Reagent (Invitrogen). 2 &#956;g of purified RNA was reverse transcribed using SuperScript II First-Strand Synthesis System (Invitrogen) to synthesize the first-strand cDNA by following the manufacturer's protocol. First-strand cDNA was diluted with water to generate a 1:5 dilution. 1 &#956;l of cDNA was used for each PCR reaction. The following thermal profile was used for all PCR experiments: 95&#176;C for 2 minutes, denaturation (94&#176;C) for 30 seconds, annealing (60&#8211;64&#176;C) for 30 seconds, extension (72&#176;C) for 1 minute, followed by a final extension (72&#176;C) for 7 minutes. Amplified PCR products were separated in 2% agarose gel and visualized using UV light.</p>
         </sec>
         <sec>
            <st>
               <p>Whole-Mount In Situ Hybridization</p>
            </st>
            <p>To generate RNA probes, PCR amplified products were purified from the gel and cloned using pGEM-T Easy System (Promega). Antisense RNA probes were labeled with digoxigenin-UTP (Roche), using SP6 or T7 RNA polymerase (Promega). Embryos were fixed in 4% paraformaldehyde (PFA) dissolved in PBS/DEPC-treated water overnight and dehydrated the next day with 25%, 50%, 75%, and 100% methanol. For whole-mount in situ hybridizations, published protocols were used <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> with minor modifications. After staining, embryos were post-fixed in 4% PFA and stored at 4&#176;C until photographs were taken.</p>
         </sec>
         <sec>
            <st>
               <p>Morpholino Injection</p>
            </st>
            <p>The zebrafish strains used in this study were "Bad Nauheim (BNA)", which was originated from a local pet shop and was inbred for several generations, and "<it>Tg(myl7:EGFP-HRAS)</it><sup><it>s</it>883</sup>" <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Antisense morpholinos were purchased from BioCat (Clone ID MORPH 1769 for <it>zLrrc39</it>) and GeneTools (<it>zKlhl31</it>). For the morpholino experiments, 3 concentrations were tested (approximately 2&#8211;4 nl of morpholino diluted in sterile water/phenol red as an indicator per zebrafish egg): "1:2 dilution (high)", which corresponds to 8.5&#8211;17 ng of morpholino; "1:5 dilution (medium)", 3.4&#8211;6.8 ng; and "1:10 dilution (low)", 1.7&#8211;3.4 ng. For each injection, more than 50 eggs were injected. As a control, sterile water/phenol red without morpholino was used to observe effects caused by the injection.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>SU conceived the study, performed computational studies, contributed to the molecular genetics studies, and drafted the manuscript. AU conceived the study and contributed to the molecular genetics studies. MW carried out morpholino injections. BJ carried out morpholino injections and analyzed zebrafish phenotypes. PZ contributed to the molecular genetic studies. KJ analyzed zebrafish phenotypes. KgK and WS participated in the design of the study. TB participated in the design and coordination of the study and wrote the final version of the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Dr. Ren&#233; Zimmermann for establishing and maintaining the BNA zebrafish strain and Dr. Petra Uchida for critically reading the manuscript. This work was supported by the Max-Planck-Society, the DFG (Br1416), the EU Commisson (MYORES network of excellence), the Kerckhoff-Foundation and the Excellence Cluster Cardiopulmonary System. The authors declare that they have no conflicting commercial interests related to this work.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Progress towards petascale applications in biology: Status in 2006</p>
            </title>
            <aug>
               <au>
                  <snm>Stewart</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>M&#252;ller</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lingwall</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Euro-Par 2006: Parallel Processing</source>
            <publisher>Springer-verlag Berlin Heidelberg</publisher>
            <editor>Lehner W</editor>
            <pubdate>2007</pubdate>
            <volume>4375/2007</volume>
            <fpage>289</fpage>
            <lpage>303</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Gene expression omnibus: microarray data storage, submission, retrieval, and analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Barrett</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Edgar</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Methods Enzymol</source>
            <pubdate>2006</pubdate>
            <volume>411</volume>
            <fpage>352</fpage>
            <lpage>69</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1619900</pubid>
                  <pubid idtype="pmpid" link="fulltext">16939800</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Software packages for quantitative microarray-based gene expression analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Dresen</snm>
                  <fnm>IM</fnm>
               </au>
               <au>
                  <snm>H&#252;sing</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kruse</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Boes</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>J&#246;ckel</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>Curr Pharm Biotechnol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>6</issue>
            <fpage>417</fpage>
            <lpage>437</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14683435</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Resources for integrative systems biology: from data through databases to networks and dynamic system models</p>
            </title>
            <aug>
               <au>
                  <snm>Ng</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bursteinas</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gao</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Mollison</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Zvelebil</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Brief Bioinform</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>4</issue>
            <fpage>318</fpage>
            <lpage>330</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17040977</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Current bioinformatics tools in genomic biomedical research (Review)</p>
            </title>
            <aug>
               <au>
                  <snm>Teufel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Krupp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Weinmann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Galle</snm>
                  <fnm>PR</fnm>
               </au>
            </aug>
            <source>Int J Mol Med</source>
            <pubdate>2006</pubdate>
            <volume>17</volume>
            <issue>6</issue>
            <fpage>967</fpage>
            <lpage>973</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16685403</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>A model-based optimization framework for the inference of regulatory interactions using time-course DNA microarray expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Thomas</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Paredes</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Mehrotra</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hatzimanikatis</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Papoutsakis</snm>
                  <fnm>ET</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>228</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1940027</pubid>
                  <pubid idtype="pmpid" link="fulltext">17603872</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Systems Biology: A Brief Overview</p>
            </title>
            <aug>
               <au>
                  <snm>Kitano</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>295</volume>
            <issue>5560</issue>
            <fpage>1662</fpage>
            <lpage>1664</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11872829</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A new approach to decoding life: systems biology</p>
            </title>
            <aug>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Galitski</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hood</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Annu Rev Genomics Hum Genet</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <fpage>343</fpage>
            <lpage>372</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11701654</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>An enzyme-centric approach for modelling non-linear biological complexity</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>BMC Syst Biol</source>
            <pubdate>2008</pubdate>
            <volume>2</volume>
            <issue>1</issue>
            <fpage>70</fpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18671883</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Predicting gene function through systematic analysis and quality assessment of high-throughput data</p>
            </title>
            <aug>
               <au>
                  <snm>Kemmeren</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kockelkorn</snm>
                  <fnm>TT</fnm>
               </au>
               <au>
                  <snm>Bijma</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Donders</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Holstege</snm>
                  <fnm>FC</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>8</issue>
            <fpage>1644</fpage>
            <lpage>1652</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15531615</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>A critical assessment of Mus musculus gene function prediction using integrated genomic evidence</p>
            </title>
            <aug>
               <au>
                  <snm>Pe&#241;a-Castillo</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tasan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Joshi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Guan</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Leone</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pagnani</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>WK</fnm>
               </au>
               <au>
                  <snm>Krumpelman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tian</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Obozinski</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Qi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Mostafavi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>GN</fnm>
               </au>
               <au>
                  <snm>Berriz</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Gibbons</snm>
                  <fnm>FD</fnm>
               </au>
               <au>
                  <snm>Lanckriet</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Qiu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Grant</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Barutcuoglu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Warde-Farley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Grouios</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ray</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jordan</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>Noble</snm>
                  <fnm>WS</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Klein-Seetharaman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Troyanskaya</snm>
                  <fnm>OG</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hughes</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>FP</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <issue>Suppl 1</issue>
            <fpage>S2</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2447536</pubid>
                  <pubid idtype="pmpid" link="fulltext">18613946</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Interactome: gateway into systems biology</p>
            </title>
            <aug>
               <au>
                  <snm>Cusick</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Klitgord</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Vidal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DE</fnm>
               </au>
            </aug>
            <source>Hum Mol Genet</source>
            <pubdate>2005</pubdate>
            <volume>14</volume>
            <issue>Spec No 2</issue>
            <fpage>R171</fpage>
            <lpage>R181</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16162640</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Yeast two-hybrid contributions to interactome mapping</p>
            </title>
            <aug>
               <au>
                  <snm>Parrish</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Gulyas</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Finley</snm>
                  <fnm>RL</fnm>
                  <suf>Jr</suf>
               </au>
            </aug>
            <source>Curr Opin Biotechnol</source>
            <pubdate>2006</pubdate>
            <volume>17</volume>
            <issue>4</issue>
            <fpage>387</fpage>
            <lpage>393</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16806892</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>3' UTR seed matches, but not overall identity, are associated with RNAi off-targets</p>
            </title>
            <aug>
               <au>
                  <snm>Birmingham</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Reynolds</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ilsley-Tyree</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Leake</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Fedorov</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Baskerville</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Maksimova</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Karpilow</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>WS</fnm>
               </au>
               <au>
                  <snm>Khvorova</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat Methods</source>
            <pubdate>2006</pubdate>
            <volume>3</volume>
            <issue>3</issue>
            <fpage>199</fpage>
            <lpage>204</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16489337</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>A genome-wide transgenic RNAi library for conditional gene inactivation in Drosophila</p>
            </title>
            <aug>
               <au>
                  <snm>Dietzl</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Schnorrer</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Barinova</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Fellner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gasser</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kinsey</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Oppel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Scheiblauer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Couto</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Marra</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Keleman</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dickson</snm>
                  <fnm>BJ</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>448</volume>
            <issue>7150</issue>
            <fpage>151</fpage>
            <lpage>156</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17625558</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>A library of siRNA duplexes targeting the phosphoinositide 3-kinase pathway: determinants of gene silencing for use in cell-based screens</p>
            </title>
            <aug>
               <au>
                  <snm>Hsieh</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Bo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Manola</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vazquez</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Bare</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Khvorova</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Scaringe</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sellers</snm>
                  <fnm>WR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <volume>32</volume>
            <issue>3</issue>
            <fpage>893</fpage>
            <lpage>901</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">373385</pubid>
                  <pubid idtype="pmpid" link="fulltext">14769947</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>High-throughput two-hybrid analysis. The promise and the peril</p>
            </title>
            <aug>
               <au>
                  <snm>Fields</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>FEBS J</source>
            <pubdate>2005</pubdate>
            <volume>272</volume>
            <issue>21</issue>
            <fpage>5391</fpage>
            <lpage>5399</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16262681</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>High-content siRNA screening</p>
            </title>
            <aug>
               <au>
                  <snm>Krausz</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Mol Biosyst</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <issue>4</issue>
            <fpage>232</fpage>
            <lpage>240</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17372651</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Molecular and cellular approaches for the detection of protein-protein interactions: latest techniques and current limitations</p>
            </title>
            <aug>
               <au>
                  <snm>Lalonde</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ehrhardt</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Loqu&#233;</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rhee</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Frommer</snm>
                  <fnm>WBL</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2008</pubdate>
            <volume>53</volume>
            <issue>4</issue>
            <fpage>610</fpage>
            <lpage>635</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18269572</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>ChIP-chip: a genomic approach for identifying transcription factor binding sites</p>
            </title>
            <aug>
               <au>
                  <snm>Horak</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Methods Enzymol</source>
            <pubdate>2002</pubdate>
            <volume>350</volume>
            <fpage>469</fpage>
            <lpage>483</lpage>
            <xrefbib>
               <pubid idtype="pmpid">12073330</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing</p>
            </title>
            <aug>
               <au>
                  <snm>Robertson</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hirst</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bainbridge</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bilenky</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Euskirchen</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bernier</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Varhol</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Delaney</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Thiessen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Griffith</snm>
                  <fnm>OL</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Marra</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nat Methods</source>
            <pubdate>2007</pubdate>
            <volume>4</volume>
            <issue>8</issue>
            <fpage>651</fpage>
            <lpage>657</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17558387</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Differential gene expression profiling in genetic and multifactorial cardiovascular diseases</p>
            </title>
            <aug>
               <au>
                  <snm>Nanni</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Romualdi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Maseri</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lanfranchi</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>J Mol Cell Cardiol</source>
            <pubdate>2006</pubdate>
            <volume>41</volume>
            <issue>6</issue>
            <fpage>934</fpage>
            <lpage>948</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17020763</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Database resources of the National Center for Biotechnology</p>
            </title>
            <aug>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Federhen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lash</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Pontius</snm>
                  <fnm>JU</fnm>
               </au>
               <au>
                  <snm>Schuler</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Schriml</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Sequeira</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>1</issue>
            <fpage>28</fpage>
            <lpage>33</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165480</pubid>
                  <pubid idtype="pmpid" link="fulltext">12519941</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium</p>
            </title>
            <aug>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Dolinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dwight</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Eppig</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Issel-Tarver</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kasarskis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Matese</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Ringwald</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <issue>1</issue>
            <fpage>25</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10802651</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>HomoloGene</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/sites/entrez?db=homologene</url>
         </bibl>
         <bibl id="B26">
            <title>
               <p>A mutation in zebrafish hmgcr1b reveals a role for isoprenoids in vertebrate heart-tube formation</p>
            </title>
            <aug>
               <au>
                  <snm>D'Amico</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>IC</fnm>
               </au>
               <au>
                  <snm>Jungblut</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Stainier</snm>
                  <fnm>DY</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <issue>3</issue>
            <fpage>252</fpage>
            <lpage>259</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17276918</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Data mining for biomarker development: a review of tissue specificity analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Klee</snm>
                  <fnm>EW</fnm>
               </au>
            </aug>
            <source>Clin Lab Med</source>
            <pubdate>2008</pubdate>
            <volume>28</volume>
            <issue>1</issue>
            <fpage>127</fpage>
            <lpage>143</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18194723</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>In silico cloning of novel endothelial-specific genes</p>
            </title>
            <aug>
               <au>
                  <snm>Huminiecki</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bicknell</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <issue>11</issue>
            <fpage>1796</fpage>
            <lpage>1806</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">310981</pubid>
                  <pubid idtype="pmpid" link="fulltext">11076864</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Identification of novel epididymal genes by expression profiling and in silico gene discovery</p>
            </title>
            <aug>
               <au>
                  <snm>Jalkanen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shariatmadari</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Pujianto</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Sipil&#228;</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Huhtaniemi</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Poutanen</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Cell Endocrinol</source>
            <pubdate>2006</pubdate>
            <volume>250</volume>
            <issue>1&#8211;2</issue>
            <fpage>163</fpage>
            <lpage>168</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16413671</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Systematic identification and integrative analysis of novel genes expressed specifically or predominantly in mouse epididymis</p>
            </title>
            <aug>
               <au>
                  <snm>Oh</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Woo</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Choi</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Baek</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Cho</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>314</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1764739</pubid>
                  <pubid idtype="pmpid" link="fulltext">17166261</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Discovery in silico and characterization in vitro of novel genes exclusively expressed in the mouse epididymis</p>
            </title>
            <aug>
               <au>
                  <snm>Penttinen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pujianto</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Sipila</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Huhtaniemi</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Poutanen</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Endocrinol</source>
            <pubdate>2003</pubdate>
            <volume>17</volume>
            <issue>11</issue>
            <fpage>2138</fpage>
            <lpage>2151</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12920233</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Heart-specific genes revealed by expressed sequence tag (EST) sampling</p>
            </title>
            <aug>
               <au>
                  <snm>M&#233;gy</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Audic</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Claverie</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <issue>12</issue>
            <fpage>RESEARCH0074</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">151176</pubid>
                  <pubid idtype="pmpid" link="fulltext">12537563</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Comprehensive identification and characterization of novel cardiac genes in mouse</p>
            </title>
            <aug>
               <au>
                  <snm>Park</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Hong</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Oh</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Choi</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Han Kim</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cho</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Mol Cell Cardiol</source>
            <pubdate>2007</pubdate>
            <volume>43</volume>
            <issue>2</issue>
            <fpage>93</fpage>
            <lpage>106</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17599348</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Identifying tissue-enriched gene expression in mouse tissues using the NIH UniGene database</p>
            </title>
            <aug>
               <au>
                  <snm>Stanton</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Macgregor</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Appl Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>2</volume>
            <issue>3 Suppl</issue>
            <fpage>S65</fpage>
            <lpage>73</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15130819</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Activation of cardiac gene expression by myocardin, a transcriptional cofactor for serum response factor</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Sutherland</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Small</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Krieg</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Olson</snm>
                  <fnm>EN</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2001</pubdate>
            <volume>105</volume>
            <issue>7</issue>
            <fpage>851</fpage>
            <lpage>862</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11439182</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Meta-analysis of gene expression in mouse preimplantation embryo development</p>
            </title>
            <aug>
               <au>
                  <snm>Stanton</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Mol Hum Reprod</source>
            <pubdate>2001</pubdate>
            <volume>7</volume>
            <issue>6</issue>
            <fpage>545</fpage>
            <lpage>552</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11385109</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Building comparative gene expression databases for the mouse preimplantation embryo using a pipeline approach to UniGene</p>
            </title>
            <aug>
               <au>
                  <snm>Stanton</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Macgregor</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Mason</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Dameh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Mol Hum Reprod</source>
            <pubdate>2007</pubdate>
            <volume>13</volume>
            <issue>10</issue>
            <fpage>713</fpage>
            <lpage>720</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17804433</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Discovery of three genes specifically expressed in human prostate by expressed sequence tag database analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Vasmatzis</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Essand</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brinkmann</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Pastan</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <issue>1</issue>
            <fpage>300</fpage>
            <lpage>304</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">18207</pubid>
                  <pubid idtype="pmpid" link="fulltext">9419370</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>The human adult skeletal muscle transcriptional profile reconstructed by a novel computational approach</p>
            </title>
            <aug>
               <au>
                  <snm>Bortoluzzi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>d'Alessi</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Romualdi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Danieli</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <issue>3</issue>
            <fpage>344</fpage>
            <lpage>349</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">311426</pubid>
                  <pubid idtype="pmpid" link="fulltext">10720575</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Integrative characterization of germ cell-specific genes from mouse spermatocyte UniGene library</p>
            </title>
            <aug>
               <au>
                  <snm>Choi</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Oh</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Yi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kim do</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cho</snm>
                  <fnm>BN</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Cho</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>256</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1955454</pubid>
                  <pubid idtype="pmpid" link="fulltext">17662146</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>A rapid bioinformatic method identifies novel genes with direct clinical relevance to colon cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Brett</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kemmner</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Koch</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Roefzaad</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gross</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Schlag</snm>
                  <fnm>PM</fnm>
               </au>
            </aug>
            <source>Oncogene</source>
            <pubdate>2001</pubdate>
            <volume>20</volume>
            <issue>33</issue>
            <fpage>4581</fpage>
            <lpage>4585</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11494154</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Prediction of gene function by genome-scale expression analysis: prostate cancer-associated genes</p>
            </title>
            <aug>
               <au>
                  <snm>Walker</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Volkmuth</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Sprinzak</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hodgson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Klingler</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <issue>12</issue>
            <fpage>1198</fpage>
            <lpage>1203</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">310991</pubid>
                  <pubid idtype="pmpid" link="fulltext">10613842</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>dbEST &#8211; database for "expressed sequence tags"</p>
            </title>
            <aug>
               <au>
                  <snm>Boguski</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Lowe</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Tolstoshev</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1993</pubdate>
            <volume>4</volume>
            <issue>4</issue>
            <fpage>332</fpage>
            <lpage>333</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8401577</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>T-STAG: resource and web-interface for tissue-specific transcripts and genes</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Vingron</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Haas</snm>
                  <fnm>SA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <issue>33 Web Server</issue>
            <fpage>W654</fpage>
            <lpage>658</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1160111</pubid>
                  <pubid idtype="pmpid" link="fulltext">15980556</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>TissueInfo: high-throughput identification of tissue expression profiles and specificity</p>
            </title>
            <aug>
               <au>
                  <snm>Skrabanek</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Campagne</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <issue>21</issue>
            <fpage>E102</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">60201</pubid>
                  <pubid idtype="pmpid" link="fulltext">11691939</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>GEPIS &#8211; quantitative gene expression profiling in normal and cancer tissues</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Eberhard</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Frantz</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Dowd</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Luoh</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Polakis</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hillan</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>WI</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>15</issue>
            <fpage>2390</fpage>
            <lpage>2398</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15073007</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>GeneHub-GEPIS: digital expression profiling for normal and cancer tissues based on an integrated gene database</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Luoh</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Hon</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Baertsch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>WI</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <issue>35 Web Server</issue>
            <fpage>W152</fpage>
            <lpage>158</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1933245</pubid>
                  <pubid idtype="pmpid" link="fulltext">17545196</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>TissueDistributionDBs</p>
            </title>
            <url>http://genome.dkfz-heidelberg.de/menu/tissue_db/index.html</url>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Zebrafish Model Organisms Database</p>
            </title>
            <url>http://zfin.org/cgi-bin/webdriver?MIval=aa-ZDB_home.apg</url>
         </bibl>
         <bibl id="B50">
            <title>
               <p>AmiGO</p>
            </title>
            <url>http://www.geneontology.org/amigo/help-front.shtml</url>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Whole mount in situ hybridization of vertebrate embryos</p>
            </title>
            <aug>
               <au>
                  <snm>Wilkinson</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>In situ hybridization: A practical approach</source>
            <publisher>Oxford University Press, Oxford</publisher>
            <editor>Wilkinson DG</editor>
            <pubdate>1992</pubdate>
            <fpage>75</fpage>
            <lpage>84</lpage>
         </bibl>
      </refgrp>
   </bm>
</art>
