<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1741-7007-6-49</ui>
   <ji>1741-7007</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>A comprehensive functional analysis of tissue specificity of human gene expression</p>
         </title>
         <aug>
            <au id="A1" ca="yes" ce="yes">
               <snm>Dezs&#337;</snm>
               <fnm>Zolt&#225;n</fnm>
               <insr iid="I1"/>
               <email>zoltan@genego.com</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Nikolsky</snm>
               <fnm>Yuri</fnm>
               <insr iid="I1"/>
               <email>yuri@genego.com</email>
            </au>
            <au id="A3">
               <snm>Sviridov</snm>
               <fnm>Evgeny</fnm>
               <insr iid="I2"/>
               <email>sviridov@genego.com</email>
            </au>
            <au id="A4">
               <snm>Shi</snm>
               <fnm>Weiwei</fnm>
               <insr iid="I1"/>
               <email>weiwei@genego.com</email>
            </au>
            <au id="A5">
               <snm>Serebriyskaya</snm>
               <fnm>Tatiana</fnm>
               <insr iid="I2"/>
               <email>malus@genego.com</email>
            </au>
            <au id="A6">
               <snm>Dosymbekov</snm>
               <fnm>Damir</fnm>
               <insr iid="I2"/>
               <email>dosymbekov@genego.com</email>
            </au>
            <au id="A7">
               <snm>Bugrim</snm>
               <fnm>Andrej</fnm>
               <insr iid="I1"/>
               <email>andrej@genego.com</email>
            </au>
            <au id="A8">
               <snm>Rakhmatulin</snm>
               <fnm>Eugene</fnm>
               <insr iid="I1"/>
               <email>eugr@genego.com</email>
            </au>
            <au id="A9">
               <snm>Brennan</snm>
               <mi>J</mi>
               <fnm>Richard</fnm>
               <insr iid="I1"/>
               <email>richard@genego.com</email>
            </au>
            <au id="A10">
               <snm>Guryanov</snm>
               <fnm>Alexey</fnm>
               <insr iid="I2"/>
               <email>guryanov@genego.com</email>
            </au>
            <au id="A11">
               <snm>Li</snm>
               <fnm>Kelly</fnm>
               <insr iid="I3"/>
               <email>Kelly.Li@appliedbiosystems.com</email>
            </au>
            <au id="A12">
               <snm>Blake</snm>
               <fnm>Julie</fnm>
               <insr iid="I3"/>
               <email>blakejj@appliedbiosystems.com</email>
            </au>
            <au id="A13">
               <snm>Samaha</snm>
               <mi>R</mi>
               <fnm>Raymond</fnm>
               <insr iid="I3"/>
               <email>Raymond.Samaha@appliedbiosystems.com</email>
            </au>
            <au id="A14">
               <snm>Nikolskaya</snm>
               <fnm>Tatiana</fnm>
               <insr iid="I2"/>
               <email>tatiana@genego.com</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>GeneGo, Inc. Renaissance Drive, St. Joseph, MI 49085, USA</p>
            </ins>
            <ins id="I2">
               <p>Vavilov Institute of General Genetics, Russian Academy of Sciences, Gubkina Str, Moscow, Russia</p>
            </ins>
            <ins id="I3">
               <p>Applied Biosystems, Inc, Lincoln Center Drive, Foster City, CA 94404, USA</p>
            </ins>
         </insg>
         <source>BMC Biology</source>
         <issn>1741-7007</issn>
         <pubdate>2008</pubdate>
         <volume>6</volume>
         <issue>1</issue>
         <fpage>49</fpage>
         <url>http://www.biomedcentral.com/1741-7007/6/49</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19014478</pubid>
               <pubid idtype="doi">10.1186/1741-7007-6-49</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>29</day>
               <month>10</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>12</day>
               <month>11</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>12</day>
               <month>11</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Dezs&#337; et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>In recent years, the maturation of microarray technology has allowed the genome-wide analysis of gene expression patterns to identify tissue-specific and ubiquitously expressed ('housekeeping') genes. We have performed a functional and topological analysis of housekeeping and tissue-specific networks to identify universally necessary biological processes, and those unique to or characteristic of particular tissues.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We measured whole genome expression in 31 human tissues, identifying 2374 housekeeping genes expressed in all tissues, and genes uniquely expressed in each tissue. Comprehensive functional analysis showed that the housekeeping set is substantially larger than previously thought, and is enriched with vital processes such as oxidative phosphorylation, ubiquitin-dependent proteolysis, translation and energy metabolism. Network topology of the housekeeping network was characterized by higher connectivity and shorter paths between the proteins than the global network. Ontology enrichment scoring and network topology of tissue-specific genes were consistent with each tissue's function and expression patterns clustered together in accordance with tissue origin. Tissue-specific genes were twice as likely as housekeeping genes to be drug targets, allowing the identification of tissue 'signature networks' that will facilitate the discovery of new therapeutic targets and biomarkers of tissue-targeted diseases.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>A comprehensive functional analysis of housekeeping and tissue-specific genes showed that the biological function of housekeeping and tissue-specific genes was consistent with tissue origin. Network analysis revealed that tissue-specific networks have distinct network properties related to each tissue's function. Tissue 'signature networks' promise to be a rich source of targets and biomarkers for disease treatment and diagnosis.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The issue of tissue and cell-type specificity of gene expression is central to human biology and biomedicine. It impacts such fundamental problems as tissue ontogenesis, evolution and carcinogenesis. It is generally believed that the most relevant disease biomarkers and drug targets predominantly are found among proteins specific for the tissue the disease affects. In 1965 Watson et al. described genes universally expressed to maintain cellular functions as 'housekeeping' (HK) genes <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Comprehensive experimentation in this area was, however, limited until, in the 1990s, microarrays enabled 'genome-wide' snapshots of gene expression. Numerous studies on tissue-specific expression have since been published <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>, identifying between 451 and 1789 genes <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> as a HK core on different microarray platforms. There is no standard method for assigning a gene as HK, and no comprehensive functional analysis of HK and tissue-specific genes has previously been done.</p>
         <p>Here we report the identification of 2374 HK genes, based on a novel study of gene expression in 31 human tissues on a whole-genome ABI array. We set up a definitive 'HK baseline' for gene expression across tissues, and compared previously published HK data sets with ours. We also identified gene sets uniquely expressed in individual tissues and groups of tissues, and generated tissue-specific merged metabolic/signaling 'signature networks'. Both HK and tissue-specific genes were subjected to a comprehensive, three-phase functional analysis, including gene set enrichment across four ontologies, network topology analysis and tissue-specific network analysis. We clustered tissues into groups according to expression patterns and network parameters, and revealed associations between HK and tissue-specific genes with human diseases and drug targets.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Definition of housekeeping and tissue-specific genes</p>
            </st>
            <p>Whole-genome gene expression was analyzed in 31 human tissues using an ABI human genome array with probe sets for 27,868 individual human genes. A relatively conservative ten-fold 'signal-to-noise' ratio (SN10) across all three replicate arrays for each tissue was applied to identify the transcripts present in all tissues (HK set, see Methods). The SN10 HK set comprised 2374 genes (Additional file <supplr sid="S1">1</supplr>). We compared this set with four sets of HK genes published previously: a set of 535 genes from 11 tissues on a Affymetrix HuGeneFL array <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>; 1789 genes from 79 tissues on custom 33,698 gene array <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>; 574 genes from 47 tissues on a 7500 gene Affymtrix U95A array <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> and 451 genes from 19 tissues on Affymetrix HuGeneFL arrays <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Our HK set overlapped with between 42 and 82% of these sets, although only 97 genes were common between the four larger sets. Furthermore, there was 80% overlap between our set and the intersection of the previous studies (Figure <figr fid="F1">1</figr>). Our set contained 1419 additional HK genes not previously identified.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>
                     <b>List of all housekeeping genes</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S1.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Comparison between housekeeping gene sets</p>
               </caption>
               <text>
                  <p><b>Comparison between housekeeping gene sets</b>. The intersections between Tu et al. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, Warrington et al. <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, Eisenberg and Levanon <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> and our set are shown. The smallest of the four previously published sets <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> was not included in the figure for clarity. The numbers in parentheses are the number of unique genes that overlap between the sets located at the opposite sides.</p>
               </text>
               <graphic file="1741-7007-6-49-1"/>
            </fig>
            <p>Genes uniquely expressed in only one of the 31 tissues (at the SN10 cut-off) were defined as tissue specific. Tissue-specific sets varied in size from four genes for thymus to 484 genes for testis, with an average size of 43.8 genes, or 30.9 genes for somatic tissues only (excluding testis) (Table <tblr tid="T1">1</tblr>, Additional file <supplr sid="S2">2</supplr>). The gene set uniquely expressed in testis was substantially larger than those for somatic tissues, and highly enriched in meiosis-specific genes, confirming recent findings <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Around 40% of a recently published human testis specific gene set <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> overlapped with our SN10 set.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Number of housekeeping and tissue-specific genes</p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Tissues</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>SN10</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Housekeeping</p>
                     </c>
                     <c ca="right">
                        <p>2374</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Liver</p>
                     </c>
                     <c ca="right">
                        <p>22</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Skeletal muscle</p>
                     </c>
                     <c ca="right">
                        <p>37</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fetal liver</p>
                     </c>
                     <c ca="right">
                        <p>16</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Testis</p>
                     </c>
                     <c ca="right">
                        <p>484</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Placenta</p>
                     </c>
                     <c ca="right">
                        <p>38</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Bone marrow</p>
                     </c>
                     <c ca="right">
                        <p>63</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Skin</p>
                     </c>
                     <c ca="right">
                        <p>75</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Adrenal gland</p>
                     </c>
                     <c ca="right">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Prostate</p>
                     </c>
                     <c ca="right">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Trachea</p>
                     </c>
                     <c ca="right">
                        <p>16</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Small intestine</p>
                     </c>
                     <c ca="right">
                        <p>35</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Peripheral blood lymphocytes</p>
                     </c>
                     <c ca="right">
                        <p>49</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mammary gland</p>
                     </c>
                     <c ca="right">
                        <p>16</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Tonsil</p>
                     </c>
                     <c ca="right">
                        <p>24</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Thymus</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Spleen</p>
                     </c>
                     <c ca="right">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fetal kidney</p>
                     </c>
                     <c ca="right">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Thyroid</p>
                     </c>
                     <c ca="right">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Brain</p>
                     </c>
                     <c ca="right">
                        <p>34</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Heart</p>
                     </c>
                     <c ca="right">
                        <p>26</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Lung</p>
                     </c>
                     <c ca="right">
                        <p>16</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Salivary gland</p>
                     </c>
                     <c ca="right">
                        <p>17</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ovary</p>
                     </c>
                     <c ca="right">
                        <p>15</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pancreas</p>
                     </c>
                     <c ca="right">
                        <p>20</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fetal thymus</p>
                     </c>
                     <c ca="right">
                        <p>8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Colon</p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Spinal cord</p>
                     </c>
                     <c ca="right">
                        <p>24</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Retina</p>
                     </c>
                     <c ca="right">
                        <p>190</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Kidney</p>
                     </c>
                     <c ca="right">
                        <p>17</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Uterus</p>
                     </c>
                     <c ca="right">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fetal brain</p>
                     </c>
                     <c ca="right">
                        <p>61</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Average</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>43.8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Average, somatic tissues</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>30.9</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>
                     <b>List of all tissue-specific genes</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S2.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>An alternative method, using the Student's <it>t</it>-test, was also applied to identify tissue-specific genes. Functional analysis showed striking similarities in the biological processes encompassed in the genes specific to each tissue between the <it>t</it>-test and the threshold-based methods (description of method and results in Additional file <supplr sid="S3">3</supplr>).</p>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p>
                     <b>Table for intersection between genes, ontologies</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S3.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Ontological analysis of housekeeping and tissue-specific genes</p>
            </st>
            <p>We conducted enrichment analysis of HK and tissue-specific sets across four functional ontologies: canonical pathway maps, gene ontology (GO) processes, GeneGo (GG) process networks, and diseases, using the hypergeometric distribution <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and Gene Set Enrichment Analysis (GSEA) algorithms <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> to determine enrichment. The hypergeometric <it>p</it>-values for the ten top-scored entities in each ontology for the HK set are shown in Table <tblr tid="T2">2</tblr> and Additional file <supplr sid="S4">4</supplr>. 'Oxidative phosphorylation' was the highest scored canonical pathway map (<it>p </it>&lt; 10&#8211;77). This vital, ATP-yielding pathway is the terminal part of cellular respiration, in which electrons are transferred through the electron transport chain (ETC). Almost all subunits of the ETC protein complexes were expressed in the HK set (Figure <figr fid="F2">2(I)</figr>). The second-highest scored map depicts another essential process in the mitochondrial respiratory chain, ubiquinone metbolism (<it>p </it>&lt; 10&#8211;40). Ubiquitin-proteasomal proteolysis was among the highest scored GG process networks (<it>p </it>&lt; 10&#8211;26), with virtually all essential proteins included in the HK gene set (Figure <figr fid="F2">2(II)</figr>). GG process ontology is a set of 120 networks covering most critical cellular processes. Each GG process comprises 100 to 350 functionally related proteins. Unlike GO terms, where proteins are not necessarily connected, the proteins in GG processes are linked via physical interactions, and can be visualized as networks. They therefore represent the signal flow and metabolic flux within each process. Translation initiation was another high-scored GG process (Figure <figr fid="F2">2(III)</figr>), with many translation initiation factors represented in the HK set. In GO biological processes, cellular metabolism, translation and RNA processing were among the top ten scoring categories. Neoplasm by site (<it>p </it>&lt; 10&#8211;16) and breast neoplasm (<it>p </it>&lt; 10&#8211;14) were the top-scored diseases (Table <tblr tid="T2">2</tblr>). Additionally, enrichment analysis of canonical pathway maps showed that nine out of the top ten maps are directly connected to 'growth' and 'viability' (Additional file <supplr sid="S5">5</supplr>).</p>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p>
                     <b>Complete enrichment analysis for all housekeeping sets</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S4.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Enrichment analysis of housekeeping and three selected tissue-specific gene sets. </p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Housekeeping</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Retina</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Testis</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Adrenal Gland</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Canonical pathways maps</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Oxidative phosphorylation</p>
                     </c>
                     <c ca="left">
                        <p>Visual perception</p>
                     </c>
                     <c ca="left">
                        <p>CREM signaling in testis</p>
                     </c>
                     <c ca="left">
                        <p>Tyrosine metabolism (dopamine)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Ubiquinone metabolism</p>
                     </c>
                     <c ca="left">
                        <p>CREB pathway</p>
                     </c>
                     <c ca="left">
                        <p>Glycolysis and gluconeogenesis</p>
                     </c>
                     <c ca="left">
                        <p>Catecholamine metabolism</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Cytoskeleton remodeling</p>
                     </c>
                     <c ca="left">
                        <p>TGF, WNT and cytoskeletal remodeling</p>
                     </c>
                     <c ca="left">
                        <p>ATM/ATR regulation of G2/M checkpoint</p>
                     </c>
                     <c ca="left">
                        <p>Estrogen biosynthesis</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Formation of Sin3A and NuRD complexes and their role in transcription regulation</p>
                     </c>
                     <c ca="left">
                        <p>Regulation of CDK5 in CNS</p>
                     </c>
                     <c ca="left">
                        <p>Role of APC in cell cycle regulation</p>
                     </c>
                     <c ca="left">
                        <p>Cortisone biosynthesis and metabolism</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>TGF, WNT and cytoskeleton remodeling</p>
                     </c>
                     <c ca="left">
                        <p>Role of Nek in cell cycle regulation</p>
                     </c>
                     <c ca="left">
                        <p>Rap1A regulation pathway</p>
                     </c>
                     <c ca="left">
                        <p>Androstenedione and testosterone biosynthesis and metabolism</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Regulation activity of EIF4F</p>
                     </c>
                     <c ca="left">
                        <p>Plasmin signaling</p>
                     </c>
                     <c ca="left">
                        <p>Androgen receptor nuclear signaling</p>
                     </c>
                     <c ca="left">
                        <p>Non-genomic (rapid) action of androgen receptor</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>WNT signaling pathway</p>
                     </c>
                     <c ca="left">
                        <p>Receptor-mediated HIF regulation</p>
                     </c>
                     <c ca="left">
                        <p>Insulin regulation of fatty acid metabolism</p>
                     </c>
                     <c ca="left">
                        <p>Androgen receptor nuclear signaling</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Role of tetraspanins in the integrin-mediated cell adhesion</p>
                     </c>
                     <c ca="left">
                        <p>Sphingolipid metabolism</p>
                     </c>
                     <c ca="left">
                        <p>Rap1B regulation pathway</p>
                     </c>
                     <c ca="left">
                        <p>WNT signaling pathway</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>BAD phosphorylation</p>
                     </c>
                     <c ca="left">
                        <p>Retinol metabolism</p>
                     </c>
                     <c ca="left">
                        <p>Nucleocytoplasmic transport of CDK/Cyclins</p>
                     </c>
                     <c ca="left">
                        <p>TGF, WNT and cytoskeletal remodeling</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>TCA</p>
                     </c>
                     <c ca="left">
                        <p>Regulation activity of EIF2</p>
                     </c>
                     <c ca="left">
                        <p>Estrogen biosynthesis</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>GeneGo processes</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Translation_Translation initiation</p>
                     </c>
                     <c ca="left">
                        <p>Signal transduction_Visual perception</p>
                     </c>
                     <c ca="left">
                        <p>Reproduction_Spermatogenesis</p>
                     </c>
                     <c ca="left">
                        <p>Transport_Potassium transport</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Translation_Elongation-Termination</p>
                     </c>
                     <c ca="left">
                        <p>Transmission of nerve impulse</p>
                     </c>
                     <c ca="left">
                        <p>Cell cycle_Meiosis</p>
                     </c>
                     <c ca="left">
                        <p>Cell adhesion cadherins</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Transcription_mRNA processing</p>
                     </c>
                     <c ca="left">
                        <p>Reproduction_GnRH signaling pathways</p>
                     </c>
                     <c ca="left">
                        <p>Signal transduction_CREM pathway</p>
                     </c>
                     <c ca="left">
                        <p>Development neurogenesis in general</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Proteolysis_Ubiquitin-proteasomal proteolysis</p>
                     </c>
                     <c ca="left">
                        <p>Development_Neurogenesis in general</p>
                     </c>
                     <c ca="left">
                        <p>Reproduction_Male sex differentiation</p>
                     </c>
                     <c ca="left">
                        <p>Signal transduction NOTCH signaling</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Translation_Regulation of initiation</p>
                     </c>
                     <c ca="left">
                        <p>Transport_Calcium transport</p>
                     </c>
                     <c ca="left">
                        <p>Signal transduction_Androgen receptor nuclear signaling</p>
                     </c>
                     <c ca="left">
                        <p>Signal transduction_Androgen receptor signaling cross talk</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Protein folding_Folding in normal condition</p>
                     </c>
                     <c ca="left">
                        <p>Development_Neurogenesis- synaptogenesis</p>
                     </c>
                     <c ca="left">
                        <p>Cell cycle_Mitosis</p>
                     </c>
                     <c ca="left">
                        <p>Signal transduction_CREM pathway</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Immune_Phagosome in antigen presentation</p>
                     </c>
                     <c ca="left">
                        <p>Reproduction_Progesterone signaling</p>
                     </c>
                     <c ca="left">
                        <p>Proteolysis_Ubiquitin-proteasomal proteolysis</p>
                     </c>
                     <c ca="left">
                        <p>Signal transduction_Neuropeptides signaling pathway</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Protein folding_Protein folding nucleus</p>
                     </c>
                     <c ca="left">
                        <p>Cell adhesion_Integrin priming</p>
                     </c>
                     <c ca="left">
                        <p>Cell cycle_Core</p>
                     </c>
                     <c ca="left">
                        <p>Development ossification and bone remodeling</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Protein folding_Response to unfolded protein</p>
                     </c>
                     <c ca="left">
                        <p>Translation_Regulation of initiation</p>
                     </c>
                     <c ca="left">
                        <p>Signal transduction_Visual perception</p>
                     </c>
                     <c ca="left">
                        <p>Muscle contraction</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Protein folding_ER and cytoplasm</p>
                     </c>
                     <c ca="left">
                        <p>Reproduction_Male sex differentiation</p>
                     </c>
                     <c ca="left">
                        <p>Cell cycle_G2-M</p>
                     </c>
                     <c ca="left">
                        <p>Cell adhesion amyloid proteins</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>GO processes</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Metabolic process</p>
                     </c>
                     <c ca="left">
                        <p>Sensory perception of light stimulus</p>
                     </c>
                     <c ca="left">
                        <p>Sexual reproduction</p>
                     </c>
                     <c ca="left">
                        <p>Feeding behavior</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Cellular metabolic process</p>
                     </c>
                     <c ca="left">
                        <p>Visual perception</p>
                     </c>
                     <c ca="left">
                        <p>Reproduction</p>
                     </c>
                     <c ca="left">
                        <p>Smooth muscle differentiation</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Translation</p>
                     </c>
                     <c ca="left">
                        <p>Sensory perception</p>
                     </c>
                     <c ca="left">
                        <p>Spermatogenesis</p>
                     </c>
                     <c ca="left">
                        <p>Oocyte development</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Macromolecule metabolic process</p>
                     </c>
                     <c ca="left">
                        <p>Neurological process</p>
                     </c>
                     <c ca="left">
                        <p>Male gamete generation</p>
                     </c>
                     <c ca="left">
                        <p>Oogenesis</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Primary metabolic process</p>
                     </c>
                     <c ca="left">
                        <p>Detection of visible light</p>
                     </c>
                     <c ca="left">
                        <p>Gametogenesis</p>
                     </c>
                     <c ca="left">
                        <p>Male gonad development</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Macromolecule biosynthetic process</p>
                     </c>
                     <c ca="left">
                        <p>Detection of light stimulus</p>
                     </c>
                     <c ca="left">
                        <p>Fertilization</p>
                     </c>
                     <c ca="left">
                        <p>Embryonic epithelial tube formation</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Cellular protein metabolic process</p>
                     </c>
                     <c ca="left">
                        <p>Detection of abiotic stimulus</p>
                     </c>
                     <c ca="left">
                        <p>Sperm motility</p>
                     </c>
                     <c ca="left">
                        <p>Development of primary male sexual characteristic</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Cellular macromolecule metabolic process</p>
                     </c>
                     <c ca="left">
                        <p>Phototransduction</p>
                     </c>
                     <c ca="left">
                        <p>Spermatid development</p>
                     </c>
                     <c ca="left">
                        <p>Germ cell development</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Protein metabolic process</p>
                     </c>
                     <c ca="left">
                        <p>Detection of light stimulus during visual perception</p>
                     </c>
                     <c ca="left">
                        <p>Spermatid differentiation</p>
                     </c>
                     <c ca="left">
                        <p>Cell-cell signaling</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Intracellular transport</p>
                     </c>
                     <c ca="left">
                        <p>Detection of light stimulus during sensory perception</p>
                     </c>
                     <c ca="left">
                        <p>Development of primary sexual characteristics</p>
                     </c>
                     <c ca="left">
                        <p>Dopamine biosynthetic process from tyrosine</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Diseases</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Neoplasm by site</p>
                     </c>
                     <c ca="left">
                        <p>Vision disorder</p>
                     </c>
                     <c ca="left">
                        <p>Infertility</p>
                     </c>
                     <c ca="left">
                        <p>Diabetes insipidus</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Breast neoplasms</p>
                     </c>
                     <c ca="left">
                        <p>Eye diseases</p>
                     </c>
                     <c ca="left">
                        <p>Infertility, male</p>
                     </c>
                     <c ca="left">
                        <p>Hypopituitarism</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Breast disease</p>
                     </c>
                     <c ca="left">
                        <p>Retinal degeneration</p>
                     </c>
                     <c ca="left">
                        <p>Dyskeratosis congenita</p>
                     </c>
                     <c ca="left">
                        <p>Adrenal gland disease</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Genetic disease, inborn</p>
                     </c>
                     <c ca="left">
                        <p>Sensation disorders</p>
                     </c>
                     <c ca="left">
                        <p>Dysautonomia, familial</p>
                     </c>
                     <c ca="left">
                        <p>Adrenal cortex disease</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Digestive systems neoplasm</p>
                     </c>
                     <c ca="left">
                        <p>Night blindness</p>
                     </c>
                     <c ca="left">
                        <p>Ciliary motility disorders</p>
                     </c>
                     <c ca="left">
                        <p>Adrenal cortex neoplasm</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Poxviridae infections</p>
                     </c>
                     <c ca="left">
                        <p>Retinitis pigmentosa</p>
                     </c>
                     <c ca="left">
                        <p>Kartagener Syndrome</p>
                     </c>
                     <c ca="left">
                        <p>Adrenal gland neoplasm</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Lysosomal storage disease</p>
                     </c>
                     <c ca="left">
                        <p>Eye diseases, hereditary</p>
                     </c>
                     <c ca="left">
                        <p>Dextrocardia</p>
                     </c>
                     <c ca="left">
                        <p>Carcinoma, basal cell</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Mental retardation</p>
                     </c>
                     <c ca="left">
                        <p>Retinal diseases</p>
                     </c>
                     <c ca="left">
                        <p>Herpes zoster</p>
                     </c>
                     <c ca="left">
                        <p>Neoplasm, basal cell</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Aneuploidy</p>
                     </c>
                     <c ca="left">
                        <p>Blindness</p>
                     </c>
                     <c ca="left">
                        <p>Bronchiectasis</p>
                     </c>
                     <c ca="left">
                        <p>Myokymia</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Vaccinia</p>
                     </c>
                     <c ca="left">
                        <p>Retinitis</p>
                     </c>
                     <c ca="left">
                        <p>Autonomic nervous system diseases</p>
                     </c>
                     <c ca="left">
                        <p>Pathologic processes</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Gene set enrichment with canonical maps, GeneGo processes, gene ontology processes and diseases was performed.</p>
               </tblfn>
            </tbl>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Top-scored pathways maps and process networks for housekeeping proteins</p>
               </caption>
               <text>
                  <p><b>Top-scored pathways maps and process networks for housekeeping proteins</b>. (I) Oxidative phosphorylation map. The subunits of the complexes are grouped in blue boxes; the red 'thermometer' histograms mark housekeeping (HK) genes. (II) Network for ubiquitin-mediated protein degradation in proteosome. HK genes are marked with solid red circles. The most important network components functionally, identified as HK genes, are marked with circles: ubiquitins and small ubiquitin-like modifiers (Ubiquitin, Ubiquitin C, Sumo-1, Sumo-2 are encircled in red); the ubiquitin-activating enzyme (UBE1 is encircled in black); ubiquitin-conjugating enzymes (UBCH8, UBE2D2, UBE2D3, UBC13, UBCH7 are encircled in blue); proteins that may act as ubiquitin protein ligases E3 (DTX2, DTX3, Rnf14, Rnf103 are encircled in green); the chaperons (HSP70 and HSP90 encircled in orange) and proteasomal subunits (26S proteasome, 26S proteasome, immunoproteasome (11S regulator) are encircled in brown). (III) Network for the GeneGo process translation initiation. HK genes are marked with solid red circles and translation initiation factors are encircled in red.</p>
               </text>
               <graphic file="1741-7007-6-49-2"/>
            </fig>
            <suppl id="S5">
               <title>
                  <p>Additional file 5</p>
               </title>
               <text>
                  <p>
                     <b>Canonical pathway maps and GeneGo networks essential for growth and viability</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S5.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>Using the GSEA procedure <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> across the same ontologies produced similar distributions (data not shown). Importantly, HK sets from the previously published studies were generally enriched in similar categories to our own HK set across all four ontologies. For example, oxidative phosphorylation, cytoskeleton remodeling, translation and macromolecule biosynthetic process were among the top-scoring maps and processes for all HK sets (Additional file <supplr sid="S4">4</supplr>). Functional analysis of the unique parts of HK sets showed differences among top-scoring processes and maps. Our unique (not yet identified) HK genes showed enrichment with metabolic, translation and cell-cycle processes, validating the part of the HK set which was not identified by previous studies. Unique parts of some of the other sets, however, showed enrichment of more specific processes such as immune, inflammatory and cardiac-specific processes (Additional file <supplr sid="S6">6</supplr>).</p>
            <suppl id="S6">
               <title>
                  <p>Additional file 6</p>
               </title>
               <text>
                  <p>
                     <b>Enrichment analysis of the unique parts of the housekeeping gene sets</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S6.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>All 31 tissue-specific gene lists were subjected to the same enrichment procedures as the HK set (Additional file <supplr sid="S7">7</supplr>). In most cases, <it>p</it>-value distributions of top-scored pathways, processes and diseases were strikingly consistent with the tissue of origin. For instance, 190 retina-specific genes were enriched for eye-specific categories in all four ontologies (Table <tblr tid="T2">2</tblr>). Visual perception (Figure <figr fid="F3">3(I)</figr>) and retinoid metabolism, both highly specific for retina tissue, were among the top ten scored pathways maps. The top-scored GG process network was Signal transduction_Visual perception (Table <tblr tid="T2">2</tblr>). All ten top-scored GO processes were related to vision, including sensory perception of light stimuli, visual perception and detection of visible light. All ten top-scored diseases were eye diseases, including retinal degradation, night blindness and retinitis pigmentosa. Adrenal gland genes were enriched in disease categories including diabetes insipidus, adrenal gland diseases and adrenal cortex diseases (Table <tblr tid="T2">2</tblr>). The testis-specific set was highly enriched in cell cycle (meiosis), DNA exchange and cell division in all three process ontologies (canonical pathways, GO processes and GG processes). The top-scored diseases included male infertility, dyskeratosis congenita and familial dysautonomia (Table <tblr tid="T2">2</tblr>). Brain-specific genes yielded GO processes such as transmission of nerve impulse and synaptic transmission. Disease enrichment for the brain set included epilepsy, seizures and mental disorders (Additional file <supplr sid="S7">7</supplr>). Using the GSEA procedure <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> across the same ontologies produced similar distributions (Additional file <supplr sid="S8">8</supplr>).</p>
            <suppl id="S7">
               <title>
                  <p>Additional file 7</p>
               </title>
               <text>
                  <p>
                     <b>Complete enrichment analysis of all tissues in all four ontologies</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S7.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Top-scored canonical maps for tissue-specific genesets</p>
               </caption>
               <text>
                  <p><b>Top-scored canonical maps for tissue-specific genesets</b>. (I) Retina, visual perception canonical map. The genes identified as expressed in retina are marked with red 'thermometer' icons. (II) Cortisone biosynthesis and metabolism canonical map. The genes identified as present in adrenal gland are marked with red 'thermometer' icons.</p>
               </text>
               <graphic file="1741-7007-6-49-3"/>
            </fig>
            <suppl id="S8">
               <title>
                  <p>Additional file 8</p>
               </title>
               <text>
                  <p>
                     <b>Gene Set Enrichment Analysis for selected tissues, ontologies</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S8.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Network properties and interactome analysis of HK and tissue-specific genes</p>
            </st>
            <p>We analyzed the 'interaction space' of the protein products of HK and tissue-specific genes in three steps. First, we calculated topological properties of the 'interactomes', such as degree of connectivity, average shortest path and clustering coefficient <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Second, we evaluated the interactome structure by parsing the interactions into three components (IN, OUT and giant strong component, GSC) using the 'bow-tie' classification of Broder et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Third, we divided HK and tissue proteins into protein classes and calculated relative enrichment across particular classes. For topological analysis, we used the directional protein-protein interaction content in MetaCore&#8482; (GeneGo Inc.). We also analyzed, for reference, four previously published HK gene sets.</p>
            <p>All five HK sets shared similar network topology features: higher connectivity, a somewhat lower clustering coefficient, and shorter paths than the global human interactome (Figure <figr fid="F4">4A</figr>, Additional file <supplr sid="S9">9</supplr>). The tissue-specific networks varied substantially in their degree of connectivity, with colon and ovary-specific proteins the most interconnected, and prostate and salivary gland the least (Figure <figr fid="F4">4A</figr>, Additional file <supplr sid="S10">10</supplr>).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Network analysis and protein class enrichment of housekeeping and tissue-specific genes</p>
               </caption>
               <text>
                  <p><b>Network analysis and protein class enrichment of housekeeping and tissue-specific genes</b>. A. Average connectivity of global protein interaction, housekeeping (HK) and several tissue-specific gene sets. B. Protein class enrichment analysis. The actual (black or blue bars) and expected (open red box) number of transcription factors (first bar), receptors (second bar) and enzymes (third bar) for each tissue. The blue bars correspond to HK proteins. The circles identify several tissues enriched in transcriptional factors (blue), receptors (green) and enzymes (red). C. Network component enrichment analysis. The upper part of the figure shows the schematic illustration of IN, OUT and GSC (figure adopted from Brodel et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>). The lower panel shows the actual (black and blue bars) and expected (open red boxes) number of genes in the GSC (first), IN (second) and OUT (third) components. The blue bars correspond to housekeeping proteins. The circles identify several tissues enriched in GSC proteins (red), IN proteins (green) and OUT (blue).</p>
               </text>
               <graphic file="1741-7007-6-49-4"/>
            </fig>
            <suppl id="S9">
               <title>
                  <p>Additional file 9</p>
               </title>
               <text>
                  <p>
                     <b>Network topological properties of all housekeeping sets</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S9.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S10">
               <title>
                  <p>Additional file 10</p>
               </title>
               <text>
                  <p>
                     <b>Tissue-specific connectivity</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S10.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>Network topology parameters were also used for tissue clustering. The topological distances between all possible tissue pairs were calculated as the ratio of the average shortest paths between proteins belonging to different tissues to the average shortest paths between proteins from the same tissue. Standard hierarchical clustering <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> was applied to generate the tissue tree. Interestingly, some of the functionally related tissues such as brain, fetal brain, retina and spinal cord were grouped together and characterized by very similar average shortest paths (Additional file <supplr sid="S10">10</supplr>).</p>
            <p>An interactome can be divided into three structural parts, or 'components' <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. The GSC is the most densely connected part of the network, characterized by the property that any two nodes can be connected through directed paths in both directions. Directed paths from the GSC lead out to the OUT component and paths from the IN component go in to the GSC (Figure <figr fid="F4">4C</figr>, upper panel). In order to facilitate interpretation, the interactions were divided by 16 mechanisms (Additional file <supplr sid="S11">11</supplr>). The IN component is enriched with 'outgoing' interactions such as ligand-receptor binding interactions, which occur five times more frequently in IN than in GSC, and ten times more frequently than in the OUT component. Interactions between IN and GSC components also have five times more ligand-receptor interactions than between GSC and OUT. The GSC component predominantly features 'transcriptional regulation' interactions, which appear ten times more often in the GSC than the OUT, and six times more often than the IN components. The OUT component is enriched with 'incoming' interactions such as transcription factor target. Indeed, 48% of interactions between the GSC and OUT are transcription regulation compared with 4% of interaction between IN and GSC being transcription regulation. The GSC encompassed around 50% of the HK network. Overall, the fraction of HK genes was larger in the GSC (<it>p </it>&lt; 10-5) and slightly lower in the IN and OUT component than a random gene set of the same size (Figure <figr fid="F4">4C</figr>). This suggests that HK proteins comprise a significant proportion of signal transduction interactions. Mammary gland and tonsil were enriched in GSC (<it>p </it>&lt; 0.08 and <it>p </it>&lt; 0.14); intestine and adult kidney in OUT component (<it>p </it>&lt; 0.03 and <it>p </it>&lt; 0.07), and skin (<it>p </it>&lt; 0.1) in IN component (Figure <figr fid="F4">4C</figr>, Additional file <supplr sid="S12">12</supplr>).</p>
            <suppl id="S11">
               <title>
                  <p>Additional file 11</p>
               </title>
               <text>
                  <p>
                     <b>Table for interactions mechanisms</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S11.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S12">
               <title>
                  <p>Additional file 12</p>
               </title>
               <text>
                  <p>
                     <b>Table for component analysis, protein class analysis for all tissues</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S12.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>Next, all HK and tissue-specific sets were divided into protein classes according to the MetaCore protein classification schema (Additional file <supplr sid="S13">13</supplr>). Enrichment in proteins of a certain class was then calculated from the hypergeometric distribution. The distribution of protein classes in the global interactome was used as a reference (Methods). HK proteins were enriched in enzymes (<it>p </it>&lt; 10-8). Intestine, liver and adrenal gland featured the highest fraction of enzymes. Fetal tissues were highly enriched in transcription factors, and retina and brain in membrane receptors (Figure <figr fid="F4">4B</figr>). Such distributions make intuitive biological sense, and both network structure and protein class enrichment were consistent with each other.</p>
            <suppl id="S13">
               <title>
                  <p>Additional file 13</p>
               </title>
               <text>
                  <p>
                     <b>MC legend with protein classes</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S13.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Tissue-specific networks</p>
            </st>
            <p>We generated tissue-specific networks for nine tissues using the Analyze Networks (AN) network algorithm in MetaCore. AN is a version of the 'shortest path' algorithm, optimized for larger data sets. AN generates overlapping sub-networks of up to 50 nodes and calculates enrichment of the networks with input data and canonical pathways <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. Unlike pre-built GG process networks, AN networks are built from input lists of network objects using the manually curated interaction database in MetaCore. The networks comprise metabolic reactions, signaling interactions and canonical pathways. The tissue-specific networks for liver and adrenal gland are shown in Figure <figr fid="F5">5</figr>. The liver network is enriched with cholesterol metabolism, as well as with enzymes involved in aldosterone, estradiol and heme metabolism. The adrenal network reflects the hormonal function of that organ. It is comprised of reactions and enzymes taking part in adrenaline, noradrenaline and corticosteroid synthesis.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Tissue-specific networks</p>
               </caption>
               <text>
                  <p><b>Tissue-specific networks</b>. (I) Liver; (II) Adrenal gland. Genes identified as tissue-specific are marked with solid red circles.</p>
               </text>
               <graphic file="1741-7007-6-49-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Clustering tissues based on gene expression patterns</p>
            </st>
            <p>Tissue gene expression patterns were clustered by Euclidean distance across the average normalized probe intensities of the replicate hybridizations for each tissue <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> (Figure <figr fid="F6">6A</figr>). Most tissue clusters display evolutionary and functional relatedness. Fetal tissues clustered closest to their adult counterparts (brain-fetal brain, liver-fetal liver, thymus-fetal thymus, Figure <figr fid="F6">6A</figr>). Tissues of the same developmental origin (ectodermal, mesodermal or endodermal) tended to cluster together; for instance, heart and skeletal muscle are both derived from mesoderm, pancreas and salivary gland are part of the gastrointestinal system. Almost all of the most closely correlated tissue pairs were evolutionary or functionally related. Some genes were uniquely expressed in pairs of tissue (Figure <figr fid="F6">6B</figr>), rather than individual tissues (for instance, in brain and fetal brain) or in triplets and quadruplets of tissues (brain, fetal brain and spinal cord) (Additional file <supplr sid="S14">14</supplr>). Often, these proteins were specific for a group of diseases. For instance, proteins uniquely present in spinal cord, retina and brain are involved in central nervous system and neurodegenerative diseases (Figure <figr fid="F6">6C</figr>).</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Clustering tissues into groups based on expression patterns</p>
               </caption>
               <text>
                  <p><b>Clustering tissues into groups based on expression patterns</b>. (a) Hierarchical clustering of genes based on gene expression data. (b) Tissue pairs having a significantly large number of genes uniquely expressed in them. (c) Proteins uniquely expressed in spinal cord, retina and brain are involved in diseases of the nervous system.</p>
               </text>
               <graphic file="1741-7007-6-49-6"/>
            </fig>
            <suppl id="S14">
               <title>
                  <p>Additional file 14</p>
               </title>
               <text>
                  <p>
                     <b>List of genes specific for tissue pairs and triplets</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S14.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Drug target distribution between housekeeping and tissue-specific genes</p>
            </st>
            <p>We compared the distribution of drug targets between HK and tissue-specific protein sets. Two sets of drug targets were compiled: 'therapeutic targets' &#8211; a set of 104 direct protein targets of commercially available drugs, and 'xenobiotic targets' &#8211; 518 proteins known to interact with xenobiotics. Therapeutic drugs are defined as key proteins or genes directly affected by a marketed or withdrawn drug from the market. Xenobiotic targets are proteins and genes known for physical interactions with a large set of bioactive compounds including drugs, drug candidates and lead compounds.</p>
            <p>On average, therapeutic targets were twice as prevalent among tissue-specific proteins than HK proteins (3.3% and 1.5% of all proteins, correspondingly) (Additional file <supplr sid="S15">15</supplr>). Therapeutic targets comprised as much as 25% of mammary gland and thymus-specific proteins. The distribution of xenobiotic targets was not essentially different between HK and the sum of tissue-specific proteins, with some 1.7% of HK proteins and 2.3% of tissue-specific proteins being xenobiotic targets. Some tissues, however, were highly enriched with xenobiotic targets. For instance, over 10% of proteins specific for mammary gland, ovary, retina, small intestine and spinal cord were identified as xenobiotic targets (Additional file <supplr sid="S15">15</supplr>).</p>
            <suppl id="S15">
               <title>
                  <p>Additional file 15</p>
               </title>
               <text>
                  <p>
                     <b>Drug targets</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S15.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>The definition of a 'housekeeping' or 'universally-expressed' gene is not settled. Suggested approaches vary from an estimated 10% of all genes <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> to transcript copy number <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> to using 'present' calls <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Elisenberg and Levanon <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> assumed that HK genes are highly expressed 'by nature' and, therefore, must be selected for shorter introns. We followed the original 'common-sense' proposal of Watson et al. <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> to define 2374 genes as HK because their transcripts were detected in all 31 human tissues tested, at a signal-to-noise ratio &#8805; 10, which we believe to be a relatively stringent cut-off (technical analysis of platform sensitivity at ABI has indicated that a signal-to-noise ratio of &#8805; 3 is sufficient to determine 'presence' of a transcript with a confidence of 99.9%. This set constitutes 8.2% of human genes based on the latest National Center for Biotechnology Information genome release <url>http://www.ncbi.nlm.nih.gov/mapview/stats/BuildStats.cgi?taxid=9606&amp;build=36&amp;ver=2</url>.</p>
         <p>Tissue-specific gene sets, comprising genes uniquely expressed in individual tissues by the SN10 criterion, averaged 43 genes for somatic tissues, and 485 mostly meiosis-specific genes in testis. The tissue specificity distribution is represented by a bimodal distribution with two peaks at 1 and 31 corresponding to tissue-specific and HK genes (Additional file <supplr sid="S16">16</supplr>). Interestingly, the female reproductive organ, ovary, expressed many fewer unique genes, probably due to the fact that ovary consists mainly of diploid tissue compared with the haploid-rich testis, as ovary produces far fewer eggs than testis does sperm.</p>
         <suppl id="S16">
            <title>
               <p>Additional file 16</p>
            </title>
            <text>
               <p>
                  <b>Tissue-specificity distribution of all genes</b>
               </p>
            </text>
            <file name="1741-7007-6-49-S16.docx">
               <p>Click here for file</p>
            </file>
         </suppl>
         <p>There is continuing debate in gene expression analysis over how to best identify genes differentially expressed between treatment conditions, disease states and so on. Arbitrarily applied cut-offs in relative magnitude of expression (fold-change) and statistical measures such as ANOVA and <it>t</it>-test are commonly used <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. Renewed debate was sparked recently <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp> following publication of the results of the Microarray Quality Control Consortium research efforts <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. It was shown that a combination of non-stringent statistical cut-offs and fold-change ranking of gene lists resulted in improved concordance between measurements made using different expression analysis technologies, an approach likely therefore to give the most accurate biological 'answer'.</p>
         <p>We applied both signal level (signal-to-noise) and statistical (<it>t</it>-test) methods to determine tissue specificity of gene expression. Although the resulting datasets did not overlap well in gene content (6% on average), they were highly similar by functional analysis. Importantly, the intersections between distributions of entities within functional ontologies for SN10 and <it>t</it>-test sets were substantially larger than between genes (Additional file <supplr sid="S3">3</supplr>), suggesting that the two methods both select biologically relevant genes, which can be interconnected on pathways and networks. Further, despite incomplete overlap between our HK set and four other previously published HK sets, all gave very similar results in ontological enrichment analyses. These observations support the use of functional descriptors (pathways, networks, ontological categories and so on) as tools for quantitative characterization of conditional gene expression <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>.</p>
         <p>HK and tissue-specific gene sets were subjected to comprehensive functional analysis in three steps: enrichment analysis across functional ontologies using hypergeometric distribution <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and GSEA methods <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>; local interactome analysis; generation of signaling/metabolic networks using HK and tissue-specific gene content as input nodes. We used multiple functional ontologies for enrichment analysis: GO processes, GG processes, disease categories and canonical pathway maps. Each ontology is established using different criteria and reveals a different aspect of cellular functionality. Canonical pathways are experimentally confirmed multi-step chains of reactions. GO and GG processes differ, as connectivity is established at the level of functional association (GO processes) or binary interactions (GG processes). Diseases and disease network ontologies represent pathways involved in pathological processes, largely derived from literature data linking network objects (genes, proteins, metabolites) to disease.</p>
         <p>Ontology enrichment showed that HK genes, indeed, were enriched in vital cellular functions such as oxidative phosphorylation, ubiquitin-proteasomal proteolysis, translation and major pathways of endogenous metabolism. Importantly, many essential processes involve a large number of interconnected proteins (typically several hundred), and many of these genes belong to our HK set, confirming the functional integrity and completeness of the HK set.</p>
         <p>Recently it was shown in line with our results that the number of HK genes is larger than previously reported <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B25">25</abbr></abbrgrp>. Our findings are complementary to the high-resolution study based on expressed sequence tag (EST) profiling reported by Zhu et al. <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Indeed, both methods have advantages and drawbacks, EST data although have potentially higher resolution, suffers from poor annotation and low sampling depth <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Our results are very consistent throughout the 31 tissues and are well confirmed by enrichment analysis in multiple functional ontologies. We expect that higher resolution data would not alter the main conclusions of the enrichment and network analysis.</p>
         <p>Ontology enrichment in tissue-specific genes was strikingly different from HK genes, and in most cases strikingly consistent with the tissue's function. Retina genes were enriched with such processes as visual perception, detection of visible light, detection of light stimulus and eye disease-related genes, and testis genes scored highest for reproductive processes, cell cycle, cell division and male reproductive diseases, for example. Importantly, almost all tissues featured a high level of consistency between the highest-scored categories in different ontologies. Further, consistency was observed whether hypergeometric <it>p</it>-values or GSEA analyses were employed. This important observation adds confidence to the conclusions drawn from the analysis of tissue-specific gene expression, and can be applied similarly to other studies of differentially expressed genes and proteins. It also will facilitate the development of a comprehensive set of 'meta-ontologies' derived from an understanding of the interplay and crossover between the different types of ontological categories available, leading to improved understanding of biological systems, and the identification of biologically meaningful biomarkers and new therapeutic targets.</p>
         <p>Network analysis <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> showed that a large part of HK genes belonged to the most interconnected GSC core of the global interactome, with equal representation of 'in' and 'out' interactions. On the other hand, tissue-specific genes varied greatly in network component composition. Skin and fetal brain were enriched with 'IN' component, consistent with a larger proportion of ligand-receptor interactions in these tissues. Not surprisingly, these tissues featured a large fraction of receptors in the protein class enrichment test. In comparison, 'effector' organs such as intestine, liver and kidney, were enriched with 'OUT' interactions and enzymes. A transcription factor-enzyme pair represents a typical directed 'out' interaction &#8211; a final step in delivering a signal from stimulus to core 'effectors' such as endogenous metabolic pathways. These findings are in line with the 'bow-tie' structure of metabolic networks, which ensures robustness of major biological functions <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, and is needed for coordinated response to stimuli and perturbation <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>.</p>
         <p>'Multi-dimensional' functional analysis of gene expression data adds to the ongoing debate on proper statistical procedures for gene set selection. A 'rule of thumb' opinion is that only gene sets and distributions with low <it>p</it>-values (typically, <it>p </it>&lt; 0.01) should be considered valid, although such <it>ad hoc </it>cut-offs may eliminate many condition-relevant genes from analysis. Functional analysis provides a different level of validation for such datasets by consideration of functional biological units. For instance, the retina-specific gene set is enriched in vision-related pathways, GO and GG ontologies, and diseases. The same set is enriched with OUT network component, and the protein class 'receptors'. Self-assembling retina-specific networks reconstruct rhodopsin-stimulated signal transduction and key steps in the metabolism of rhodopsin and its co-factors. Although the retina gene set is too small to achieve <it>p</it>-values &lt; 0.01 in most of these analyses, the combination of independent functional evidence builds a strong case for its relevance and importance as a highly likely source of specific biomarkers for eye conditions and ophthalmic drug response. The comprehensive biological interaction data in MetaCore allows multiple 'high-content' data types to be mapped onto networks. Gene expression data, proteomic, single nucleotide polymorphism and metabolomic data all can be addressed, visualized and used in network construction and analysis, independently or in concert <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, making it a universal tool for studies on eye and other diseases.</p>
         <p>Tissue-specific gene sets (and, therefore, networks) were twice as enriched in drug targets as the HK set. 'Therapeutic' targets comprised up to 25% of mammary gland and thymus-specific proteins. Most of the identified tissue-specific drug target genes are targets to drugs whose mechanism are consistent with the tissues. For example, three brain-specific proteins identified to have at least one drug target are GABA-A receptor beta-2 subunit, Na (V) I alpha and SCN10A. GABA-A receptor beta-2 subunit is a target for clomethiazole (sedative and anticonvulsant), Na (V) I alpha is a target for drugs such as levetiracetam (epilepsy), tetrodotoxin (anesthetics), toprimate (anticonsulvant) and SCN10A is target for bupivacaine racemic (anesthetics) and lidocaine (anesthetics).</p>
         <p>Tissue-specific proteins and their pathways and networks are therefore a potentially rich source of targets and biomarkers for disease treatment and diagnostics. The addition of data on which network components are detectable in blood and other readily accessible body liquids makes tissue-specific networks exceptionally attractive for the identification of putative biomarkers of tissue-specific disease, drug-response or toxicity.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Data description</p>
            </st>
            <p>RNA from 31 normal human tissue RNAs (Table <tblr tid="T1">1</tblr>) was purchased from Clontech (Palo Alto, CA) while RNA from UHR (Universal Human Reference) was purchased from Strategene (San Diego, CA). All RNA samples were analyzed on the Agilent 2100 Bioanalyzer for RNA quality control. Three technical replicates were included for each of the tissue sample.</p>
            <p>The Applied Biosystems Human Genome Survey Microarray (P/N 4337467) contains 31,700 60-mer oligonucleotide probes representing 27,868 individual human genes. Digoxigenin-UTP labeled cRNA was generated and amplified from 1 &#956;g of total RNA from each sample using Applied Biosystems Chemiluminescent RT-IVT Labeling Kit v 1. 0 (P/N 4340472) according to the manufacturer's protocol (P/N 4339629). Fifteen micrograms of DIG-labeled cRNA was hybridized for 16 hrs at 55&#176;C and chemiluminescence detection, image acquisition and analysis were performed using Applied Biosystems Chemiluminescence Detection Kit (P/N 4342142) and Applied Biosystems 1700 Chemiluminescent Microarray Analyzer (P/N 4338036) following the manufacturer's protocol (P/N 4339629). Images were auto-gridded and the chemiluminescent signals were quantified, background subtracted, and finally, spot- and spatially-normalized using the Applied Biosystems 1700 Chemiluminescent Microarray Analyzer software v 1. 1 (P/N 4336391). Probe signals were normalized using the Limma method <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. The gene expression data is publicly available at GEO public repository website. (GSE7905).</p>
         </sec>
         <sec>
            <st>
               <p>Identification of HK and tissue-specific genes</p>
            </st>
            <p>A 'stringent' set of HK genes expressed in each of the 31 tissues was identified by applying cut-off to the ABI-calculated signal-to-noise (S/N) for each probe. The S/N is a metric that captures the confidence of the measurement 'detectability' above all known sources of noise. S/N is commonly used to bin genes or probes as 'Present' or 'Absent' at a desired level of confidence. Since the S/N expresses the number of standard deviations, the associated confidence can be looked up from a probability table for a normal distribution. For example, signals with S/N &#8805; 3 have > 99.9% confidence in the measurement. For our purposes in determining whether a gene was expressed in a given tissue, we additionally took into account the inflection points where the slope of the S/N curve significantly changes. We applied the threshold across all three replicate hybridizations on a probe-by-probe basis to identify the genes expressed in each tissue with a high level of confidence. The overlap between all 31 sets, that is, genes consistently expressed in all tissues, defines the HK set (more details in Additional file <supplr sid="S17">17</supplr>). Tissue-specific genes were defined as those uniquely expressed in a single given tissue at a S/N &#8805; 10. Genes expressed in highly related tissue pairs were also defined as tissue specific for some analyses.</p>
            <suppl id="S17">
               <title>
                  <p>Additional file 17</p>
               </title>
               <text>
                  <p>
                     <b>Definition of tissue specific and housekeeping genes</b>
                  </p>
               </text>
               <file name="1741-7007-6-49-S17.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Topological measures</p>
            </st>
            <sec>
               <st>
                  <p>Degree of nodes</p>
               </st>
               <p>The number of links connected to a node gives the node's degree. Since many real networks are directed, nodes are characterized by in and out-degree, giving the number of incoming and outgoing interactions.</p>
            </sec>
            <sec>
               <st>
                  <p>Average shortest path</p>
               </st>
               <p>The shortest distance between two nodes is the number of links along the shortest path. The average shortest path is the average over the shortest paths for all node pairs in the network. When we calculate the shortest paths for a subset of nodes in the network we consider also paths crossing through nodes that are not part of the subset.</p>
            </sec>
            <sec>
               <st>
                  <p>Average clustering coefficient</p>
               </st>
               <p>The clustering coefficient is a measure that captures to what degree node's neighbors are connected. It is defined as:</p>
               <p>
                  <display-formula>
                     <m:math name="1741-7007-6-49-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:msub>
                                 <m:mi>C</m:mi>
                                 <m:mi>i</m:mi>
                              </m:msub>
                              <m:mo>=</m:mo>
                              <m:mfrac>
                                 <m:mrow>
                                    <m:mn>2</m:mn>
                                    <m:msub>
                                       <m:mi>n</m:mi>
                                       <m:mi>i</m:mi>
                                    </m:msub>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>k</m:mi>
                                       <m:mi>i</m:mi>
                                    </m:msub>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:msub>
                                       <m:mi>k</m:mi>
                                       <m:mi>i</m:mi>
                                    </m:msub>
                                    <m:mo>&#8722;</m:mo>
                                    <m:mn>1</m:mn>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                              </m:mfrac>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4qam0aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpjuaGdaWcaaqaaiabikdaYiabd6gaUnaaBaaabaGaemyAaKgabeaaaeaacqWGRbWAdaWgaaqaaiabdMgaPbqabaGaeiikaGIaem4AaS2aaSbaaeaacqWGPbqAaeqaaiabgkHiTiabigdaXiabcMcaPaaaaaa@3D7F@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>where <it>n</it><sub><it>i </it></sub>is the number of links among the <it>k</it><sub><it>i </it></sub>neighbors of node <it>i</it>. As <it>k</it><sub><it>i</it></sub><it>(k</it><sub><it>i</it></sub><it>-1)/2 </it>is the maximum number of such links, the clustering coefficient is a number between 0 and 1. The average clustering coefficient is obtained by averaging over the clustering coefficient of individual nodes. A network with high clustering coefficient is characterized by highly connected sub-graphs.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p><it>P</it>-value calculation</p>
            </st>
            <p>The enrichment levels of the HK genes and tissue-specific genes in the different parts of the network (IN, OUT, GSC) and in different protein classes were calculated using the hyper-geometric distribution:</p>
            <p>
               <display-formula>
                  <m:math name="1741-7007-6-49-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>P</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>k</m:mi>
                           <m:mo>,</m:mo>
                           <m:mi>D</m:mi>
                           <m:mo>,</m:mo>
                           <m:mi>n</m:mi>
                           <m:mo>,</m:mo>
                           <m:mi>N</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:mtable>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mi>D</m:mi>
                                             </m:mtd>
                                          </m:mtr>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mi>k</m:mi>
                                             </m:mtd>
                                          </m:mtr>
                                       </m:mtable>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:mtable>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mrow>
                                                   <m:mi>N</m:mi>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:mi>D</m:mi>
                                                </m:mrow>
                                             </m:mtd>
                                          </m:mtr>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mrow>
                                                   <m:mi>n</m:mi>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:mi>k</m:mi>
                                                </m:mrow>
                                             </m:mtd>
                                          </m:mtr>
                                       </m:mtable>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                              <m:mrow>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:mtable>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mi>N</m:mi>
                                             </m:mtd>
                                          </m:mtr>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mi>n</m:mi>
                                             </m:mtd>
                                          </m:mtr>
                                       </m:mtable>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiuaaLaeiikaGIaem4AaSMaeiilaWIaemiraqKaeiilaWIaemOBa4MaeiilaWIaemOta4KaeiykaKIaeyypa0tcfa4aaSaaaeaadaqadaqaauaabeqaceaaaeaacqWGebaraeaacqWGRbWAaaaacaGLOaGaayzkaaWaaeWaaeaafaqabeGabaaabaGaemOta4KaeyOeI0IaemiraqeabaGaemOBa4MaeyOeI0Iaem4AaSgaaaGaayjkaiaawMcaaaqaamaabmaabaqbaeqabiqaaaqaaiabd6eaobqaaiabd6gaUbaaaiaawIcacaGLPaaaaaaaaa@48CC@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>We calculated the <it>p</it>-value corresponding to the enrichment-level according to</p>
            <p>
               <display-formula>
                  <m:math name="1741-7007-6-49-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>p</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>k</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mo>=</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                                 <m:mi>D</m:mi>
                              </m:munderover>
                              <m:mrow>
                                 <m:mi>P</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>i</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:mi>D</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:mi>n</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:mi>N</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiCaaNaeiikaGIaem4AaSMaeiykaKIaeyypa0ZaaabCaeaacqWGqbaucqGGOaakcqWGPbqAcqGGSaalcqWGebarcqGGSaalcqWGUbGBcqGGSaalcqWGobGtcqGGPaqkaSqaaiabdMgaPjabg2da9iabdUgaRbqaaiabdseaebqdcqGHris5aaaa@4327@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>which gives the probability of having <it>k </it>or more marked elements in a sample of size <it>n </it>by random selection.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>TN, YN and ZD conceived the study and designed research. ZD drafted and YN wrote the final manuscript. RJB contributed to writing and editing of the manuscript. ZD, WS and ES performed the statistical analysis of the data. TS, DD, AB, AG, ES, ZD and YN performed research. ER contributed with making the figures in the manuscript. KL, JB, RRS designed and carried out the experiment</p>
      </sec>
   </bdy>
   <bm>
      <refgrp>
         <bibl id="B1">
            <aug>
               <au>
                  <snm>Watson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Hopkins</snm>
                  <fnm>NH</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Steitz</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Weiner</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <source>Molecular Biology of the Gene</source>
            <publisher>Benjamin/Cummings</publisher>
            <edition>2</edition>
            <pubdate>1987</pubdate>
            <volume>1</volume>
            <fpage>704</fpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Human housekeeping genes are compact</p>
            </title>
            <aug>
               <au>
                  <snm>Eisenberg</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Levanon</snm>
                  <fnm>EY</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>7</issue>
            <fpage>362</fpage>
            <lpage>365</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12850439</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Comparison of human adult and fetal expression and identification of 535 housekeeping/maintenance genes</p>
            </title>
            <aug>
               <au>
                  <snm>Warrington</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Nair</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mahadevappa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tsyganskaya</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Physiol Genomics</source>
            <pubdate>2000</pubdate>
            <volume>2</volume>
            <fpage>143</fpage>
            <lpage>147</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11015593</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Further understanding human disease genes by comparing with housekeeping genes and other genes</p>
            </title>
            <aug>
               <au>
                  <snm>Tu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>31</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1397819</pubid>
                  <pubid idtype="pmpid" link="fulltext">16504025</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>A compendium of gene expression in normal human tissues</p>
            </title>
            <aug>
               <au>
                  <snm>Hsiao</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Dangond</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Yoshida</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hong</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>RV</fnm>
               </au>
               <au>
                  <snm>Misra</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dillon</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>KF</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Haverty</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Weng</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Mutter</snm>
                  <fnm>GI</fnm>
               </au>
               <au>
                  <snm>Frosch</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>MacDonald</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Milford</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Crum</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Bueno</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Pratt</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Mahadevappa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Warrington</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Stephanopoulos</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Stephanopoulos</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gullans</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Physiol Genomics</source>
            <pubdate>2001</pubdate>
            <volume>7</volume>
            <fpage>97</fpage>
            <lpage>104</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11773596</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Tissue-specific regulatory elements in mammalian promoters</p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Sumazin</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>MQ</fnm>
               </au>
            </aug>
            <source>Mol Syst Biol</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <fpage>73</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1800356</pubid>
                  <pubid idtype="pmpid" link="fulltext">17224917</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Housekeeping and tissue-specific genes in mouse tissues</p>
            </title>
            <aug>
               <au>
                  <snm>Kouadjo</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Nishida</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Cadrin-Girard</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Yoshioka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>St-Amand</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>127</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1888706</pubid>
                  <pubid idtype="pmpid" link="fulltext">17519037</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Identification of novel universal housekeeping genes by statistical analysis of microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Koh</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Biochem Mol Biol</source>
            <pubdate>2007</pubdate>
            <volume>40</volume>
            <fpage>226</fpage>
            <lpage>231</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17394773</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Detection of genes with tissue-specific expression patterns using Akaike's information criterion procedure</p>
            </title>
            <aug>
               <au>
                  <snm>Kadota</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nishimura</snm>
                  <fnm>SI</fnm>
               </au>
               <au>
                  <snm>Bono</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Okazaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Takahashi</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Physiol Genomics</source>
            <pubdate>2003</pubdate>
            <volume>12</volume>
            <fpage>251</fpage>
            <lpage>259</lpage>
            <xrefbib>
               <pubid idtype="pmpid">12499447</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>A multiple of genes expressed solely in meiotic or postmeiotic spermatogenic cells offers a myriad of conservative targets</p>
            </title>
            <aug>
               <au>
                  <snm>Schultz</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hamra</snm>
                  <fnm>FK</fnm>
               </au>
               <au>
                  <snm>Garbers</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>12201</fpage>
            <lpage>12206</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">218736</pubid>
                  <pubid idtype="pmpid" link="fulltext">14526100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>The conserved transcriptome in human and rodent male gametogenesis</p>
            </title>
            <aug>
               <au>
                  <snm>Chalmel</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Rolland</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Niederhauser-Wiederkehr</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Chung</snm>
                  <fnm>CSW</fnm>
               </au>
               <au>
                  <snm>Demougin</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gattiker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Patard</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Wolgemuth</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>J&#233;gou</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Primig</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>8346</fpage>
            <lpage>8351</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1864911</pubid>
                  <pubid idtype="pmpid" link="fulltext">17483452</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>The Hypergeometric Series</p>
            </title>
            <aug>
               <au>
                  <snm>Feller</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>An Introduction to Probability Theory and its Applications</source>
            <publisher>New York: Wiley</publisher>
            <edition>3</edition>
            <pubdate>1968</pubdate>
            <volume>1</volume>
            <fpage>41</fpage>
            <lpage>45</lpage>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Subramanian</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tamayo</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mootha</snm>
                  <fnm>VK</fnm>
               </au>
               <au>
                  <snm>Mukherjee</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ebert</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Gillette</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Paulovich</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pomeroy</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Golub</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Mesirov</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>15545</fpage>
            <lpage>15550</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1239896</pubid>
                  <pubid idtype="pmpid" link="fulltext">16199517</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Network Biology: Understanding the cell's functional organization</p>
            </title>
            <aug>
               <au>
                  <snm>Barabasi</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>101</fpage>
            <lpage>113</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14735121</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Graph structure in the web</p>
            </title>
            <aug>
               <au>
                  <snm>Broder</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Maghoul</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Raghavan</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rajagopalan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stata</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tomkins</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wiener</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Computer Networks</source>
            <pubdate>2000</pubdate>
            <volume>33</volume>
            <fpage>309</fpage>
            <lpage>320</lpage>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Cluster analysis and display of genome-wide expression patterns</p>
            </title>
            <aug>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Spellman</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>14863</fpage>
            <lpage>14868</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">24541</pubid>
                  <pubid idtype="pmpid" link="fulltext">9843981</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Pathway mapping tools for analysis of high content data</p>
            </title>
            <aug>
               <au>
                  <snm>Ekins</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nikolsky</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Bugrim</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kirillov</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Methods Mol Biol</source>
            <publisher>Humana Press</publisher>
            <pubdate>2007</pubdate>
            <volume>356</volume>
            <fpage>319</fpage>
            <lpage>350</lpage>
            <xrefbib>
               <pubid idtype="pmpid">16988414</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Molecular definition of breast tumor heterogeneity</p>
            </title>
            <aug>
               <au>
                  <snm>Shipitsin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Argani</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Weremowitz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bloushtain-Qimron</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Serebryiskaya</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Berouhim</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Halushka</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Sukumar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>LN</fnm>
               </au>
               <au>
                  <snm>Garber</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Schnitt</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Nikolsky</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gelman</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Polyak</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Cancer Cell</source>
            <pubdate>2007</pubdate>
            <volume>11</volume>
            <fpage>259</fpage>
            <lpage>273</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17349583</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray</p>
            </title>
            <aug>
               <au>
                  <snm>Schena</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shalon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1995</pubdate>
            <volume>270</volume>
            <fpage>467</fpage>
            <lpage>470</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7569999</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Microarray expression profiling identifies genes with altered expression in HDL-deficient mice</p>
            </title>
            <aug>
               <au>
                  <snm>Callow</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Dudoit</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gong</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>2022</fpage>
            <lpage>2029</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">313086</pubid>
                  <pubid idtype="pmpid" link="fulltext">11116096</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Six things you won't find in the MAQC</p>
            </title>
            <aug>
               <au>
                  <snm>Perket</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Scientist</source>
            <pubdate>2006</pubdate>
            <volume>20</volume>
            <fpage>68</fpage>
            <lpage>72</lpage>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Statistical methods and microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Klebanov</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Qiu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Welle</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yakovlev</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2007</pubdate>
            <volume>25</volume>
            <fpage>25</fpage>
            <lpage>26</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17211383</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements</p>
            </title>
            <aug>
               <au>
                  <cnm>MAQC Consortium</cnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2006</pubdate>
            <volume>24</volume>
            <fpage>1151</fpage>
            <lpage>1169</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16964229</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>A novel method for generation of signature networks as biomarkers from complex high-throughput data</p>
            </title>
            <aug>
               <au>
                  <snm>Nikolsky</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ekins</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bugrim</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Toxicol Lett</source>
            <pubdate>2005</pubdate>
            <volume>158</volume>
            <issue>1</issue>
            <fpage>20</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15871913</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>How many human genes can be defined as housekeeping with current expression data?</p>
            </title>
            <aug>
               <au>
                  <snm>Zhu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Song</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>172</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2396180</pubid>
                  <pubid idtype="pmpid" link="fulltext">18416810</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Biological robustness</p>
            </title>
            <aug>
               <au>
                  <snm>Kitano</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>826</fpage>
            <lpage>837</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15520792</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Bow ties, metabolism and disease</p>
            </title>
            <aug>
               <au>
                  <snm>Csete</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Trends Biotechnol</source>
            <pubdate>2004</pubdate>
            <volume>22</volume>
            <fpage>446</fpage>
            <lpage>450</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15331224</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Genetic regulators of large-scale transcriptional signatures in cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Adler</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Horlings</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nuyten</snm>
                  <fnm>DSA</fnm>
               </au>
               <au>
                  <snm>Vijver</snm>
                  <mnm>van de</mnm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Chnag</snm>
                  <fnm>HY</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2006</pubdate>
            <volume>38</volume>
            <fpage>421</fpage>
            <lpage>430</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1435790</pubid>
                  <pubid idtype="pmpid" link="fulltext">16518402</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Limma: Linear models for microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Smyth</snm>
                  <fnm>GK</fnm>
               </au>
            </aug>
            <source>Bioinformatics and Computational Biology Solutions using R and Bioconductor</source>
            <publisher>New York: Springer</publisher>
            <editor>Gentleman R, Carey V, Dudoit S, Irizarry R, Huber W</editor>
            <pubdate>2005</pubdate>
            <fpage>397</fpage>
            <lpage>420</lpage>
         </bibl>
      </refgrp>
   </bm>
</art>
