<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-10-S1-S69</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Comparison of automated candidate gene prediction systems using genes implicated in type 2 diabetes by genome-wide association studies</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Teber</snm>
               <mi>T</mi>
               <fnm>Erdahl</fnm>
               <insr iid="I1"/>
               <email>e.teber@victorchang.edu.au</email>
            </au>
            <au id="A2">
               <snm>Liu</snm>
               <mi>Y</mi>
               <fnm>Jason</fnm>
               <insr iid="I1"/>
               <email>j.liu@victorchang.edu.au</email>
            </au>
            <au id="A3">
               <snm>Ballouz</snm>
               <fnm>Sara</fnm>
               <insr iid="I1"/>
               <email>s.ballouz@victorchang.edu.au</email>
            </au>
            <au id="A4">
               <snm>Fatkin</snm>
               <fnm>Diane</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>d.fatkin@victorchang.edu.au</email>
            </au>
            <au ca="yes" id="A5">
               <snm>Wouters</snm>
               <mi>A</mi>
               <fnm>Merridee</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>m.wouters@victorchang.edu.au</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Victor Chang Cardiac Research Institute, 384 Victoria St, Darlinghurst, 2010, NSW, Australia</p>
            </ins>
            <ins id="I2">
               <p>School of Medical Sciences, University of New South Wales, Sydney, Australia</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <supplement>
            <title>
               <p>Selected papers from the Seventh Asia-Pacific Bioinformatics Conference (APBC 2009)</p>
            </title>
            <editor>Michael Q Zhang, Michael S Waterman and Xuegong Zhang</editor>
            <note>Research</note>
         </supplement>
         <conference>
            <title>
               <p>The Seventh Asia Pacific Bioinformatics Conference (APBC 2009)</p>
            </title>
            <location>Beijing, China</location>
            <date-range>13&#8211;16 January 2009</date-range>
            <url>http://bioinfo.au.tsinghua.edu.cn/apbc2009/</url>
         </conference>
         <issn>1471-2105</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>Suppl 1</issue>
         <fpage>S69</fpage>
         <url>http://www.biomedcentral.com/1471-2105/10/S1/S69</url>
         <xrefbib>
            
         <pubidlist><pubid idtype="pmpid">19208173</pubid><pubid idtype="doi">10.1186/1471-2105-10-S1-S69</pubid></pubidlist></xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>30</day>
               <month>1</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Teber et al; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Automated candidate gene prediction systems allow geneticists to hone in on disease genes more rapidly by identifying the most probable candidate genes linked to the disease phenotypes under investigation. Here we assessed the ability of eight different candidate gene prediction systems to predict disease genes in intervals previously associated with type 2 diabetes by benchmarking their performance against genes implicated by recent genome-wide association studies.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Using a search space of 9556 genes, all but one of the systems pruned the genome in favour of genes associated with moderate to highly significant SNPs. Of the 11 genes associated with highly significant SNPs identified by the genome-wide association studies, eight were flagged as likely candidates by at least one of the prediction systems. A list of candidates produced by a previous consensus approach did not match any of the genes implicated by 706 moderate to highly significant SNPs flagged by the genome-wide association studies. We prioritized genes associated with medium significance SNPs.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The study appraises the relative success of several candidate gene prediction systems against independent genetic data. Even when confronted with challengingly large intervals, the candidate gene prediction systems can successfully select likely disease genes. Furthermore, they can be used to filter statistically less-well-supported genetic data to select more likely candidates. We suggest consensus approaches fail because they penalize novel predictions made from independent underlying databases. To realize their full potential further work needs to be done on prioritization and annotation of genes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The process of linking genes to disease phenotypes is rapidly gaining momentum since the first disease-causing gene was identified 25 years ago <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Alternative approaches adopted in the past to identify disease genes are the candidate gene approach, where likely suspects are prioritised and screened on a genome-wide basis; and linkage analysis where specific loci are determined systematically using family studies. The two approaches have been synthesized into a pipeline by completion of the Human Genome Project; and further enabled by the increased availability of high-throughput experimental data and the development of sophisticated bioinformatics tools. In addition there have been efforts in the bioinformatics community to systematize and automate candidate gene prediction. Automated prediction systems provide geneticists with a reduced list of genes estimated to have a high probability of involvement in the disease phenotype by sifting through hundreds to thousands of genes. Ultimately, these tools aim to give the researcher the best possible guidance in honing in on the gene culprits for further biological confirmation. Since their introduction in the early 2000s, the predictive powers of automated candidate gene prediction systems have improved, largely due to increases in biological systems knowledge and more effective algorithms.</p>
         <p>Candidate gene prediction systems vary in their approach and the data sources they draw on in generating predictions. These are summarised in Figure <figr fid="F1">1</figr> and Table <tblr tid="T1">1</tblr>. Comparing the performance of these systems can be difficult because of the use of custom benchmark test sets by individual groups. Typically, benchmarking data is derived from genotype-phenotype information from the Online Mendelian Inheritance in Man (OMIM) database <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, but groups have used varying subsets of diseases. Several groups have tried to use standard benchmark sets <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>, but these efforts have been limited. In addition, it is difficult to predict whether benchmarks which predominately contain data on well characterised diseases with Mendelian transmission patterns (i.e. dominant, recessive, X-linked) resulting from mutations in single genes <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> will be effective in predicting genes involved in less well characterised diseases, or in complex diseases.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Automated Candidate Gene Prediction Systems</p>
            </caption>
            <tblbdy cols="1">
               <r>
                  <c ca="left">
                     <p>
                        <b>Semi-Automated Systems</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>GeneSeeker </it>is a semi-automated web-server tool which selects positional candidates based on expression and phenotypic data from human and mouse. Queries must be formulated by the end-user using Boolean expressions <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B33">33</abbr></abbrgrp>. &#9824; &#9671;</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <b>Systems Biology Techniques</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>Prioritizer </it>uses pathway and interaction data from KEGG <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B34">34</abbr></abbrgrp>, Reactome <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, and HPRD <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Interactions are also predicted using a Bayesian technique based on GO keywords <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> and other databases <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>.</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>In <it>Gentrepid </it>Common Pathway Scanning (CPS), pathways are associated with phenotypes using either known disease genes, or by searching for enrichment of pathways across multiple disease intervals associated with the phenotype <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. &#9824;&#9671;</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>Oti et al </it>use protein-protein interaction data from HPRD <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, Y2H <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>, and PCP <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp> giving coverage of 10 894 human genes <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>.</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <b>Genotype-Phenotype Mapping Methods</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>G2D </it><abbrgrp><abbr bid="B32">32</abbr></abbrgrp> uses biomedical literature to associate pathological conditions with GO terms <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Candidate genes are identified by homology to GO-annotated disease-associated genes. &#9824;&#9671;</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>Gentrepid </it>Common Module Profiling (CMP) searches for enrichment of particular domains in gene clusters associated with a particular phenotype. Domains are extracted either from known disease genes or by comparison of multiple disease intervals <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. &#9824;&#9671;</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>POCUS </it>searches for over-representation of functional annotation among multiple loci associated with the same disease. Functional annotation is based on keywords from InterPro domains <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> and GO <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. No <it>a priori </it>knowledge of the phenotype is required <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. &#9824;</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <b>Techniques based on a bipartite distribution of "disease" and "non-disease" genes</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>The <it>eVOC </it>system uses text mining of biomedical literature to associate a phenotype with anatomy terms and links these with human expression data to produce a ranked list of disease genes. The classifier is a machine-learning technique, based on a bipartite training set of 17 known "disease genes" and 306 "non-disease genes" <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. &#9824;</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>DGP (Disease Gene Prediction) </it>is a web tool which selects genes based on protein sequence properties. The properties analysed by DGP include protein length, degree of sequence conservation, the extent of phylogenetic relationship and paralogy patterns <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B41">41</abbr></abbrgrp>. &#9824;</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>PROSPECTR </it>(PRiOrization by Sequence and Phylogenetic Extent of CandidaTe Regions) uses an alternating decision tree to discriminate "disease genes" from "non-disease genes" using a classifier based on sequence features such as gene length, protein length, and similarity of homologs in other species <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. &#9824;</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <b>Hybrid techniques</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>SUSPECTS </it>combines a genotype-phenotype mapping method based on disease-gene-associated keywords from InterPro and GO, and expression libraries, with the <it>PROSPECTR </it>Boolean classifier. Disease genes are prioritized <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. &#9824; &#9671;</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>&#9824; Assessed here, &#9671; Webserver.</p>
            </tblfn>
         </tbl>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Data sources and approaches used in automated candidate gene prediction methods</p>
            </caption>
            <text>
               <p><b>Data sources and approaches used in automated candidate gene prediction methods</b>. <b>(A): </b>Most systems draw on at least two types of data. <it>SUSPECTS </it><abbrgrp><abbr bid="B21">21</abbr></abbrgrp> (not shown) uses keywords from InterPro <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> and GO <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, co-expression data, and also incorporates the <it>PROSPECTR </it>module <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> (shown on right). <b>(B)</b>: <it>Upper left </it>Gene clustering approaches associate a gene cluster with a phenotype via a group member. For example, Systems Biology approaches <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B24">24</abbr></abbrgrp> group genes whose protein products interact; and link them to a phenotype using a group-member gene associated with the phenotype. Systems Biology methods assume oligogenic diseases are associated with disruption in proteins that participate in a common complex or pathway <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Other gene clustering systems look for enrichment of keywords or domains associated with particular phenotypes and suggest candidate genes with similar properties. These systems are based on the principle that candidate genes have similar functions to disease genes already determined <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. <it>Upper right </it>Phenotype clustering approaches such as that of Freudenberg &amp; Propping <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> group related phenotypes into superphenotypes. <it>Lower left </it>Most of the Machine Learning approaches do not use phenotype information and are based on the concept that the genome consists of a bipartite distribution of genes: those which cause diseases, and those that do not. By analysing these two gene sets with respect to discriminating variables, a profile for "non-disease genes" and "disease genes" is produced which enables training of a classifier. A novel gene submitted to the classifier is flagged as either "disease-causing" or "non-disease causing". Systems include <it>eVOC </it><abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, <it>PROSPECTR </it><abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, <it>SUSPECTS </it><abbrgrp><abbr bid="B21">21</abbr></abbrgrp> and <it>DGP </it><abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Finally <it>G2D</it>, <it>lower right</it>, is a transitive method that maps phenotypes to genes <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> by interfacing literature- and keyword-based ontologies.</p>
            </text>
            <graphic file="1471-2105-10-S1-S69-1"/>
         </fig>
         <p>A recent effort by Tiffin and colleagues <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> to identify candidate disease genes for the complex disease type II diabetes (T2D) and the related obesity trait predicted 12 genes in previously implicated chromosomal regions. The study also allowed a limited comparison of seven candidate gene prediction systems. Since that time two genome-wide association studies (GWAs) on T2D undertaken by the Wellcome Trust Case Control Consortium (WTCCC) and the Genetics Replication and Meta-analysis Consortium (DIAGRAM) have been published <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. GWAs are a powerful tool for identifying genetic variants linked to complex diseases because they are more sensitive than linkage studies to small to moderate effect size contributions from polygenic and oligogenic diseases. The data from these GWAs allow the assessment of the predictions made by Tiffin <it>et al</it>., as well as evaluation of the effectiveness of predictions made by the individual automated candidate gene prediction systems used in their study and our system, <it>Gentrepid </it><abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. We assessed the candidate gene predictions systems' ability to select robustly supported genes from the GWAs and used them to filter noisy data from statistically less well supported genes to select favoured candidates.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Predictions</p>
            </st>
            <p>All methods were given the starting set of 9556 genes mapped to chromosomal intervals implicated in T2D as assessed by Tiffin <it>et al</it>., except for <it>POCUS </it>which was run against a search space of 562 genes. The <it>POCUS </it>method was confined to the smaller search space because "poorly defined susceptibility regions or regions with questionable association with the disease are obscured by background noise" <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The number of candidate gene predictions made by the eight methods varied from two to 3093. <it>POCUS </it>generated the smallest number of candidates but neither of the two predictions matched genes in either the highly significant (HS) or medium-to-highly significant (MHWD) data sets. Other candidate gene prediction methods made considerably more predictions. The largest numbers of predictions were made by <it>G2D </it>(3,093 candidate gene predictions) and <it>eVOC </it>(2,496 predictions). These comprise almost one third and one quarter of the search space respectively. Thus neither of these methods prune the search space particularly well. Excluding <it>POCUS</it>, the least number of predictions was made by <it>Gentrepid </it>comprising 502 genes in known-disease-gene mode.</p>
         </sec>
         <sec>
            <st>
               <p>Accuracy of predictions</p>
            </st>
            <p>To assess the accuracy of the predictions, all eight systems were compared with genes found in previously-implicated intervals strongly linked to T2D by the GWAs. Figure <figr fid="F2">2</figr> shows the comparative performance of seven of these methods in selecting the 11 genes in the HS GWA data set. Several metrics were calculated to assess accuracy. No metrics were calculated for <it>POCUS </it>as neither of its two predictions matched genes in either the HS or MHWD data sets.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Comparison of methods against the HS (left) and MHWD (right) T2D gene data sets</p>
               </caption>
               <text>
                  <p><b>Comparison of methods against the HS (left) and MHWD (right) T2D gene data sets</b>. Top: Relative Enrichment Ratios. Bottom: Comparisons based on Sensitivity and Specificity.</p>
               </text>
               <graphic file="1471-2105-10-S1-S69-2"/>
            </fig>
            <p>The Enrichment Ratio is a general measure of the system's ability to accurately prune the search space. Enrichment Ratios ranged from 1 to 5 for the seven remaining prediction systems. The highest Enrichments Ratios were obtained by <it>Gentrepid </it>and <it>GeneSeeker</it>. These results were robust when the upper and lower (not shown) 95% confidence interval limits were taken into account. The lowest Enrichment Ratios were associated with the Machine Learning methods. This is not surprising, as the classifiers are trained to distinguish "disease genes" from "non-disease genes" and are ignorant of any concept of phenotype. The Specificity of a system measures its ability to reject genes not associated with the phenotype. Specificity scores among all seven methods ranged from 0.68 to 0.99, with a median of 0.92. As a group, the Machine Learning methods were poorer at rejection. <it>G2D </it>also performed poorly on this metric, but this result is slightly misleading because it does not take into account <it>G2D</it>'s prioritization method which will be discussed later.</p>
            <p>The Sensitivity is a measure of a system's ability to find the disease genes in the search space. A caveat here is not all of the GWA predictions are currently confirmed. <it>G2D </it>is by far the standout performer in Sensitivity, with <it>eVOC </it>ranked second. However, as can be seen from the other metrics, this result is obtained at the expense of Specificity for both systems. <it>Gentrepid's </it>Sensitivity is on par with most of the Machine Learning methods but with higher Specificity. The high Specificity reflects the high quality of the data in the underlying databases. The lower Sensitivity is due to incompleteness of these databases with respect to all human genes.</p>
            <p>Figure <figr fid="F2">2</figr> shows the comparative performance of methods when assessed against the 61 T2D associated genes with moderate to strong SNP signals (MHWD) in the Tiffin chromosomal intervals. The MHWD data set is not as statistically well supported as the HS set, and would be expected to contain some genes associated with T2D and others that are false positives. Perhaps the most interesting metric to look at here is the Sensitivity which should fall compared to the values for the HS set because of the lower signal to noise ratio in the MHWD set. All the systems except one, <it>SUSPECTS</it>, passed this negative test. More importantly, application of the systems to this noisy genetic data allows selection of a subset of candidates on the basis of molecular data (see below).</p>
            <p>The results shown for <it>Gentrepid </it>in Figure <figr fid="F2">2</figr> are for the known-disease-gene mode. In <it>ab initio </it>mode, <it>Gentrepid</it>'s CPS method identified 506 pathways containing a total of 1980 candidate gene predictions. This resulted in Enrichment Ratios of 3.3 and 2.1 when the HS and MHWD full gene sets were considered (Table <tblr tid="T2">2</tblr>).</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Gentrepid <it>ab initio </it>results</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c ca="left">
                        <p>Predictions</p>
                     </c>
                     <c ca="left">
                        <p>Reference list</p>
                     </c>
                     <c ca="right">
                        <p>ER</p>
                     </c>
                     <c ca="right">
                        <p>L95%</p>
                     </c>
                     <c ca="right">
                        <p>U95%</p>
                     </c>
                     <c ca="right">
                        <p>S</p>
                     </c>
                     <c ca="right">
                        <p>L95%</p>
                     </c>
                     <c ca="right">
                        <p>U95%</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CPS rank 8+ pathways</p>
                     </c>
                     <c ca="left">
                        <p>HS</p>
                     </c>
                     <c ca="right">
                        <p>3.3</p>
                     </c>
                     <c ca="right">
                        <p>1.1</p>
                     </c>
                     <c ca="right">
                        <p>9.4</p>
                     </c>
                     <c ca="right">
                        <p>0.45</p>
                     </c>
                     <c ca="right">
                        <p>0.21</p>
                     </c>
                     <c ca="right">
                        <p>0.72</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CPS rank 8+ pathways</p>
                     </c>
                     <c ca="left">
                        <p>HS &#8211; annotated</p>
                     </c>
                     <c ca="right">
                        <p>7.2</p>
                     </c>
                     <c ca="right">
                        <p>2.1</p>
                     </c>
                     <c ca="right">
                        <p>25</p>
                     </c>
                     <c ca="right">
                        <p>1.00</p>
                     </c>
                     <c ca="right">
                        <p>0.57</p>
                     </c>
                     <c ca="right">
                        <p>1.00</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CPS rank 8+ pathways</p>
                     </c>
                     <c ca="left">
                        <p>MHWD</p>
                     </c>
                     <c ca="right">
                        <p>2.1</p>
                     </c>
                     <c ca="right">
                        <p>1.3</p>
                     </c>
                     <c ca="right">
                        <p>3.6</p>
                     </c>
                     <c ca="right">
                        <p>0.30</p>
                     </c>
                     <c ca="right">
                        <p>0.20</p>
                     </c>
                     <c ca="right">
                        <p>0.42</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CPS rank 8+ pathways</p>
                     </c>
                     <c ca="left">
                        <p>MHWD &#8211; annotated</p>
                     </c>
                     <c ca="right">
                        <p>6.8</p>
                     </c>
                     <c ca="right">
                        <p>3.6</p>
                     </c>
                     <c ca="right">
                        <p>13</p>
                     </c>
                     <c ca="right">
                        <p>0.95</p>
                     </c>
                     <c ca="right">
                        <p>0.75</p>
                     </c>
                     <c ca="right">
                        <p>0.99</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CPS interactions top 50%</p>
                     </c>
                     <c ca="left">
                        <p>HS</p>
                     </c>
                     <c ca="right">
                        <p>4.1</p>
                     </c>
                     <c ca="right">
                        <p>1.2</p>
                     </c>
                     <c ca="right">
                        <p>15</p>
                     </c>
                     <c ca="right">
                        <p>0.27</p>
                     </c>
                     <c ca="right">
                        <p>0.10</p>
                     </c>
                     <c ca="right">
                        <p>0.57</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CPS interactions top 50%</p>
                     </c>
                     <c ca="left">
                        <p>HS &#8211; annotated</p>
                     </c>
                     <c ca="right">
                        <p>9.0</p>
                     </c>
                     <c ca="right">
                        <p>2.2</p>
                     </c>
                     <c ca="right">
                        <p>37</p>
                     </c>
                     <c ca="right">
                        <p>0.60</p>
                     </c>
                     <c ca="right">
                        <p>0.23</p>
                     </c>
                     <c ca="right">
                        <p>0.88</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CPS interactions top 50%</p>
                     </c>
                     <c ca="left">
                        <p>MHWD</p>
                     </c>
                     <c ca="right">
                        <p>1.7</p>
                     </c>
                     <c ca="right">
                        <p>0.79</p>
                     </c>
                     <c ca="right">
                        <p>3.8</p>
                     </c>
                     <c ca="right">
                        <p>0.11</p>
                     </c>
                     <c ca="right">
                        <p>0.06</p>
                     </c>
                     <c ca="right">
                        <p>0.22</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CPS interactions top 50%</p>
                     </c>
                     <c ca="left">
                        <p>MHWD &#8211; annotated</p>
                     </c>
                     <c ca="right">
                        <p>8.1</p>
                     </c>
                     <c ca="right">
                        <p>3.2</p>
                     </c>
                     <c ca="right">
                        <p>20</p>
                     </c>
                     <c ca="right">
                        <p>0.54</p>
                     </c>
                     <c ca="right">
                        <p>0.29</p>
                     </c>
                     <c ca="right">
                        <p>0.77</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CMP top 10%</p>
                     </c>
                     <c ca="left">
                        <p>HS</p>
                     </c>
                     <c ca="right">
                        <p>2.2</p>
                     </c>
                     <c ca="right">
                        <p>0.3</p>
                     </c>
                     <c ca="right">
                        <p>17</p>
                     </c>
                     <c ca="right">
                        <p>0.1</p>
                     </c>
                     <c ca="right">
                        <p>0.02</p>
                     </c>
                     <c ca="right">
                        <p>0.38</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CMP top 10%</p>
                     </c>
                     <c ca="left">
                        <p>MHWD</p>
                     </c>
                     <c ca="right">
                        <p>2.0</p>
                     </c>
                     <c ca="right">
                        <p>0.8</p>
                     </c>
                     <c ca="right">
                        <p>4.8</p>
                     </c>
                     <c ca="right">
                        <p>0.1</p>
                     </c>
                     <c ca="right">
                        <p>0.08</p>
                     </c>
                     <c ca="right">
                        <p>0.18</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Abbreviations in Table: ER &#8211; Enrichment Ratio, L95% &#8211; Lower 95% confidence limit, U95% &#8211; Upper 95% confidence limit, S &#8211; Sensitivity</p>
               </tblfn>
            </tbl>
            <p>In <it>ab initio </it>mode, the CMP method generated 527 predictions by limiting the selection to the top 10% most probable genes. This resulted in correct prediction of one gene from the HS set and five from the MHWD set, yielding a Enrichment Ratio of 2.2 when applied to the HS and 2.0 for the MHWD gene data sets. </p>
            <p>It is also interesting to note the effect of lack of annotation on these results. Only five of 11 genes in the HS dataset, and 19 of 61 genes in the MHWD set contained KEGG or BioCarta pathway annotations.When we included only genes containing pathway information from the gene datasets (designated 'annotated' in Table <tblr tid="T2">2</tblr>) we observed Enrichment Ratios of 7.2 against the HS and 6.8 against the MHWD pathway-annotated sets. Sensitivities also improved by a factor of 2 for the HS dataset. By extrapolation, if all genes were pathway annotated, we could expect approximately two- to three-fold improvement in Enrichment and Sensitivity scores.</p>
         </sec>
         <sec>
            <st>
               <p>Prioritization</p>
            </st>
            <p>Although the metrics discussed provide useful measures of a candidate gene prediction system's performance, another criterion of importance to geneticists is the system's ability to prioritize predictions. Although several methods claim the ability to prioritize (Table <tblr tid="T1">1</tblr>), only <it>G2D </it>provided prioritized predictions in Tiffin <it>et al</it>. <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Hence only <it>G2D </it>and <it>Gentrepid </it>will be discussed here. Because <it>Gentrepid </it>only made 502 predictions <it>in toto</it>, we took the top 502 predictions made by <it>G2D </it>and recalculated the Enrichment Ratio, Sensitivity and Specificity for this restricted set of favoured predictions. ER and Specificity significantly improved to 3.1 and 0.95 such that <it>G2D </it>surpassed <it>Gentrepid</it>'s gross ER and almost equalled <it>Gentrepid </it>in Specificity. The improvement in these two metrics came at the expense of Sensitivity which was reduced to 0.16, but the <it>G2D </it>system still managed to maintain its lead on this metric.</p>
            <p>In <it>G2D</it>'s prioritization system, a GO-metric is calculated for each gene in the search space based on how well its GO profile fits the GO profile of the disease genes inferred from MeSH terms. An R-score is calculated for each gene by normalizing against the number of genes in the genome with better GO-metrics for the phenotype. Genes with R-scores closer to zero are better fits to the phenotype.</p>
            <p><it>Gentrepid </it>CPS ranks genes by the number of loci in the search space involved in a particular pathway. In <it>ab initio </it>mode, of the 53 intervals searched, the top pathway, focal adhesion, was represented in 35 of these. All five of the HS dataset genes were represented by pathways found in at least eight intervals. Pathways implicated in at least eight intervals constituted the top 40% of the 506 pathways containing 1749 candidates. In these pathways, <it>Gentrepid </it>identified all five pathway annotated genes from the HS dataset, and 18 of the 19 pathway annotated genes from the MHWD gene data set. Other figures for CMP are given in Table <tblr tid="T2">2</tblr>.</p>
            <p>CMP <it>ab initio </it>looks for protein domains enriched in the search space compared to the genome by taking a census of domains in the search space and the genome. Two expectation values are calculated to estimate the frequency of occurrence of genes with domains of interest based on a random combination of these domains <it>e</it><sub><it>a </it></sub>and the rarest domain <it>e</it><sub><it>b </it></sub><abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Figure <figr fid="F3">3</figr> shows the data ranked on a <it>&#967;</it><sup>2 </sup>test based on e<sub>a </sub>was most effective in prioritizing the HS data. This reflects our experience of phenotypes with genotypes encoded by multi-domain proteins, as would be expected for diseases associated with signaling. For metabolic diseases associated with single-domain proteins, <it>e</it><sub><it>b </it></sub>may be a better measure.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>MHWD dataset filtered against prioritized automatic candidate gene predictions</p>
               </caption>
               <text>
                  <p><b>MHWD dataset filtered against prioritized automatic candidate gene predictions</b>. Genes in bold are robustly supported genes from the GWA studies (HS set).</p>
               </text>
               <graphic file="1471-2105-10-S1-S69-3"/>
            </fig>
            <p>Although the <it>G2D </it>prioritization system appears more sensitive than the coarse-grained prioritization of <it>Gentrepid</it>, the performance of both systems was roughly equivalent against the HS set. Both systems were moderately successful in prioritizing the HS data. For example, of the seven genes in the HS dataset predicted by <it>G2D</it>, four were ranked in the top 15% by <it>G2D</it>'s prioritization method (bold in Figure <figr fid="F3">3</figr>). Significant work needs to be done to improve the prioritization schemes of both <it>G2D </it>and <it>Gentrepid</it>. </p>
            <p>Finally, we used the candidate gene predictions systems to filter the less statistically-well-supported MHWD data set (MHWD &#8211; HS): effectively adding more power to the GWA study. Prioritized predictions are the unbolded genes in Figure <figr fid="F3">3</figr>. Additional unprioritized predictions made for the MHWD dataset using the other candidate predictions systems are given as supplementary data in Additional file <supplr sid="S1">1</supplr>.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>Additional unprioritized MHWD matches</b>. Additional unprioritized MHWD matches data.</p>
               </text>
               <file name="1471-2105-10-S1-S69-S1.txt">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Candidate disease gene prediction is a rapidly moving area of bioinformatics research with the potential to deliver great benefits to human health. By assisting geneticists to use existing biological information to investigate disease loci obtained by linkage analysis and association studies, disease genes can be identified more rapidly. The need for good applications in the area of candidate gene prediction is becoming increasing important as the proliferation of SNP-based association studies produces valuable genetic information in need of analysis.</p>
         <p>The biggest problem facing candidate gene prediction today is the accuracy and completeness of the underlying databases. Failure to make a prediction is mostly due to incomplete data coverage. For example, 65% of human proteins have GO terms but only 25% of these are manually annotated. Systems drawing on GO terms like <it>G2D </it>are potentially able to make predictions for 65% of genes but only around one third of these are likely to be accurate. Systems Biology methods like <it>Gentrepid </it>CPS are reliant on pathway and protein-protein interaction data. One of the databases CPS draws on is OPHID <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, one of the most complete protein-protein interaction datasets, containing over 48 000 interactions. However these 48,000 interactions are estimated to be only 13% of the complete human interactome <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Completeness of the underlying data clearly impacts the Sensitivity of the <it>Gentrepid </it>CPS method. As time goes on this constraint will ease as these databases are further populated. In the meantime, we have shown that the use of independent biological data to make complementary candidate gene predictions is one way to ameliorate the problem of incomplete data coverage (see Figure <figr fid="F3">3</figr>) <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>In addition to the predictions made by the individual candidate gene prediction systems in Tiffin <it>et al</it>., a set of nine "winners" were chosen using a consensus approach <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. These nine candidate genes were independently predicted by six of the seven prediction systems studied. A larger consensus set, chosen by five of the seven methods, contained 94 genes <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. None of the genes in either of these consensus lists matched any of the genes in the HS and MHWD gene sets. Even if we compile a third tier of consensus genes from any four of the seven methods (269 genes) only one gene (VEGF) fell within the HS data set and only three genes (CHN2, B4GALT5, VEGF) matched the MHWD data set. Clearly the consensus approach is not working and it is easy to see why when the underlying databases are considered (Figure <figr fid="F1">1A</figr>). Candidate gene prediction systems that use an independent data set, not drawn upon by most of the other methods, will be penalized. Possibly the only benefit of a consensus approach is to give the user a false sense of accuracy when confronted with noisy data.</p>
         <p>Clearly much work still remains to improve the sensitivity and specificity of candidate gene prediction methods but some general conclusions are possible. Machine Learning methods were not as effective as other methods. Most of the Machine Learning approaches do not use phenotype information and are based on the concept that the genome consists of a bipartite distribution of genes: those which cause diseases, and those that do not. The evidence supporting this assumption is limited <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. We believe the concept that there is a difference between "disease genes" and "nondisease genes" is intrinsically flawed and no such Boolean classification exists. We hypothesize that the ability of these methods to predict disease genes in test sets is based on selection effects in the data: possibly rare, highly penetrant monogenic diseases, such as those involved in metabolic syndromes, are over-represented among known disease genes because they have been easier targets to identify. Although these systems were not as effective as the other candidate gene prediction systems, their performance was not greatly different. However, we believe that unlike systems which attempt to map genotype to phenotype, Machine Learning systems based on the disease gene/non-disease gene concept will not improve as more biological data becomes available.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Candidate gene prediction systems have typically been benchmarked on well characterized oligogenic phenotypes. GeneSeeker <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> produced a 10-fold enrichment using a data set consisting of eight diseases. <it>Gentrepid</it>'s combined methods <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> produced an Enrichment Ratio of 13 when 29 diseases with a total of 170 known disease genes were used. For 29 diseases with 163 genes, POCUS <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> reported Enrichment Ratios between 12 to 42-fold, depending on the size of the intervals in the search space. The PRIORITIZER <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> method yielded a 2.8-fold enrichment using a data set consisting of 96 heritable disorders. In summary, Enrichment Ratios of 3 to 13 have been reported in benchmarks, but a substantial part of the data used for these studies has been limited to oligogenic phenotypes, where several different genes may cause the disease, but a single mutation in each case or family has a large effect.</p>
         <p>Some doubts have been raised about the ability of systems to predict candidates for complex polygenic diseases such as T2D where multiple genes interact to create a permissive gene pool for disease genesis. The candidate gene prediction systems did prune the genome in favour of moderately to highly significant SNPs identified by the GWAs under semi-blind testing on a complex polygenic disease. Enrichment Ratios calculated in this study suggest that most of the oligogenic benchmarks have been reasonably good predictors of system performance.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Benchmark datasets</p>
            </st>
            <p>Eight candidate gene prediction systems were assessed on their ability to predict genes involved in T2D by comparison against genes implicated by recent GWAs. Two data sets of T2D-implicated genes were used as the benchmark: a <b>H</b>ighly <b>S</b>ignificant gene set (HS) of 21 genes and a <b>M</b>oderate to <b>H</b>ighly significant gene set derived from the <b>W</b>TCCC and <b>D</b>IAGRAM studies (MHWD) of 172 genes <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. The HS gene set contained 11 genes which mapped to the chromosomal regions investigated by Tiffin <it>et al. </it>(hereafter Tiffin intervals) <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Genes associated with 706 moderately significant SNPs with a frequentist additive p-value of &lt;0.001, good clustering and intact NCBI build 36 reference ids were taken from WTCCC T2D data <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. SNPs positioned between the 5' UTR and 3' UTR of a known gene structure, 1000 bases upstream of a 5' UTR or 1000 bases downstream of 3' UTR of a known gene were considered to implicate the gene in T2D disease susceptibility. This moderately significant list was combined with the genes from the HS data set to generate the MHWD dataset, yielding 172 genes genome wide of which 61 genes mapped to the Tiffin intervals <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Data sources for predictions</p>
            </st>
            <p>The search space available to all eight automated candidate gene prediction systems consisted of 9556 genes in 53 chromosomal loci assessed by Tiffin <it>et al</it>. to be involved in T2D by various linkage and association studies. We matched 96.5% of all Ensembl gene entries <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> provided to NCBI Entrez ids. All remaining genes were unable to be matched due either to the Ensembl entry having an unknown gene symbol label or because the entry was ambiguous or associated with a redundant gene symbol name entry. Ensembl entries and NCBI id matching was carried out at four levels, in order: approved symbol name, previous symbol names, Uniprot/SwissProt Accession and RefSeq Ids. Data conversion keys for matching between databases were acquired from BioMart <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
            <p>Predictions made by seven candidate gene prediction methods were also obtained from Tiffin <it>et al</it>. <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Nine disease-implicated genes were available to the systems as seeds (PPARG, GYS1, IRS1, INS, KCNJ11, ABCC8, SLC2A1, PPARGC1, CAPN10). Two of the genes &#8211; PPARG and KCNJ11, are implicated by the highly significant SNPs detected by GWAs but are not in the Tiffin intervals and are thus not included in the search space or benchmark set.</p>
         </sec>
         <sec>
            <st>
               <p>Candidate gene predictions</p>
            </st>
            <p>The candidate gene predictions for seven of the systems are detailed elsewhere <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Briefly, <it>GeneSeeker </it>selected genes from the search space using a Boolean expression based on 14 keywords selected by an expert user <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. <it>PROSPECTR</it>, <it>DGP </it>and <it>eVOC </it>are Boolean classifiers which require only the search space as input. <it>G2D </it>and <it>Gentrepid </it>in <it>ab initio </it>mode, also only require the search space. <it>POCUS </it>potentially only needs the search space as input, but this was restricted to the seven best supported intervals of the 53 available, as judged by the <it>POCUS </it>team. <it>SUSPECTS </it>and <it>Gentrepid</it>, in known-disease-gene mode, used the nine known disease genes associated with the phenotype as seeds. <it>SUSPECTS </it>additionally draws on predictions from the <it>PROSPECTR </it>Boolean classifier.</p>
            <p><it>Gentrepid </it>predictions are discussed in detail here for the first time. <it>Gentrepid </it>implements two different modules to derive predictions: CPS &#8211; a systems biology method; and CMP &#8211; a method that associates phenotypes with particular domains. CPS and CMP can be used in two input modes: using known disease genes as a seed or using only the search space (<it>ab initio </it>mode).</p>
            <p>In known-disease-gene input mode, CPS searches all pathway and interaction data in BioCarta <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, KEGG <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> and I2D (formerly OPHID) <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> to extract all genes associated with the disease gene, and then filters this list against implicated loci. Genes are ranked based on the total number of genes implicated in the pathway. For example, if two known disease gene seeds and three genes in the loci being investigated are found in the same pathway, the pathway is given a rank of five against the phenotype. CMP parses the protein sequences of the known disease genes associated with the phenotype into domains using the Pfam library of Hidden Markov Models (HMMs) <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and then retrieves any other genes with related domain content from the genome. A score between 0 and 1 is generated reflecting the candidate gene's similarity to a known disease gene <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. The same nine disease genes and 53 chromosomal cytogenetic bands were used by <it>Gentrepid </it>as per Tiffin <it>et al</it>..</p>
            <p>In <it>ab initio </it>mode, <it>Gentrepid </it>can make predictions in the absence of known disease genes if two or more loci are provided as input. <it>Gentrepid</it>'s CPS <it>ab initio </it>method is based on the premise that pathways whose genes are more prevalent within disease-implicated loci (chromosomal regions) compared to the entire genome have a higher probability of involvement in the pathoetiology of the disease phenotype of interest. Analogous to the known disease gene mode, pathways are ranked by the number of loci involved. The CMP <it>ab initio </it>method searches for enrichment of domains in the loci with respect to the genome and ranks genes based on the statistical significance of the domain enrichment (equations 2 and 4 in <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> where <it>mn </it>is replaced by &#931; &#8211; the total number of genes in the intervals examined).</p>
            <p>For each input mode, a final list of predictions is made by consolidating all predictions from both the CMP and CPS modules.</p>
         </sec>
         <sec>
            <st>
               <p>Metrics for comparisons</p>
            </st>
            <p>Systems were compared using three metrics: Enrichment Ratio, Sensitivity and Specificity. The Enrichment Ratio calculations were calculated as below <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>:</p>
            <p>
               <display-formula id="M1">
                  <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-10-S1-S69-i1">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>E</m:mi>
                           <m:mi>n</m:mi>
                           <m:mi>r</m:mi>
                           <m:mi>i</m:mi>
                           <m:mi>c</m:mi>
                           <m:mi>h</m:mi>
                           <m:mi>m</m:mi>
                           <m:mi>e</m:mi>
                           <m:mi>n</m:mi>
                           <m:mi>t</m:mi>
                           <m:mi>R</m:mi>
                           <m:mi>a</m:mi>
                           <m:mi>t</m:mi>
                           <m:mi>i</m:mi>
                           <m:mi>o</m:mi>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>T</m:mi>
                                 <m:mi>P</m:mi>
                                 <m:mo>/</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>T</m:mi>
                                 <m:mi>P</m:mi>
                                 <m:mo>+</m:mo>
                                 <m:mi>F</m:mi>
                                 <m:mi>P</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:mo>&#8721;</m:mo>
                                    <m:mrow>
                                       <m:mi>g</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mi>n</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:msub>
                                          <m:mi>s</m:mi>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mi>m</m:mi>
                                             <m:mi>p</m:mi>
                                             <m:mi>l</m:mi>
                                             <m:mi>i</m:mi>
                                             <m:mi>c</m:mi>
                                             <m:mi>a</m:mi>
                                             <m:mi>t</m:mi>
                                             <m:mi>e</m:mi>
                                             <m:mi>d</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mstyle>
                                 <m:mo>/</m:mo>
                                 <m:mstyle displaystyle="true">
                                    <m:mo>&#8721;</m:mo>
                                    <m:mrow>
                                       <m:mi>g</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mi>n</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:msub>
                                          <m:mi>s</m:mi>
                                          <m:mrow>
                                             <m:mi>a</m:mi>
                                             <m:mi>l</m:mi>
                                             <m:mi>l</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyrauKaemOBa4MaemOCaiNaemyAaKMaem4yamMaemiAaGMaemyBa0MaemyzauMaemOBa4MaemiDaqNaemOuaiLaemyyaeMaemiDaqNaemyAaKMaem4Ba8Maeyypa0tcfa4aaSaaaeaacqWGubavcqWGqbaucqGGVaWlcqGGOaakcqWGubavcqWGqbaucqGHRaWkcqWGgbGrcqWGqbaucqGGPaqkaeaadaaeabqaaiabdEgaNjabdwgaLjabd6gaUjabdwgaLjabdohaZnaaBaaabaGaemyAaKMaemyBa0MaemiCaaNaemiBaWMaemyAaKMaem4yamMaemyyaeMaemiDaqNaemyzauMaemizaqgabeaaaeqabeGaeyyeIuoacqGGVaWldaaeabqaaiabdEgaNjabdwgaLjabd6gaUjabdwgaLjabdohaZnaaBaaabaGaemyyaeMaemiBaWMaemiBaWgabeaaaeqabeGaeyyeIuoaaaaaaa@70B3@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>The denominator was obtained by dividing the number of T2D implicated genes by the total number of genes within all surveyed chromosomal regions.</p>
            <p>Sensitivity and Specificity were calculated as below:</p>
            <p>
               <display-formula id="M2">
                  <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-10-S1-S69-i2">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>S</m:mi>
                           <m:mi>e</m:mi>
                           <m:mi>n</m:mi>
                           <m:mi>s</m:mi>
                           <m:mi>i</m:mi>
                           <m:mi>t</m:mi>
                           <m:mi>i</m:mi>
                           <m:mi>v</m:mi>
                           <m:mi>i</m:mi>
                           <m:mi>t</m:mi>
                           <m:mi>y</m:mi>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>T</m:mi>
                                 <m:mi>P</m:mi>
                              </m:mrow>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>T</m:mi>
                                 <m:mi>P</m:mi>
                                 <m:mo>+</m:mo>
                                 <m:mi>F</m:mi>
                                 <m:mi>N</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4uamLaemyzauMaemOBa4Maem4CamNaemyAaKMaemiDaqNaemyAaKMaemODayNaemyAaKMaemiDaqNaemyEaKNaeyypa0tcfa4aaSaaaeaacqWGubavcqWGqbauaeaacqGGOaakcqWGubavcqWGqbaucqGHRaWkcqWGgbGrcqWGobGtcqGGPaqkaaaaaa@4682@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>
               <display-formula id="M3">
                  <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-10-S1-S69-i3">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>S</m:mi>
                           <m:mi>p</m:mi>
                           <m:mi>e</m:mi>
                           <m:mi>c</m:mi>
                           <m:mi>i</m:mi>
                           <m:mi>f</m:mi>
                           <m:mi>i</m:mi>
                           <m:mi>c</m:mi>
                           <m:mi>i</m:mi>
                           <m:mi>t</m:mi>
                           <m:mi>y</m:mi>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>T</m:mi>
                                 <m:mi>N</m:mi>
                              </m:mrow>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>T</m:mi>
                                 <m:mi>N</m:mi>
                                 <m:mo>+</m:mo>
                                 <m:mi>F</m:mi>
                                 <m:mi>P</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4uamLaemiCaaNaemyzauMaem4yamMaemyAaKMaemOzayMaemyAaKMaem4yamMaemyAaKMaemiDaqNaemyEaKNaeyypa0tcfa4aaSaaaeaacqWGubavcqWGobGtaeaacqGGOaakcqWGubavcqWGobGtcqGHRaWkcqWGgbGrcqWGqbaucqGGPaqkaaaaaa@4620@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>TP is the number of true positives, FP is the number of false positives, TN is the number of true negatives and FN is the number of false negatives. Sensitivity is the proportion of true positives among all disease genes in the chromosomal regions. Specificity is the proportion of true negatives among genes not associated with the disease in chromosomal regions. Confidence intervals were estimated using the method of Newcombe <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> implemented using the CIcalculator software <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>List of abbreviations used</p>
         </st>
         <p>T2D: Type II diabetes; WTCCC: Welcome Trust Case Control Consortium; DIAGRAM: Genetics Replication and Meta-analysis Consortium; SNP: Single nucleotide polymorphism; GWA: Genome wide association studies; HPRD: Human Protein Reference Database; BIND: Biomolecular Interaction Network Database; DGP: Disease Gene Prediction; PROSPECTR: PRiOrization by Sequence and Phylogenetic Extent of CandidaTe Regions; OMIM: Online Mendelian Inheritance in Man; OPHID: Online Predicted Human Interaction Database; KEGG: Kyoto Encyclopedia of Genes and Genomes; GO: Gene Ontology; MeSH: Medical Subject Headings; HS: Highly Significant gene set; MHWD: Moderate to Highly gene set derived from WTCCC and DIAGRAM studies; ER: Enrichment Ratio; S: Sensitivity; TP: true positives; FP: false positives; TN: true negatives; FN: false negatives.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>MW design, concept and oversight of study, and manuscript author. ET manuscript author, and data generation for study. JL implementation, construction and maintenance of database. SB additional data generation, figures and manuscript preparation. DF Genetics consultant. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors wish to acknowledge funding from the Ronald Geoffrey Arnott Foundation.</p>
            <p>This article has been published as part of <it>BMC Bioinformatics </it>Volume 10 Supplement 1, 2009: Proceedings of The Seventh Asia Pacific Bioinformatics Conference (APBC) 2009. The full contents of the supplement are available online at <url>http://www.biomedcentral.com/1471-2105/10?issue=S1</url></p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>A Polymorphic DNA Marker Genetically Linked to Huntingtons-Disease</p>
            </title>
            <aug>
               <au>
                  <snm>Gusella</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Wexler</snm>
                  <fnm>NS</fnm>
               </au>
               <au>
                  <snm>Conneally</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Naylor</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Tanzi</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Watkins</snm>
                  <fnm>PC</fnm>
               </au>
               <au>
                  <snm>Ottina</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wallace</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Sakaguchi</snm>
                  <fnm>AY</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Shoulson</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bonilla</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1983</pubdate>
            <volume>306</volume>
            <issue>5940</issue>
            <fpage>234</fpage>
            <lpage>238</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/306234a0</pubid>
                  <pubid idtype="pmpid">6316146</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders</p>
            </title>
            <aug>
               <au>
                  <snm>Hamosh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Amberger</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bocchini</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Valle</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>McKusick</snm>
                  <fnm>VA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>52</fpage>
            <lpage>55</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99152</pubid>
                  <pubid idtype="pmpid" link="fulltext">11752252</pubid>
                  <pubid idtype="doi">10.1093/nar/30.1.52</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>POCUS: mining genomic sequence annotation to predict disease genes</p>
            </title>
            <aug>
               <au>
                  <snm>Turner</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Clutterbuck</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Semple</snm>
                  <fnm>CAM</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>11</issue>
            <fpage>R75</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">329128</pubid>
                  <pubid idtype="pmpid" link="fulltext">14611661</pubid>
                  <pubid idtype="doi">10.1186/gb-2003-4-11-r75</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Analysis of protein sequence and interaction data for candidate disease gene prediction</p>
            </title>
            <aug>
               <au>
                  <snm>George</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>JY</fnm>
               </au>
               <au>
                  <snm>Feng</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Bryson-Richardson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Fatkin</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wouters</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <issue>19</issue>
            <fpage>e130</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1636487</pubid>
                  <pubid idtype="pmpid" link="fulltext">17020920</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl707</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes</p>
            </title>
            <aug>
               <au>
                  <snm>Franke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>van Bakel</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fokkens</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>de Jong</snm>
                  <fnm>ED</fnm>
               </au>
               <au>
                  <snm>Egmont-Petersen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wijmenga</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>American Journal of Human Genetics</source>
            <pubdate>2006</pubdate>
            <volume>78</volume>
            <issue>6</issue>
            <fpage>1011</fpage>
            <lpage>1025</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1474084</pubid>
                  <pubid idtype="pmpid" link="fulltext">16685651</pubid>
                  <pubid idtype="doi">10.1086/504300</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Genetics of complex diseases</p>
            </title>
            <aug>
               <au>
                  <snm>Motulsky</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>J Zhejiang Univ Sci B</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>2</issue>
            <fpage>167</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1363767</pubid>
                  <pubid idtype="pmpid" link="fulltext">16421979</pubid>
                  <pubid idtype="doi">10.1631/jzus.2006.B0167</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Computational disease gene identification: a concert of methods prioritizes type 2 diabetes and obesity candidate genes</p>
            </title>
            <aug>
               <au>
                  <snm>Tiffin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Adie</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>van Driel</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Oti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lopez-Bigas</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Perez-Iratxeta</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Andrade-Navarro</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Adeyemo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Patti</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Semple</snm>
                  <fnm>CAM</fnm>
               </au>
               <au>
                  <snm>Hide</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <issue>10</issue>
            <fpage>3067</fpage>
            <lpage>3081</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1475747</pubid>
                  <pubid idtype="pmpid" link="fulltext">16757574</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl381</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls</p>
            </title>
            <aug>
               <au>
                  <snm/>
                  <fnm>Wellcome Trust Case Control Consortium</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>447</volume>
            <issue>7145</issue>
            <fpage>661</fpage>
            <lpage>678</lpage>
            <url>http://dx.doi.org/10.1038/nature05911</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature05911</pubid>
                  <pubid idtype="pmpid" link="fulltext">17554300</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes</p>
            </title>
            <aug>
               <au>
                  <snm>Zeggini</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Saxena</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Voight</snm>
                  <fnm>BF</fnm>
               </au>
               <au>
                  <snm>Marchini</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>de Bakker</snm>
                  <fnm>PIW</fnm>
               </au>
               <au>
                  <snm>Abecasis</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Almgren</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Andersen</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ardlie</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Bostrom</snm>
                  <fnm>KB</fnm>
               </au>
               <au>
                  <snm>Bergman</snm>
                  <fnm>RN</fnm>
               </au>
               <au>
                  <snm>Bonnycastle</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Borch-Johnsen</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Burtt</snm>
                  <fnm>NP</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Chines</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Deodhar</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ding</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Doney</snm>
                  <fnm>ASF</fnm>
               </au>
               <au>
                  <snm>Duren</snm>
                  <fnm>WL</fnm>
               </au>
               <au>
                  <snm>Elliott</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Erdos</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Frayling</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Freathy</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Gianniny</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Grallert</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Grarup</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Groves</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Guiducci</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hansen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Herder</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hitman</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Hughes</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>Isomaa</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>AU</fnm>
               </au>
               <au>
                  <snm>Jorgensen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kong</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kubalanza</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kuruvilla</snm>
                  <fnm>FG</fnm>
               </au>
               <au>
                  <snm>Kuusisto</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Langenberg</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lango</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lauritzen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Lindgren</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Lyssenko</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Marvelle</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Meisinger</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Midthjell</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Mohlke</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Morken</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Narisu</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Nilsson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Owen</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Palmer</snm>
                  <fnm>CNA</fnm>
               </au>
               <au>
                  <snm>Payne</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Perry</snm>
                  <fnm>JRB</fnm>
               </au>
               <au>
                  <snm>Pettersen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Platou</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Prokopenko</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Qi</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Qin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Rayner</snm>
                  <fnm>NW</fnm>
               </au>
               <au>
                  <snm>Rees</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Roix</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Sandbaek</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shields</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sjogren</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Steinthorsdottir</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Stringham</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Swift</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Thorleifsson</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Thorsteinsdottir</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Timpson</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Tuomi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tuomilehto</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Weedon</snm>
                  <fnm>MN</fnm>
               </au>
               <au>
                  <snm>Willer</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Illig</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hveem</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>FB</fnm>
               </au>
               <au>
                  <snm>Laakso</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stefansson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Wareham</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Barroso</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Hattersley</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Groop</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>McCarthy</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>Boehnke</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Altshuler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2008</pubdate>
            <volume>40</volume>
            <issue>5</issue>
            <fpage>638</fpage>
            <lpage>645</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng.120</pubid>
                  <pubid idtype="pmpid" link="fulltext">18372903</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Online predicted human interaction database</p>
            </title>
            <aug>
               <au>
                  <snm>Brown</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Jurisica</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>9</issue>
            <fpage>2076</fpage>
            <lpage>2082</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti273</pubid>
                  <pubid idtype="pmpid" link="fulltext">15657099</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Consolidating the set of known human protein-protein interactions in preparation for large-scale mapping of the human interactome</p>
            </title>
            <aug>
               <au>
                  <snm>Ramani</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Bunescu</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Mooney</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>5</issue>
            <fpage>R40</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1175952</pubid>
                  <pubid idtype="pmpid" link="fulltext">15892868</pubid>
                  <pubid idtype="doi">10.1186/gb-2005-6-5-r40</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Speeding disease gene discovery by sequence based candidate prioritization</p>
            </title>
            <aug>
               <au>
                  <snm>Adie</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Porteous</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Pickard</snm>
                  <fnm>BS</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>55</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1274252</pubid>
                  <pubid idtype="pmpid" link="fulltext">15766383</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-6-55</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>GeneSeeker: extraction and integration of human disease-related information from web-based genetic databases</p>
            </title>
            <aug>
               <au>
                  <snm>van Driel</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Cuelenaere</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kemmeren</snm>
                  <fnm>PPCW</fnm>
               </au>
               <au>
                  <snm>Leunissen</snm>
                  <fnm>JAM</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>Vriend</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>Web Server issue</issue>
            <fpage>W758</fpage>
            <lpage>W761</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gki435</pubid>
                  <pubid idtype="pmpid" link="fulltext">15980578</pubid>
                  <pubid idtype="pmcid">1160196</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Ensembl 2006</p>
            </title>
            <aug>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Andrews</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Caccamo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Coates</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Cunningham</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Curwen</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Cutts</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Down</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fernandez-Suarez</snm>
                  <fnm>XM</fnm>
               </au>
               <au>
                  <snm>Flicek</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Graf</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hammond</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Herrero</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Jekosch</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kahari</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kasprzyk</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keefe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kokocinski</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kulesha</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>London</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Longden</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Melsopp</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Meidl</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Overduin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Proctor</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Prlic</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rae</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rios</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Redmond</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sealy</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Searle</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Severin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Slater</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Smedley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stabenau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stalker</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Trevanion</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ureta-Vidal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Vogel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Woodwark</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>TJP</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>D556</fpage>
            <lpage>D561</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347495</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381931</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj133</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>A generic system for fast and flexible access to biological data</p>
            </title>
            <aug>
               <au>
                  <snm>Kasprzyk</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keefe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Smedley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>London</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Spooner</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Melsopp</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hammond</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rocca-Serra</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Genome Research</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>160</fpage>
            <lpage>169</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">314293</pubid>
                  <pubid idtype="pmpid" link="fulltext">14707178</pubid>
                  <pubid idtype="doi">10.1101/gr.1645104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>BioCarta</p>
            </title>
            <url>http://www.biocarta.com</url>
         </bibl>
         <bibl id="B17">
            <title>
               <p>The KEGG resource for deciphering the genome</p>
            </title>
            <aug>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Okuno</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>D277</fpage>
            <lpage>D280</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308797</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681412</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh063</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The Pfam Protein Families Database</p>
            </title>
            <aug>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Cerruti</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Etwiller</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>ELL</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>276</fpage>
            <lpage>280</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99071</pubid>
                  <pubid idtype="pmpid" link="fulltext">11752314</pubid>
                  <pubid idtype="doi">10.1093/nar/30.1.276</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Improved confidence intervals for the difference between binomial proportions based on paired data</p>
            </title>
            <aug>
               <au>
                  <snm>Newcombe</snm>
                  <fnm>RG</fnm>
               </au>
            </aug>
            <source>Statistics in Medicine</source>
            <pubdate>1998</pubdate>
            <volume>17</volume>
            <issue>22</issue>
            <fpage>2635</fpage>
            <lpage>2650</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/(SICI)1097-0258(19981130)17:22&lt;2635::AID-SIM954&gt;3.0.CO;2-C</pubid>
                  <pubid idtype="pmpid" link="fulltext">9839354</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>CIcalculator software</p>
            </title>
            <note>http://www.pedro.fhs.usyd.edu.au/calculator.html</note>
         </bibl>
         <bibl id="B21">
            <title>
               <p>SUSPECTS: enabling fast and effective prioritization of positional candidates</p>
            </title>
            <aug>
               <au>
                  <snm>Adie</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Porteous</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Pickard</snm>
                  <fnm>BS</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>6</issue>
            <fpage>773</fpage>
            <lpage>774</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btk031</pubid>
                  <pubid idtype="pmpid" link="fulltext">16423925</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>New developments in the InterPro database</p>
            </title>
            <aug>
               <au>
                  <snm>Mulder</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Attwood</snm>
                  <fnm>TK</fnm>
               </au>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Binns</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Buillard</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Cerutti</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Copley</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Courcelle</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Das</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Daugherty</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Dibley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Finn</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fleischmann</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Gough</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Haft</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hulo</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hunter</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kahn</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kanapin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kejariwal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Labarga</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Langendijk-Genevaux</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Lonsdale</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Letunic</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Madera</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Maslen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>McAnulla</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>McDowall</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mistry</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mitchell</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Orchard</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Orengo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Petryszak</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Selengut</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Sigrist</snm>
                  <fnm>CJA</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Valentin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Yeats</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <issue>Database issue</issue>
            <fpage>D224</fpage>
            <lpage>D228</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkl841</pubid>
                  <pubid idtype="pmpid" link="fulltext">17202162</pubid>
                  <pubid idtype="pmcid">1899100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Gene Ontology: tool for the unification of biology</p>
            </title>
            <aug>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Dolinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dwight</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Eppig</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Issel-Tarver</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kasarskis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Matese</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Ringwald</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nature Genetics</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <fpage>25</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/75556</pubid>
                  <pubid idtype="pmpid" link="fulltext">10802651</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Predicting disease genes using protein-protein interactions</p>
            </title>
            <aug>
               <au>
                  <snm>Oti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>HG</fnm>
               </au>
            </aug>
            <source>Journal of Medical Genetics</source>
            <pubdate>2006</pubdate>
            <volume>43</volume>
            <issue>8</issue>
            <fpage>691</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1136/jmg.2006.041376</pubid>
                  <pubid idtype="pmpid" link="fulltext">16611749</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Beyond Mendel: An evolving view of human genetic disease transmission</p>
            </title>
            <aug>
               <au>
                  <snm>Badano</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Katsanis</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Nature Reviews Genetics</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <issue>10</issue>
            <fpage>779</fpage>
            <lpage>789</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg910</pubid>
                  <pubid idtype="pmpid" link="fulltext">12360236</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Human disease genes</p>
            </title>
            <aug>
               <au>
                  <snm>Jimenez-Sanchez</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Childs</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Valle</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <issue>6822</issue>
            <fpage>853</fpage>
            <lpage>855</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35057050</pubid>
                  <pubid idtype="pmpid" link="fulltext">11237009</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>A global view of pleiotropy and phenotypically derived gene function in yeast</p>
            </title>
            <aug>
               <au>
                  <snm>Dudley</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Janse</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Tanay</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shamir</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>Molecular Systems Biology</source>
            <pubdate>2005</pubdate>
            <fpage>2005.0001</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1681449</pubid>
                  <pubid idtype="pmpid" link="fulltext">16729036</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>High-dimensional and large-scale phenotyping of yeast mutants</p>
            </title>
            <aug>
               <au>
                  <snm>Ohya</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sese</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yukawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sano</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Nakatani</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Saito</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Saka</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fukuda</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ishihara</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Oka</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hirata</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ohtani</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sawai</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fraysse</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Latge</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Francois</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Aebi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Muramatsu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Araki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sonoike</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nogami</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Morishita</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proceedings of the National Academy of Sciences of the United States of America</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>52</issue>
            <fpage>19015</fpage>
            <lpage>19020</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1316885</pubid>
                  <pubid idtype="pmpid" link="fulltext">16365294</pubid>
                  <pubid idtype="doi">10.1073/pnas.0509436102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>A similarity-based method for genome-wide prediction of disease-relevant human genes</p>
            </title>
            <aug>
               <au>
                  <snm>Freudenberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Propping</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <fpage>S110</fpage>
            <lpage>S115</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">12385992</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Integration of text- and data-mining using ontologies successfully selects disease gene candidates</p>
            </title>
            <aug>
               <au>
                  <snm>Tiffin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kelso</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Powell</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bajic</snm>
                  <fnm>VB</fnm>
               </au>
               <au>
                  <snm>Hide</snm>
                  <fnm>WA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>5</issue>
            <fpage>1544</fpage>
            <lpage>1552</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1065256</pubid>
                  <pubid idtype="pmpid" link="fulltext">15767279</pubid>
                  <pubid idtype="doi">10.1093/nar/gki296</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Genome-wide identification of genes likely to be involved in human genetic disease</p>
            </title>
            <aug>
               <au>
                  <snm>Lopez-Bigas</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>10</issue>
            <fpage>3108</fpage>
            <lpage>3114</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">434425</pubid>
                  <pubid idtype="pmpid" link="fulltext">15181176</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh605</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Association of genes to genetically inherited diseases using data mining</p>
            </title>
            <aug>
               <au>
                  <snm>Perez-Iratxeta</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Andrade</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Nature Genetics</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <issue>3</issue>
            <fpage>316</fpage>
            <lpage>319</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">12006977</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>GeneSeeker web tool</p>
            </title>
            <url>http://www.cmbi.ru.nl/geneseeker</url>
         </bibl>
         <bibl id="B34">
            <title>
               <p>The Biomolecular Interaction Network Database and related tools 2005 update</p>
            </title>
            <aug>
               <au>
                  <snm>Alfarano</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Andrade</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Anthony</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Bahroos</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bajec</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bantoft</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Betel</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bobechko</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Boutilier</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Burgess</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Buzadzija</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Cavero</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>D'Abreo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Donaldson</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Dorairajoo</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Dumontier</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Dumontier</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Earles</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Farrall</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Feldman</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Garderman</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Gong</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gonzaga</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Grytsan</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Gryz</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Gu</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Haldorsen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Halupa</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Haw</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hrvojic</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hurrell</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Isserlin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Jack</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Juma</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Khan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kon</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Konopinsky</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Le</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ling</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Magidin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Moniakis</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Montojo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Muskat</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ng</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Paraiso</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Pintilie</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pirone</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Salama</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Sgro</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shan</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Shu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Siew</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Skinner</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Stasiuk</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Strumpf</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Tuekam</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Tao</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Willis</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wolting</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wrong</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Xin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yates</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Pawson</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ouellette</snm>
                  <fnm>BFF</fnm>
               </au>
               <au>
                  <snm>Hogue</snm>
                  <fnm>CWV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D418</fpage>
            <lpage>D424</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540005</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608229</pubid>
                  <pubid idtype="doi">10.1093/nar/gki051</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Reactome: a knowledgebase of biological pathways</p>
            </title>
            <aug>
               <au>
                  <snm>Joshi-Tope</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gillespie</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Vastrik</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>D'Eustachio</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>de Bono</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Jassal</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gopinath</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Matthews</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Stein</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D428</fpage>
            <lpage>D432</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540026</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608231</pubid>
                  <pubid idtype="doi">10.1093/nar/gki072</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Development of human protein reference database as an initial platform for approaching systems biology in humans</p>
            </title>
            <aug>
               <au>
                  <snm>Peri</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Navarro</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Amanchy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kristiansen</snm>
                  <fnm>TZ</fnm>
               </au>
               <au>
                  <snm>Jonnalagadda</snm>
                  <fnm>CK</fnm>
               </au>
               <au>
                  <snm>Surendranath</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Niranjan</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Muthusamy</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gandhi</snm>
                  <fnm>TKB</fnm>
               </au>
               <au>
                  <snm>Gronborg</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ibarrola</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Deshpande</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Shanker</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shivashankar</snm>
                  <fnm>HN</fnm>
               </au>
               <au>
                  <snm>Rashmi</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Ramya</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>ZX</fnm>
               </au>
               <au>
                  <snm>Chandrika</snm>
                  <fnm>KN</fnm>
               </au>
               <au>
                  <snm>Padma</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Harsha</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Yatish</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Kavitha</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Menezes</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Choudhury</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Suresh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Saravana</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chandran</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Krishna</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Joy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Anand</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Madavan</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Joseph</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>GW</fnm>
               </au>
               <au>
                  <snm>Schiemann</snm>
                  <fnm>WP</fnm>
               </au>
               <au>
                  <snm>Constantinescu</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Khosravi-Far</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Steen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tewari</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ghaffari</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Blobe</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Dang</snm>
                  <fnm>CV</fnm>
               </au>
               <au>
                  <snm>Garcia</snm>
                  <fnm>JGN</fnm>
               </au>
               <au>
                  <snm>Pevsner</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>ON</fnm>
               </au>
               <au>
                  <snm>Roepstorff</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Deshpande</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Chinnaiyan</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Hamosh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Chakravarti</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pandey</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genome Research</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>10</issue>
            <fpage>2363</fpage>
            <lpage>2371</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403728</pubid>
                  <pubid idtype="pmpid" link="fulltext">14525934</pubid>
                  <pubid idtype="doi">10.1101/gr.1680803</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>A comprehensive two-hybrid analysis to explore the yeast protein interactome</p>
            </title>
            <aug>
               <au>
                  <snm>Ito</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Chiba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ozawa</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yoshida</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sakaki</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Proceedings of the National Academy of Sciences of the United States of America</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <issue>8</issue>
            <fpage>4569</fpage>
            <lpage>4574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">31875</pubid>
                  <pubid idtype="pmpid" link="fulltext">11283351</pubid>
                  <pubid idtype="doi">10.1073/pnas.061034498</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Uetz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Giot</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Cagney</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mansfield</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Judson</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Knight</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Lockshon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Narayan</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Srinivasan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pochart</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Qureshi-Emili</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Godwin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Conover</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kalbfleisch</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Vijayadamodar</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fields</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rothberg</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <issue>6770</issue>
            <fpage>623</fpage>
            <lpage>627</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35001009</pubid>
                  <pubid idtype="pmpid" link="fulltext">10688190</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Functional organization of the yeast proteome by systematic analysis of protein complexes</p>
            </title>
            <aug>
               <au>
                  <snm>Gavin</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Bosche</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Krause</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Grandi</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Marzioch</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bauer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schultz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rick</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Michon</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Cruciat</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Remor</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hofert</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schelder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brajenovic</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ruffner</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Merino</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Klein</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hudak</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dickson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rudi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Gnau</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Bauch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bastuck</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Huhse</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Leutwein</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Heurtier</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Copley</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Edelmann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Querfurth</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rybin</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Drewes</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Raida</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bouwmeester</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Seraphin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kuster</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Neubauer</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Superti-Furga</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>415</volume>
            <issue>6868</issue>
            <fpage>141</fpage>
            <lpage>147</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/415141a</pubid>
                  <pubid idtype="pmpid" link="fulltext">11805826</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Systematic identification of protein complexes in <it>Saccharomyces cerevisiae </it>by mass spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Ho</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gruhler</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Heilbut</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bader</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Millar</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bennett</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Boutilier</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>LY</fnm>
               </au>
               <au>
                  <snm>Wolting</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Donaldson</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Schandorff</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shewnarane</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Taggart</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Goudreault</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Muskat</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Alfarano</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Dewar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Michalickova</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Willems</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Sassi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Rasmussen</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Andersen</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Johansen</snm>
                  <fnm>LE</fnm>
               </au>
               <au>
                  <snm>Hansen</snm>
                  <fnm>LH</fnm>
               </au>
               <au>
                  <snm>Jespersen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Podtelejnikov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Crawford</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Poulsen</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Sorensen</snm>
                  <fnm>BD</fnm>
               </au>
               <au>
                  <snm>Matthiesen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hendrickson</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Gleeson</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Pawson</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Moran</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Durocher</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Mann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hogue</snm>
                  <fnm>CWV</fnm>
               </au>
               <au>
                  <snm>Figeys</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Tyers</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>415</volume>
            <issue>6868</issue>
            <fpage>180</fpage>
            <lpage>183</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/415180a</pubid>
                  <pubid idtype="pmpid" link="fulltext">11805837</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>DGP web tool</p>
            </title>
            <url>http://cgg.ebi.ac.uk/services/dgp</url>
         </bibl>
      </refgrp>
   </bm>
</art>