<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2008-9-s1-s1</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Introduction</dochead>
      <bibl>
         <title>
            <p>A race through the maze of genomic evidence</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Hughes</snm>
               <mi>R</mi>
               <fnm>Timothy</fnm>
               <insr iid="I1"/>
               <email>t.hughes@utoronto.ca</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Roth</snm>
               <mi>P</mi>
               <fnm>Frederick</fnm>
               <insr iid="I2"/>
               <email>fritz_roth@hms.harvard.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Donnelly Centre for Cellular and Biomolecular Research and Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario M5S3E1, Canada</p>
            </ins>
            <ins id="I2">
               <p>Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School and Center for Cancer Systems Biology, Dana-Farber Cancer Institute, Boston, Massachusetts 02115, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <supplement>
            <title>
               <p>Quantitative inference of gene function from diverse large-scale datasets</p>
            </title>
            <editor>Timothy R Hughes and Frederick P Roth </editor>
            <note>Research</note>
         </supplement>
         <issn>1465-6906</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>Suppl 1</issue>
         <fpage>S1</fpage>
         <url>http://genomebiology.com/2008/9/S1/S1</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18613945</pubid>
               <pubid idtype="doi">10.1186/gb-2008-9-s1-s1</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>27</day>
               <month>6</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Hughes and Roth; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
   </fm>
   <bdy>
      <sec>
         <st>
            <p/>
         </st>
         <p>One of the most surprising aspects of the completed human and mouse genome sequences <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp> has been the relatively small number of protein-coding genes. The current estimate of &lt;24,000 protein-coding genes in human and mouse is only four times that of budding yeast <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. A complete encyclopedia of biochemical, cellular, and physiological gene functions is now an immediate rather than a long-term goal.</p>
         <p>The unifying theme for papers in this supplement to <it>Genome Biology </it>is the automated inference of molecular function of gene products, and of their membership within cellular components and biological processes. In each study, thousands of variables describing genes and gene-gene relationships have been integrated using machine learning methods to infer Gene Ontology (GO) terms for essentially all genes in the studied genome.</p>
         <p>Systematic biochemical and genetic experimentation in simpler model organisms has contributed to the rapid increase in the proportion of characterized genes. For example, about 80% of yeast genes have some annotated function <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Beyond simpler model systems such as yeast, however, cost and time requirements may preclude many systematic analyses, for example, resource-intensive phenotype assays in adult animals. Fortunately, efforts in simpler model organisms have illustrated that gene functions can be inferred on the basis of data types that are easier to collect systematically. Protein sequence features, expression patterns, and protein-protein interactions, for example, can provide powerful clues to function. This raises the prospect of directing resource-intensive experimentation toward the genes most likely to yield positive results. In yeast, this concept is well established, and the tradeoffs between performance measures, the efficacy of combinations of different data types in making different types of predictions, and the applicability of diverse inference algorithms are topics of active research.</p>
         <p>Large scale experimentation in mammals is coming of age. A wide variety of mRNA expression analysis experiments are available in public data repositories, for example, the Gene Expression Omnibus <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The majority of human genes have at least one moveable open reading frame clone <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>, enabling expression studies <it>in vitro </it>and in model systems. 'Knockdown' reagents targeting most mouse and human genes are now available <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, facilitating analysis of biochemical and cellular gene functions. Efforts are underway to create a mutant allele of each mouse gene <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, which will enable analysis of physiological and developmental roles.</p>
         <p>The first paper in this supplemental issue <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> describes the 'MouseFunc' challenge, in which nine bioinformatics teams independently predicted mouse GO terms. Importantly, each used a common collection of training data and common benchmarks, which allowed comparison among the inference methods, data sets, and categories of gene functions. Predictions were tested using cross-validation (annotation for a subset of genes was hidden from the participants). Predictions were further tested by two forms of prospective evaluation: first, using GO annotations that had been added to the database since the inception of the study; and second, literature related to top-scoring novel predictions was investigated intensively by experienced mouse biologists.</p>
         <p>Each of the companion papers in this issue is connected to the MouseFunc challenge, either in the nature of the algorithms used, in the datasets employed, or both. Guan and coworkers (led by Olga Troyanska) apply a support vector machine approach to predict mouse gene function <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. They go on to apply this approach to the more tractable model eukaryote <it>Saccharomyces cerevisiae </it>and to test specific predictions experimentally. Mostafavi and colleagues (led by Quaid Morris) apply a ridge regression approach to predict mouse and yeast gene function <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The approach is quite fast, permitting their 'GeneMania' software to perform predictions 'on the fly' with a training set provided by the user. Kim and coworkers (led by Edward Marcotte) infer mouse gene function both directly and via a functional linkage network <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Functional linkage graphs contain connections between genes weighted by confidence that are functionally related <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Obozinski and colleagues (led by William Noble) investigate the possibility of inconsistency between predictions of different functions <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. For example, it is possible for some approaches to assign a higher prediction score to 'DNA helicase activity' than to its logical parent term 'helicase activity'. They show that 'reconciliation' methods that enforce consistency between different GO term predictions can improve performance. Tian and coworkers <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and Tasan and colleagues <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, both teams led by one of us (FPR), each combine guilt-by-profiling and guilt-by-association approaches to make predictions. Tian and coworkers describe the methodology and apply it to predict <it>S. cerevisiae </it>gene functions, while Tasan and colleagues apply the methodology to predict both functions and phenotypes for mouse genes.</p>
         <p>Many other quantitative fields have benefitted by standardization of training and test sets. For example, the Critical Assessment of Techniques for Protein Structure Prediction (CASP) challenge <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> has made rigorous comparisons among protein structure predictions. This special issue suggests the value of similar standardization in the arena of function prediction.</p>
         <p>Importantly, inferences about function and phenotype made in this issue are not black or white, but rather are expressed in shades of gray. Biology will long remain in the 'working model' phase, in which each statement about a gene's role must be accompanied by some uncertainty. An honest assessment of our uncertainties could allow us to direct resources efficiently to those experiments most likely to resolve these uncertainties. Quantitative predictions allow individual users requiring highly stringent predictions to impose a high prediction score threshold, while users may lower their threshold and include additional false positives if they wish to cast a wide net and catch more true positives.</p>
         <p>The approaches taken in this issue have common limitations. To reduce the scope of the computational problem and eliminate the potential for inflated performance estimates due to circular reasoning, participants did not have access to GO annotations from other species. Although the training data did incorporate many previous transfers of annotation from other species by orthology, these methods could also benefit from a similar standardization and benchmarking strategy.</p>
         <p>We also note that identifying the best strategies does not always help us to understand why the best strategies worked well. Because of the computationally intensive nature of function prediction, only a limited number of variant approaches were evaluated. A full factorial analysis of variations on the most successful strategies will help provide this understanding and allow future optimization.</p>
         <p>The high precision of top predictions for many GO terms illustrates the richness and value of data sources that have become available for mammals over recent years. However, one lesson learned is that it is difficult to achieve both high precision and high recall. Currently, no algorithms achieve both for most functional categories. Improvements in either the inference methods, the problem setup, or in the information content of the data sets themselves will be needed in order to make a major dent in the more than 10,000 currently uncharacterized mouse and human genes.</p>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>GO, Gene Ontology.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This article has been published as part of <it>Genome Biology </it>Volume 9 Supplement 1, 2008: Quantitative inference of gene function from diverse large-scale datasets. The full contents of the supplement are available online at <url>http://genomebiology.com/supplements/9/S1</url></p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Initial sequencing and analysis of the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Linton</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Nusbaum</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zody</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Baldwin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Devon</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dewar</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>FitzHugh</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Funke</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gage</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Heaford</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Howland</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kann</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lehoczky</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>LeVine</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>McEwan</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>McKernan</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Meldrim</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mesirov</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Miranda</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Naylor</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Raymond</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rosetti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Santos</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sheridan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sougnez</snm>
                  <fnm>C</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <fpage>860</fpage>
            <lpage>921</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35057062</pubid>
                  <pubid idtype="pmpid" link="fulltext">11237011</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The sequence of the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Venter</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Mural</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>HO</fnm>
               </au>
               <au>
                  <snm>Yandell</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Holt</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Gocayne</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Amanatides</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ballew</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Huson</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Wortman</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Kodira</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>XH</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Skupski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Subramanian</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gabor Miklos</snm>
                  <fnm>GL</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Broder</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Nadeau</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>McKusick</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Zinder</snm>
                  <fnm>N</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>291</volume>
            <fpage>1304</fpage>
            <lpage>1351</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1058040</pubid>
                  <pubid idtype="pmpid" link="fulltext">11181995</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Initial sequencing and comparative analysis of the mouse genome.</p>
            </title>
            <aug>
               <au>
                  <cnm>Mouse Genome Sequencing Consortium</cnm>
               </au>
               <au>
                  <snm>Waterston</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Lindblad-Toh</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Abril</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Agarwal</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Agarwala</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ainscough</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Alexandersson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>An</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Antonarakis</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Attwood</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Baertsch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bailey</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Barlow</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Beck</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Berry</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bloom</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Botcherby</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bray</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Brent</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Bult</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Burton</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>RD</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>420</volume>
            <fpage>520</fpage>
            <lpage>562</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01262</pubid>
                  <pubid idtype="pmpid" link="fulltext">12466850</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Life with 6000 genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Goffeau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>BG</fnm>
               </au>
               <au>
                  <snm>Bussey</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Dujon</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Feldmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Galibert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hoheisel</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Jacq</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Louis</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Mewes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Murakami</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Philippsen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Tettelin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>SG</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1996</pubdate>
            <volume>274</volume>
            <fpage>563</fpage>
            <lpage>547</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1126/science.274.5287.546</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Why are there still over 1000 uncharacterized yeast genes?</p>
            </title>
            <aug>
               <au>
                  <snm>Pena-Castillo</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hughes</snm>
                  <fnm>TR</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2007</pubdate>
            <volume>176</volume>
            <fpage>7</fpage>
            <lpage>14</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1893027</pubid>
                  <pubid idtype="pmpid" link="fulltext">17435240</pubid>
                  <pubid idtype="doi">10.1534/genetics.107.074468</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Gene Expression Omnibus: microarray data storage, submission, retrieval, and analysis.</p>
            </title>
            <aug>
               <au>
                  <snm>Barrett</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Edgar</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Methods Enzymol</source>
            <pubdate>2006</pubdate>
            <volume>411</volume>
            <fpage>352</fpage>
            <lpage>369</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1619900</pubid>
                  <pubid idtype="pmpid" link="fulltext">16939800</pubid>
                  <pubid idtype="doi">10.1016/S0076-6879(06)11019-8</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>hORFeome v3.1: a resource of human open reading frames representing over 10,000 human genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Lamesch</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Milstein</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hao</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Szabo</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Venkatesan</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Bethel</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lawlor</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>McLaren</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dricot</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Borick</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cusick</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Vandenhaute</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dunham</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Vidal</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>2007</pubdate>
            <volume>89</volume>
            <fpage>307</fpage>
            <lpage>315</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ygeno.2006.11.012</pubid>
                  <pubid idtype="pmpid" link="fulltext">17207965</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Exploration of human ORFeome: high-throughput preparation of ORF clones and efficient characterization of their protein products.</p>
            </title>
            <aug>
               <au>
                  <snm>Nagase</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Yamakawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tadokoro</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nakajima</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Inoue</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yamaguchi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Itokawa</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kikuno</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>Koga</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Ohara</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>DNA Res</source>
            <pubdate>2008</pubdate>
            <inpress/>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18316326</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>The full-ORF clone resource of the German cDNA Consortium.</p>
            </title>
            <aug>
               <au>
                  <snm>Bechtel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rosenfelder</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Duda</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Ernst</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Wellenreuther</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mehrle</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bahr</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bl&#246;cker</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Heubner</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hoerlein</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Michel</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Wedler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>K&#246;hrer</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ottenw&#228;lder</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Poustka</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wiemann</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Schupp</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>399</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2213676</pubid>
                  <pubid idtype="pmpid" link="fulltext">17974005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Second-generation shRNA libraries covering the mouse and human genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Silva</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>MZ</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ge</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Golding</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Rickles</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Siolas</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Paddison</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Schlabach</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Sheth</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bradshaw</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Burchard</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kulkarni</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cavet</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sachidanandam</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>McCombie</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Cleary</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Elledge</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Hannon</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <fpage>1281</fpage>
            <lpage>1288</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16200065</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>A mouse for all reasons.</p>
            </title>
            <aug>
               <au>
                  <snm>Collins</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Rossant</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wurst</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2007</pubdate>
            <volume>128</volume>
            <fpage>9</fpage>
            <lpage>13</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2006.12.018</pubid>
                  <pubid idtype="pmpid" link="fulltext">17218247</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>A critical assessment of <it>Mus musculus </it>gene function prediction using integrated genomic evidence.</p>
            </title>
            <aug>
               <au>
                  <snm>Pe&#241;a-Castillo</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tasan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Joshi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Guan</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Leone</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pagnani</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>WK</fnm>
               </au>
               <au>
                  <snm>Krumpelman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tian</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Obozinski</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Qi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Mostafavi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>GN</fnm>
               </au>
               <au>
                  <snm>Berriz</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Gibbons</snm>
                  <fnm>FD</fnm>
               </au>
               <au>
                  <snm>Lanckriet</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Qiu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Grant</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Barutcuoglu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Warde-Farley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Grouios</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ray</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jordan</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>Noble</snm>
                  <fnm>WS</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <issue>suppl 1</issue>
            <fpage>S2</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2447536</pubid>
                  <pubid idtype="pmpid" link="fulltext">18613946</pubid>
                  <pubid idtype="doi">10.1186/gb-2008-9-s1-s2</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Predicting gene function in a hierarchical context with an ensemble of classifiers.</p>
            </title>
            <aug>
               <au>
                  <snm>Guan</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Hess</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Barutcuoglu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Caudy</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Troyanskaya</snm>
                  <fnm>OG</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <issue>suppl 1</issue>
            <fpage>S3</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2447537</pubid>
                  <pubid idtype="pmpid" link="fulltext">18613947</pubid>
                  <pubid idtype="doi">10.1186/gb-2008-9-s1-s3</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function.</p>
            </title>
            <aug>
               <au>
                  <snm>Mostafavi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ray</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Warde-Farley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Grouios</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>Q</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <issue>suppl 1</issue>
            <fpage>S4</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2447538</pubid>
                  <pubid idtype="pmpid" link="fulltext">18613948</pubid>
                  <pubid idtype="doi">10.1186/gb-2008-9-s1-s4</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Inferring mouse gene functions from genomic-scale data using a combined functional network/classification strategy.</p>
            </title>
            <aug>
               <au>
                  <snm>Kim</snm>
                  <fnm>WK</fnm>
               </au>
               <au>
                  <snm>Krumpelman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <issue>suppl 1</issue>
            <fpage>S5</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2447539</pubid>
                  <pubid idtype="pmpid" link="fulltext">18613949</pubid>
                  <pubid idtype="doi">10.1186/gb-2008-9-s1-s5</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>A probabilistic functional network of yeast genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Date</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Adai</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>306</volume>
            <fpage>1555</fpage>
            <lpage>1558</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1099511</pubid>
                  <pubid idtype="pmpid" link="fulltext">15567862</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Consistent probabilistic outputs for protein function prediction.</p>
            </title>
            <aug>
               <au>
                  <snm>Obozinski</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lanckriet</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Grant</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jordan</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>Stafford Noble</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <issue>suppl 1</issue>
            <fpage>S6</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2447540</pubid>
                  <pubid idtype="pmpid" link="fulltext">18613950</pubid>
                  <pubid idtype="doi">10.1186/gb-2008-9-s1-s6</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Combining guilt-by-association and guilt-by-profiling to predict <it>Saccharomyces cerevisiae </it>gene function.</p>
            </title>
            <aug>
               <au>
                  <snm>Tian</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>LV</fnm>
               </au>
               <au>
                  <snm>Ta&#351;an</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gibbons</snm>
                  <fnm>FD</fnm>
               </au>
               <au>
                  <snm>King</snm>
                  <fnm>OD</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wunderlich</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>FP</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <issue>Suppl 1</issue>
            <fpage>S7</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2447541</pubid>
                  <pubid idtype="pmpid" link="fulltext">18613951</pubid>
                  <pubid idtype="doi">10.1186/gb-2008-9-s1-s7</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>An <it>en masse </it>phenotype and function prediction system for <it>Mus musculus</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Ta&#351;an</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tian</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Gibbons</snm>
                  <fnm>FD</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>FP</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <issue>suppl 1</issue>
            <fpage>S8</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2447542</pubid>
                  <pubid idtype="pmpid" link="fulltext">18613952</pubid>
                  <pubid idtype="doi">10.1186/gb-2008-9-s1-s8</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Rigorous performance evaluation in protein structure modelling and implications for computational biology.</p>
            </title>
            <aug>
               <au>
                  <snm>Moult</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Phil Trans R Soc Lond B Biol Sci</source>
            <pubdate>2006</pubdate>
            <volume>361</volume>
            <fpage>453</fpage>
            <lpage>458</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1098/rstb.2005.1810</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
