<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>1752-0509-7-4</ui>
	<ji>1752-0509</ji>
	<fm>
		<dochead>Software</dochead>
		<bibl>
			<title>
				<p>EnRICH: <b>E</b>xtractio<b>n </b>and <b>R</b>anking using <b>I</b>ntegration and <b>C</b>riteria <b>H</b>euristics</p>
			</title>
			<aug>
				<au id="A1"><snm>Zhang</snm><fnm>Xia</fnm><insr iid="I1"/><insr iid="I3"/><email>xiazhang@iastate.edu</email></au>
				<au id="A2" ca="yes"><snm>Greenlee</snm><fnm>M Heather West</fnm><insr iid="I2"/><insr iid="I3"/><insr iid="I4"/><email>mheather@iastate.edu</email></au>
				<au id="A3"><snm>Serb</snm><mi>M</mi><fnm>Jeanne</fnm><insr iid="I1"/><insr iid="I3"/><email>serb@iastate.edu</email></au>
			</aug>
			<insg>
				<ins id="I1"><p>Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa, USA</p></ins>
				<ins id="I2"><p>Department of Biomedical Sciences, 2008 Veterinary Medicine, Iowa State University, Ames, IA 50010, USA</p></ins>
				<ins id="I3"><p>Interdepartmental Genetics Program, Iowa State University, Ames, Iowa, USA</p></ins>
				<ins id="I4"><p>Bioinformatics and Computational Biology Program, Iowa State University, Ames, Iowa, USA</p></ins>
			</insg>
			<source>BMC Systems Biology</source>
			<section><title><p>Methods, software and technology</p></title></section><issn>1752-0509</issn>
			<pubdate>2013</pubdate>
			<volume>7</volume>
			<issue>1</issue>
			<fpage>4</fpage>
			<url>http://www.biomedcentral.com/1752-0509/7/4</url>
			<xrefbib><pubidlist><pubid idtype="doi">10.1186/1752-0509-7-4</pubid><pubid idtype="pmpid">23320748</pubid></pubidlist></xrefbib>
		</bibl>
		<history><rec><date><day>11</day><month>4</month><year>2012</year></date></rec><acc><date><day>7</day><month>1</month><year>2013</year></date></acc><pub><date><day>15</day><month>1</month><year>2013</year></date></pub></history>
		<cpyrt><year>2013</year><collab>Zhang et al.; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
		<kwdg>
			<kwd>Qualitative integration</kwd>
			<kwd>High-throughput data</kwd>
			<kwd>Heterogeneous data</kwd>
			<kwd>Network</kwd>
			<kwd>Network visualization</kwd>
			<kwd>Candidate prioritization</kwd>
		</kwdg>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st><p>High throughput screening technologies enable biologists to generate candidate genes at a rate that, due to time and cost constraints, cannot be studied by experimental approaches in the laboratory. Thus, it has become increasingly important to prioritize candidate genes for experiments. To accomplish this, researchers need to apply selection requirements based on their knowledge, which necessitates qualitative integration of heterogeneous data sources and filtration using multiple criteria. A similar approach can also be applied to putative candidate gene relationships. While automation can assist in this routine and imperative procedure, flexibility of data sources and criteria must not be sacrificed. A tool that can optimize the trade-off between automation and flexibility to simultaneously filter and qualitatively integrate data is needed to prioritize candidate genes and generate composite networks from heterogeneous data sources.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st><p>We developed the java application, EnRICH (<b>
							<ul>E</ul>
						</b>xtractio<b>
							<ul>n</ul>
						</b> and <b>
							<ul>R</ul>
						</b>anking using <b>
							<ul>I</ul>
						</b>ntegration and <b>
							<ul>C</ul>
						</b>riteria <b>
							<ul>H</ul>
						</b>euristics), in order to alleviate this need. Here we present a case study in which we used EnRICH to integrate and filter multiple candidate gene lists in order to identify potential retinal disease genes. As a result of this procedure, a candidate pool of several hundred genes was narrowed down to five candidate genes, of which four are confirmed retinal disease genes and one is associated with a retinal disease state.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusions</p>
					</st><p>We developed a platform-independent tool that is able to qualitatively integrate multiple heterogeneous datasets and use different selection criteria to filter each of them, provided the datasets are tables that have distinct identifiers (required) and attributes (optional). With the flexibility to specify data sources and filtering criteria, EnRICH automatically prioritizes candidate genes or gene relationships for biologists based on their specific requirements. Here, we also demonstrate that this tool can be effectively and easily used to apply highly specific user-defined criteria and can efficiently identify high quality candidate genes from relatively sparse datasets.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st><p>Hundreds to thousands of candidate genes, or genes of interest, can now be generated from a single experiment utilizing high throughput screening technologies. However, the number of candidate genes that can be experimentally studied in-depth is often constrained by time and cost. Therefore, prioritization of candidate genes is a critical step in the experimental process. Approaches to identify &#8216;the most promising&#8217; candidates are becoming increasingly more sophisticated. For example, when microarray studies were initially reported, &#8216;the most promising&#8217; candidates were often the most differentially expressed and could be obtained by a simple ranking of candidates based on fold change. As more data has become available, biologists have begun to look for ways <abbrgrp>
					<abbr bid="B1">1</abbr>
					<abbr bid="B2">2</abbr>
					<abbr bid="B3">3</abbr>
					<abbr bid="B4">4</abbr>
				</abbrgrp> to use multiple data sources to increase the accuracy of candidate gene prioritization. Some tools have already been developed to address this need <abbrgrp>
					<abbr bid="B5">5</abbr>
					<abbr bid="B6">6</abbr>
					<abbr bid="B7">7</abbr>
					<abbr bid="B8">8</abbr>
					<abbr bid="B9">9</abbr>
					<abbr bid="B10">10</abbr>
					<abbr bid="B11">11</abbr>
				</abbrgrp>. These tools prioritize candidates by their similarity to genes already known to be important for a particular biological process (e.g., genes known to regulate cell cycle in yeast). Multiple data sources including published literature, gene sequence, functional annotation, etc. can be considered when comparing the similarity of candidates to &#8216;known genes&#8217;. These tools <abbrgrp>
					<abbr bid="B5">5</abbr>
					<abbr bid="B6">6</abbr>
					<abbr bid="B7">7</abbr>
					<abbr bid="B8">8</abbr>
					<abbr bid="B9">9</abbr>
					<abbr bid="B10">10</abbr>
					<abbr bid="B11">11</abbr>
				</abbrgrp> have made important progress towards the problem of candidate prioritization. However, these tools use data queried from predetermined sources, such as public databases, and include embedded criteria. Thus, these software packages have limited utility.</p><p>Biologists, with expertise in a given area, generally already have a list of criteria that could be applied to identify high quality candidates. Likely, for a given set of experiments and resulting datasets, the best candidates may satisfy one set of criteria in one dataset and a separate set of criteria in another dataset. Currently, there is no tool that allows simultaneous consideration of heterogeneous datasets to identify candidates that satisfy multiple criteria. This problem does not only relate to candidate genes, but also to putative relationships between genes in networks.</p><p>Putative gene relationships can be inferred from many heterogeneous sources (e.g., physical interactions, genetic interactions, expression correlation and interactions predicted by computational models). While each of these relationships from a given dataset should be interpreted differently (and subject to very different criteria), the ability to easily hypothesize gene relationships based on their meeting appropriate criteria in multiple datasets is an attractive prospect. This task not only calls for an automated filtering and integration tool, but also demands great flexibility of data sources and the ability to set filtering criteria. Finally, for proper interpretation, visualization of the resulting network must facilitate inspection by 1) retaining the original data sources of each putative relationship and 2) providing a mechanism to easily manage the size of the displayed network. While some tools have been developed to generate composite networks from multiple data sources (e.g., the Cytoscape <abbrgrp>
					<abbr bid="B12">12</abbr>
				</abbrgrp> plugin CABIN <abbrgrp>
					<abbr bid="B13">13</abbr>
				</abbrgrp>, GraphWeb <abbrgrp>
					<abbr bid="B14">14</abbr>
				</abbrgrp> and GeneMania <abbrgrp>
					<abbr bid="B15">15</abbr>
				</abbrgrp>), they do not fully address the problems stated above. For example, CABIN supports only one filter for a single source network and thus multiple criteria cannot be applied. GraphWeb <abbrgrp>
					<abbr bid="B14">14</abbr>
				</abbrgrp> does not support filtering by user-defined criteria and interactive network visualization. GeneMania <abbrgrp>
					<abbr bid="B15">15</abbr>
				</abbrgrp> helps to predict the function of a set of input genes by utilizing functional association data to generate a functional relevant network, but does not address integration of user-determined data and filtration with user-defined criteria.</p><p>We identified the need for a tool that is able to: 1) filter individual datasets using appropriate criteria and then integrate them to prioritize candidates that meet the criteria in multiple datasets; 2) allow users to define the most appropriate datasets and filtering criteria; and 3) provide an interactive visualization to facilitate the generation of an integrated network with a manageable size and connectedness. To address the open demand of filtering and qualitative integration of heterogeneous datasets, we have developed a stand-alone, portable and flexible java application with its own user-interactive visualization. EnRICH (<b>
					<ul>E</ul>
				</b>xtractio<b>
					<ul>n</ul>
				</b> and <b>
					<ul>R</ul>
				</b>anking using <b>
					<ul>I</ul>
				</b>ntegration and <b>
					<ul>C</ul>
				</b>riteria <b>
					<ul>H</ul>
				</b>euristics) will assist biologists in prioritization of genes and gene relationships from heterogeneous-source data.</p>
		</sec>
		<sec>
			<st>
				<p>Implementation</p>
			</st><p>EnRICH was implemented in Java (SE 6 JDK). EnRICH visualization was written in Processing (<url>http://processing.org/</url>), an open-source programming language to create images, animation and interactions. The separation of non-visual and visual modules of EnRICH lays a flexible foundation for future development and provides the user easy access to both the text and visual output results.</p>
			<sec>
				<st>
					<p>Design</p>
				</st><p>The objectives of EnRICH are firstly to provide a tool for integration of multiple or heterogeneous data sets to prioritize candidate molecules that fulfill user-defined criteria, and secondly to make the integration process flexible and simple for biologists who have little programming skill. Our aim-oriented design principles are 1) user-defined data sources and criteria, 2) simplicity which allows straight-forward application of user-defined criteria to filter user-defined datasets, and 3) platform independence.</p><p>The overall architecture of EnRICH is reflected in its workflow-like graphical user interface (GUI) (Figure&#8201;<figr fid="F1">1</figr>). The first component (numbered as step 1 in the GUI) accepts a single file or a directory of files as input data and lists all files that can be selected for analysis. The second component (numbered as step 2 in the GUI) allows the user to display the selected file as a table and edit the table. The third component (numbered as step 3 in the GUI) enables the user to specify filtering criteria for each attribute of the selected file. The fourth component (numbered as step 4 in the GUI) displays all uploaded files for the user to customize an integration pool. It also provides the user running options on whether to apply filters that are already specified in step 3. The fifth component is a dialog window, which appears when the integration run is finished, and gives the user the option to save or visualize the result. For network data, EnRICH has an additional visualization component where the user can do an interactive visual analysis of the integrated network.</p>
				<fig id="F1"><title><p>Figure 1</p></title><caption><p>EnRICH graphical user interface.</p></caption><text>
   <p><b>EnRICH graphical user interface. </b>There are four major components in the user interface, which are numbered as 1, 2, 3 and 4. Component 1: upload input data; Component 2: browse or edit the selected file; Component 3: specify filtering criteria of the selected file; Component 4: select files to define integration pool.</p>
</text><graphic file="1752-0509-7-4-1"/></fig>
			</sec>
			<sec>
				<st>
					<p>Input data</p>
				</st><p>The current version of EnRICH accepts two types of data: list and network. A list is a set of elements that could be genes, proteins, etc., which have their own unique identification code or name. List data can come from a large variety of sources. For example, a list of genes can be differentially expressed genes (DEGs) from the analysis of a microarray experiment, genes identified by genome-wide association mapping, or genes retrieved from a database query. Each list member may have one or more attributes. For example, each gene in a list of DEGs has its own significance value, functional annotation, etc. For EnRICH, list data is represented as a named matrix that is composed of one column of elements and zero to multiple attribute columns. Attributes can either be value attributes that will be taken as mathematical values or label attributes treated as tags.</p><p>A network is a set of nodes that are interconnected by edges representing particular relationships between nodes. Like list data, network data can originate from heterogeneous sources including yeast two-hybrid experiments, computational or statistical inferences, literature summaries or database queries. Although there are several standard languages or formats for network representation, we assume that biologists may not be familiar with those standards. Thus, EnRICH applies a popular node-pair/edge list format as the input format for network data, where an edge is denoted by the pair of nodes it connects. In the matrix format, network edges are represented by two columns of node names. Like list data, network edges may have values and label attributes. Accordingly, a network is a named matrix consisting of two columns of nodes and zero to multiple attribute columns. EnRICH allows blank fields in the attribute column when data are missing.</p>
			</sec>
			<sec>
				<st>
					<p>Running mode</p>
				</st><p>EnRICH runs in two modes: undefined (without filters) and defined (using specific criteria to filter attributes). The undefined mode simply ignores the attributes of networks or lists. Each list or network is considered as a source, and all sources will be merged together. The defined mode simultaneously considers integration of networks or lists as well as user-defined criteria (which filters out elements that do not meet the criteria) over each network or list. For both types of running modes, candidates (edges of a network or elements of a list) are ranked by their reoccurrence across all sources after integration. The filtering process is completely user-defined. Because the filter is totally attribute-based, the user sets filters most appropriate for their biological question, which may include a combination of filters for each attribute, and even multiple filters for multiple attributes. For example, two of the comparison operators (&lt;, &lt;=, &gt;, &gt;=, ==) applied at the same time can be used to set a cutoff range for value attributes or several tags can be used (with an OR operator between them) when the user wants to select multiple label values (e.g. two annotations) for one attribute. When there are multiple attributes, multiple filters (with an AND operator between them) can be applied simultaneously.</p>
			</sec>
			<sec>
				<st>
					<p>Text output</p>
				</st><p>EnRICH saves output results as a tab-delimited text file. In the output text file, the user can see what files were integrated, which filters were applied to each file, and the result. For list data, the result is a table, which consists of three columns: the label of an element, its reoccurrence across all lists, and names of source-lists. For network data, the result includes four tables: node statistics, edge statistics, nodes, and edges. The node degree reveals topological importance of the node, so the table of node statistics contains two columns, one column is the node degree (the number of connections a single node has) and the other is the number of nodes that are greater than or equal to (&gt;=) this node degree. For the table of edge statistics, one column is edge reoccurrence (the number of times a single edge is recovered across all datasets) and the other is the number of edges that have an edge reoccurrence that are greater than or equal to (&gt;=) this edge reoccurrence. The table of nodes and the table of edges are quite similar. Each has a column of nodes/edges, their reoccurrence, and source-networks. The only difference between the node and edge tables is that a node is represented by the node label and an edge is denoted as two node labels. The table of edges is a tab-delimited data table composed of several columns such as node label name, edge reoccurrence and source. Therefore, if desired, the user can directly copy or import them into another network visualization tool such as Cytoscape <abbrgrp>
						<abbr bid="B12">12</abbr>
						<abbr bid="B16">16</abbr>
					</abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Visualization</p>
				</st><p>EnRICH enables an interactive visual analysis of the integrated network without depending on a third-party visualization software. EnRICH network visualization consists of two components for user interaction: the integrated network and the plot of network statistics (Figure&#8201;<figr fid="F2">2</figr>). In the integrated network, an undirected edge is drawn as a blue line while a directed edge is drawn as a pink line with a pink arrow to indicate the direction of the interaction (e.g. transcriptional regulation). A blue line with a pink arrow is used to denote merged undirected and directed edges. All edges and nodes can be repositioned, without changing connections, by clicking and then dragging the item on the screen. In addition, the user can click to show or dissipate node labels and edge sources at the node- and edge-specific level, instead of the whole network level. The plot component has two plots: 1) the number of nodes vs. node degree plot and 2) the number of edges vs. edge-reoccurrence plot. The number of nodes and the number of edges are two aspects of the network size, while the node degree reveals topological importance of node. Edge reoccurrence is the number of times the edge is recovered in different data sources, which implies the reliability of an edge. In conjunction, the two plots are used to balance the visualization of network size and quality. All data points in the two plots are clickable to re-draw the integrated network at the selected level of node degree or edge reoccurrence. This interactive plot gives the user an easily visible comparison of node degree, edge reoccurrence and network size, and allows the user to simultaneously visualize the network at corresponding levels. EnRICH also allows the exportation of the network as an image file in TIFF format, which is widely supported.</p>
				<fig id="F2"><title><p>Figure 2</p></title><caption><p>EnRICH visualization window.</p></caption><text>
   <p><b>EnRICH visualization window. </b>Left: integrated network from synthesizing data. Circles represent nodes, while circle size represents node degree (the number of connections one node has). Lines represent edges of a network and line stroke represents the amount of edge reoccurrence (the number of times one edge is recovered across data sources). Undirected edges are represented by blue lines, while directed edges are pink lines with pink arrows. The merged edge of undirected and directed data is denoted by blue line with a pink arrow. Right: top panel is the statistical plot of node degree vs. the number of nodes. Bottom panel is the statistical plot of edge reoccurrence vs. the number of edges. In the software, all data points in the two plots are clickable to update visualization of the integrated network.</p>
</text><graphic file="1752-0509-7-4-2"/></fig>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Result and discussion</p>
			</st>
			<sec>
				<st>
					<p>Application of EnRICH</p>
				</st><p>Retinal disease genes are genes that, when knocked out or mutated, cause retinal degeneration (<url>https://sph.uth.tmc.edu/retnet/disease.htm</url>). The identification of retinal disease genes is a major goal of retinal degenerative disease research and as part of this effort, there have been a significant number of experiments that describe transcriptional changes during normal retinal development <abbrgrp>
						<abbr bid="B17">17</abbr>
						<abbr bid="B18">18</abbr>
						<abbr bid="B19">19</abbr>
						<abbr bid="B20">20</abbr>
						<abbr bid="B21">21</abbr>
						<abbr bid="B22">22</abbr>
						<abbr bid="B23">23</abbr>
					</abbrgrp>. Here, we present a case study in which we use EnRICH to integrate multiple gene lists to identify potential retinal disease genes.</p><p>
					<it>Nrl</it>
					<abbrgrp>
						<abbr bid="B24">24</abbr>
						<abbr bid="B25">25</abbr>
						<abbr bid="B26">26</abbr>
						<abbr bid="B27">27</abbr>
					</abbrgrp> is a retinal disease gene that is associated with the retinal degenerative disease enhanced s-cone syndrome <abbrgrp>
						<abbr bid="B28">28</abbr>
					</abbrgrp>. When <it>Nrl</it> is mutated, the resulting phenotype is an abundance of s-cone photoreceptors at the expense of rod photoreceptor differentiation <abbrgrp>
						<abbr bid="B25">25</abbr>
						<abbr bid="B29">29</abbr>
					</abbrgrp>, leading to the eventual death of all photoreceptors. During normal development, <it>Nrl</it> influences the cone versus the rod cell fate decision by activating rod-specific genes, including the genes <it>Rho</it> and <it>Nr2e3</it>
					<abbrgrp>
						<abbr bid="B30">30</abbr>
					</abbrgrp>. <it>Rho</it>
					<abbrgrp>
						<abbr bid="B31">31</abbr>
						<abbr bid="B32">32</abbr>
						<abbr bid="B33">33</abbr>
					</abbrgrp> is a rod-specific gene, the mutation of which leads to rod photoreceptor cell death and retinal degeneration. <it>Nr2e3</it>
					<abbrgrp>
						<abbr bid="B34">34</abbr>
						<abbr bid="B35">35</abbr>
					</abbrgrp> is also essential during retinal development, as it promotes the expression of rod-specific genes (including <it>Rho</it>) and represses the expression of cone-specific genes in rods. The mutation of <it>Nr2e3</it> also causes enhanced s-cone syndrome <abbrgrp>
						<abbr bid="B36">36</abbr>
					</abbrgrp>. Based on the known regulatory relationships between these three disease genes and their importance for normal photoreceptor development, we rationalize that the behavior of these genes would make good criteria to identify additional retinal disease genes.</p><p>Using these assumptions, we defined the following criteria to identify retinal disease genes: 1) candidates must be highly co-expressed with <it>Nrl</it>, <it>Nr2e3</it> and <it>Rho</it> during rod photoreceptor development of wild-type mice; and 2) candidates must be disregulated when <it>Nrl</it> is knocked out (as <it>Nr2e3</it> and <it>Rho</it> are). With these criteria in mind, we decided to use a microarray dataset <abbrgrp>
						<abbr bid="B17">17</abbr>
					</abbrgrp> (GSE4051), which profiles gene expression in isolated rod photoreceptors at multiple developmental stages (E16, P2, P6, xP10, 4-weeks) in both <it>Nrl</it>-knockout and wild-type mice. In these microarrays, we confirmed that <it>Nr2e3</it> and <it>Rho</it> are highly co-expressed with <it>Nrl</it> in wildtype and are no longer co-expressed in the <it>Nrl</it> mutant.</p><p>According to the corresponding workflow (Figure&#8201;<figr fid="F3">3</figr>), we prepared, and subsequently integrated, three types of gene lists which are: Type 1) Genes that are co-expressed with <it>Nrl</it>, <it>Nr2e3</it> and <it>Rho</it> in developing wild type rod photoreceptors; Type 2) Genes that are co-expressed with <it>Nrl</it>, <it>Nr2e3</it> and <it>Rho</it> in developing photoreceptors isolated from <it>Nrl</it>-mutant retinas; and Type 3) Differentially expressed genes (DEGs) at each age when comparing gene expression in wild-type rod photoreceptors to <it>Nrl</it>-knockout rods. Each list contained attributes that were used to apply criteria filters (i.e. pairwise correlations for type 1 and 2, age at which expression was up or down regulated for type 3). To carry out the workflow, we first specified filtering criteria for each list. This is a key element of EnRICH, where users can simultaneously query multiple datasets to generate an &#8216;integrated result&#8217;. For this experiment, eight list datasets were integrated. The filtering criterion for six of the lists was an absolute value of the correlation coefficient greater than 0.9, while the filtering criterion for the two differentially expressed gene lists was the developmental time points P6 and P10 (for criteria on each single list, see Additional file <supplr sid="S1">1</supplr>: Table S1). Candidates that satisfied these filtering criteria in eight lists were identified as the highest priority candidates. All the lists were prepared from by standard analyses of the dataset GSE 4051 (calculation of co-expression coefficients within a genotype and differentially expressed genes between genotypes).</p>
				<suppl id="S1">
					<title>
						<p>Additional file 1</p>
					</title>
					<text>
						<p>
							<b>Gene lists and their filtration criteria prior to integration. </b>This file includes a supplementary table that displays names and descriptions of gene lists for integration in case study and some further explanation on the sources of gene lists.</p>
					</text>
					<file name="1752-0509-7-4-S1.pdf">
   <p>Click here for file</p>
</file>
				</suppl>
				<fig id="F3"><title><p>Figure 3</p></title><caption><p>Case study workflow.</p></caption><text>
   <p><b>Case study workflow. </b><b>A. </b>Rationale illustrates our current knowledge of Nrl, Nr2e3 and Rho and their behaviors in the dataset we analyzed. <b>B. </b>Workflow displays the steps that we must go through in order to apply our criteria to identify candidate genes. These steps are executed by EnRICH to obtain candidate genes. Workflow is the implementation of the investigation process based on the rationale (A).</p>
</text><graphic file="1752-0509-7-4-3"/></fig><p>The execution of our workflow generated five candidate genes (see Additional file <supplr sid="S2">2</supplr>: Table S2) from an initial pool of 272 unique differentially expressed genes (see Additional file <supplr sid="S3">3</supplr>). Based on a literature/database search, four of our five candidate genes (<it>pde6b</it>, <it>gnb1</it>, <it>guca1a</it> and <it>cgna1</it>) are confirmed retinal disease genes <abbrgrp>
						<abbr bid="B37">37</abbr>
						<abbr bid="B38">38</abbr>
						<abbr bid="B39">39</abbr>
						<abbr bid="B40">40</abbr>
						<abbr bid="B41">41</abbr>
						<abbr bid="B42">42</abbr>
						<abbr bid="B43">43</abbr>
						<abbr bid="B44">44</abbr>
						<abbr bid="B45">45</abbr>
						<abbr bid="B46">46</abbr>
					</abbrgrp>, and the fifth gene (<it>kcne2</it>) has been shown to be up regulated during a neuroinflamatory response in the retinas of diabetic rats <abbrgrp>
						<abbr bid="B47">47</abbr>
					</abbrgrp>, making it a reasonable candidate for a disease gene as well. Thus, in our example analysis to identify disease genes, 80% of our candidates are known disease genes, while the remaining candidate has a demonstrated tie to the diseased retina, and is therefore a high quality candidate. Using a Fisher test we also concluded that retinal disease genes are significantly overrepresented in the genes prioritized by EnRICH, compared with genes not prioritized by EnRICH (see Additional file <supplr sid="S3">3</supplr>).</p>
				<suppl id="S2">
					<title>
						<p>Additional file 2</p>
					</title>
					<text>
						<p>
							<b>Gene candidates resulting from the EnRICH filtration and prioritization analyses. </b>This file is the supplementary table of the five gene candidates from the prioritization by using EnRICH in the case study.</p>
					</text>
					<file name="1752-0509-7-4-S2.xlsx">
   <p>Click here for file</p>
</file>
				</suppl>
				<suppl id="S3">
					<title>
						<p>Additional file 3</p>
					</title>
					<text>
						<p>
							<b>Description of data processing. </b>This file includes detail description of data pre-processing for case study, and analysis of the significance of case study result.</p>
					</text>
					<file name="1752-0509-7-4-S3.pdf">
   <p>Click here for file</p>
</file>
				</suppl><p>Our case study demonstrates that a well-conceived data integration and criteria-based filtration, as implemented in EnRICH, can effectively identify a limited number of high quality candidate genes for careful hypothesis-based investigation. Conversely, if the number of candidates returned is too small, slight adjustments in the filtration criteria may be easily made to generate a larger, while still reasonably-sized, candidate pool.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Conclusions</p>
			</st><p>EnRICH is a free java application which can qualitatively integrate results from large, heterogeneous data sources while simultaneously applying filters to each of them. It allows the user to define data sources, and to integrate them as well as specify multiple sorting criteria specific to each data source. It provides interactive network visualization tool for the user to identify an integrated network with a desirable balance between network size and quality. With EnRICH, biologists have an automated yet flexible integration tool to carry out their data analysis and effectively prioritize candidate genes for further investigation.</p>
		</sec>
		<sec>
			<st>
				<p>Availability and requirements</p>
			</st><p>
				<b>Project name:</b> EnRICH (see Additional file <supplr sid="S4">4</supplr>: for the jar file of EnRICH program).</p>
			<suppl id="S4">
				<title>
					<p>Additional file 4</p>
				</title>
				<text>
					<p>
						<b>EnRICH program. </b>This file is the archive file of EnRICH which aggregates Java class files and associated metadata and resources.</p>
				</text>
				<file name="1752-0509-7-4-S4.jar">
   <p>Click here for file</p>
</file>
			</suppl><p>
				<b>Project home page:</b>
				<url>http://xiazhang.public.iastate.edu/</url> or the software category on the lab homepage <url>http://serb.public.iastate.edu</url>.</p><p>
				<b>Operating system(s):</b> platform-independent.</p><p>
				<b>Programming language:</b> Java.</p><p>
				<b>Other requirements:</b> Java 1.4.2 or higher.</p><p>
				<b>License:</b> GNU General Public License.</p><p>
				<b>Any restrictions to use by non-academics:</b> NO.</p>
		</sec>
		<sec>
			<st>
				<p>Competing interest</p>
			</st><p>The author(s) declare that they have no competing interests.</p>
		</sec>
		<sec>
			<st>
				<p>Authors&#8217; contributions</p>
			</st><p>XZ, JMS and MHWG conceived and designed this software, and drafted this manuscript. XZ coded this software and conducted case study presented in this manuscript. All authors read and approved the final manuscript.</p>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st><p>The authors wish to thank Guan Wang and Fadi Towfic for their helpful comments on this software. Financial support was provided by the Center for Integrative Animal Genomics at Iowa State University to JMS and MHWG.</p>
			</sec>
		</ack>
		<refgrp><bibl id="B1"><title><p>Integrative network analysis identifies key genes and pathways in the progression of hepatitis C virus induced hepatocellular carcinoma</p></title><aug><au><snm>Zheng</snm><fnm>S</fnm></au><au><snm>Tansey</snm><fnm>WP</fnm></au><au><snm>Hiebert</snm><fnm>SW</fnm></au><au><snm>Zhao</snm><fnm>Z</fnm></au></aug><source>BMC Med Genomics</source><pubdate>2011</pubdate><volume>4</volume><fpage>62</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1755-8794-4-62</pubid><pubid idtype="pmcid">3212927</pubid><pubid idtype="pmpid" link="fulltext">21824427</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Predicting cancer involvement of genes from heterogeneous data</p></title><aug><au><snm>Aragues</snm><fnm>R</fnm></au><au><snm>Sander</snm><fnm>C</fnm></au><au><snm>Oliva</snm><fnm>B</fnm></au></aug><source>BMC Bioinforma</source><pubdate>2008</pubdate><volume>9</volume><fpage>172</fpage><xrefbib><pubid idtype="doi">10.1186/1471-2105-9-172</pubid></xrefbib></bibl><bibl id="B3"><title><p>Prioritization and evaluation of depression candidate genes by combining multidimensional data resources</p></title><aug><au><snm>Kao</snm><fnm>CF</fnm></au><au><snm>Fang</snm><fnm>YS</fnm></au><au><snm>Zhao</snm><fnm>Z</fnm></au><au><snm>Kuo</snm><fnm>PH</fnm></au></aug><source>PLoS One</source><pubdate>2011</pubdate><volume>6</volume><issue>4</issue><fpage>e18696</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0018696</pubid><pubid idtype="pmcid">3071871</pubid><pubid idtype="pmpid" link="fulltext">21494644</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Integration and visualization of systems biology data in context of the genome</p></title><aug><au><snm>Bare</snm><fnm>JC</fnm></au><au><snm>Koide</snm><fnm>T</fnm></au><au><snm>Reiss</snm><fnm>DJ</fnm></au><au><snm>Tenenbaum</snm><fnm>D</fnm></au><au><snm>Baliga</snm><fnm>NS</fnm></au></aug><source>BMC Bioinforma</source><pubdate>2010</pubdate><volume>11</volume><fpage>382</fpage><xrefbib><pubid idtype="doi">10.1186/1471-2105-11-382</pubid></xrefbib></bibl><bibl id="B5"><title><p>A guide to web tools to prioritize candidate genes</p></title><aug><au><snm>Tranchevent</snm><fnm>LC</fnm></au><au><snm>Capdevila</snm><fnm>FB</fnm></au><au><snm>Nitsch</snm><fnm>D</fnm></au><au><snm>De Moor</snm><fnm>B</fnm></au><au><snm>De Causmaecker</snm><fnm>P</fnm></au><au><snm>Moreau</snm><fnm>Y</fnm></au></aug><source>Brief Bioinform</source><pubdate>2010</pubdate><volume>12</volume><issue>1</issue><fpage>22</fpage><lpage>32</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">21278374</pubid></xrefbib></bibl><bibl id="B6"><title><p>ToppGene Suite for gene list enrichment analysis and candidate gene prioritization</p></title><aug><au><snm>Chen</snm><fnm>J</fnm></au><au><snm>Bardes</snm><fnm>EE</fnm></au><au><snm>Aronow</snm><fnm>BJ</fnm></au><au><snm>Jegga</snm><fnm>AG</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>37</volume><issue>Web Server issue</issue><fpage>W305</fpage><lpage>311</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2703978</pubid><pubid idtype="pmpid" link="fulltext">19465376</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>SUSPECTS: enabling fast and effective prioritization of positional candidates</p></title><aug><au><snm>Adie</snm><fnm>EA</fnm></au><au><snm>Adams</snm><fnm>RR</fnm></au><au><snm>Evans</snm><fnm>KL</fnm></au><au><snm>Porteous</snm><fnm>DJ</fnm></au><au><snm>Pickard</snm><fnm>BS</fnm></au></aug><source>Bioinformatics</source><pubdate>2006</pubdate><volume>22</volume><issue>6</issue><fpage>773</fpage><lpage>774</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btk031</pubid><pubid idtype="pmpid" link="fulltext">16423925</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Genie: literature-based gene prioritization at multi genomic scale</p></title><aug><au><snm>Fontaine</snm><fnm>JF</fnm></au><au><snm>Priller</snm><fnm>F</fnm></au><au><snm>Barbosa-Silva</snm><fnm>A</fnm></au><au><snm>Andrade-Navarro</snm><fnm>MA</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2011</pubdate><volume>39</volume><issue>Web Server issue</issue><fpage>W455</fpage><lpage>461</lpage><xrefbib><pubidlist><pubid idtype="pmcid">3125729</pubid><pubid idtype="pmpid" link="fulltext">21609954</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>PINTA: a web server for network-based gene prioritization from expression data</p></title><aug><au><snm>Nitsch</snm><fnm>D</fnm></au><au><snm>Tranchevent</snm><fnm>LC</fnm></au><au><snm>Goncalves</snm><fnm>JP</fnm></au><au><snm>Vogt</snm><fnm>JK</fnm></au><au><snm>Madeira</snm><fnm>SC</fnm></au><au><snm>Moreau</snm><fnm>Y</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2011</pubdate><volume>39</volume><issue>Web Server issue</issue><fpage>W334</fpage><lpage>338</lpage><xrefbib><pubidlist><pubid idtype="pmcid">3125740</pubid><pubid idtype="pmpid" link="fulltext">21602267</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>ENDEAVOUR update: a web resource for gene prioritization in multiple species</p></title><aug><au><snm>Tranchevent</snm><fnm>LC</fnm></au><au><snm>Barriot</snm><fnm>R</fnm></au><au><snm>Yu</snm><fnm>S</fnm></au><au><snm>Van Vooren</snm><fnm>S</fnm></au><au><snm>Van Loo</snm><fnm>P</fnm></au><au><snm>Coessens</snm><fnm>B</fnm></au><au><snm>De Moor</snm><fnm>B</fnm></au><au><snm>Aerts</snm><fnm>S</fnm></au><au><snm>Moreau</snm><fnm>Y</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2008</pubdate><volume>36</volume><issue>Web Server issue</issue><fpage>W377</fpage><lpage>384</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2447805</pubid><pubid idtype="pmpid" link="fulltext">18508807</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Recent approaches to the prioritization of candidate disease genes</p></title><aug><au><snm>Doncheva</snm><fnm>NT</fnm></au><au><snm>Kacprowski</snm><fnm>T</fnm></au><au><snm>Albrecht</snm><fnm>M</fnm></au></aug><source>Wiley Interdiscip Rev Syst Biol Med</source><pubdate>2012</pubdate><volume>4</volume><issue>5</issue><fpage>429</fpage><lpage>442</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/wsbm.1177</pubid><pubid idtype="pmpid" link="fulltext">22689539</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Cytoscape: a community-based framework for network modeling</p></title><aug><au><snm>Killcoyne</snm><fnm>S</fnm></au><au><snm>Carter</snm><fnm>GW</fnm></au><au><snm>Smith</snm><fnm>J</fnm></au><au><snm>Boyle</snm><fnm>J</fnm></au></aug><source>Methods Mol Biol</source><pubdate>2009</pubdate><volume>563</volume><fpage>219</fpage><lpage>239</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/978-1-60761-175-2_12</pubid><pubid idtype="pmpid" link="fulltext">19597788</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>CABIN: collective analysis of biological interaction networks</p></title><aug><au><snm>Singhal</snm><fnm>M</fnm></au><au><snm>Domico</snm><fnm>K</fnm></au></aug><source>Comput Biol Chem</source><pubdate>2007</pubdate><volume>31</volume><issue>3</issue><fpage>222</fpage><lpage>225</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.compbiolchem.2007.03.006</pubid><pubid idtype="pmpid" link="fulltext">17500038</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>GraphWeb: mining heterogeneous biological networks for gene modules with functional significance</p></title><aug><au><snm>Reimand</snm><fnm>J</fnm></au><au><snm>Tooming</snm><fnm>L</fnm></au><au><snm>Peterson</snm><fnm>H</fnm></au><au><snm>Adler</snm><fnm>P</fnm></au><au><snm>Vilo</snm><fnm>J</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2008</pubdate><volume>36</volume><issue>Web Server issue</issue><fpage>W452</fpage><lpage>459</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2447774</pubid><pubid idtype="pmpid" link="fulltext">18460544</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function</p></title><aug><au><snm>Mostafavi</snm><fnm>S</fnm></au><au><snm>Ray</snm><fnm>D</fnm></au><au><snm>Warde-Farley</snm><fnm>D</fnm></au><au><snm>Grouios</snm><fnm>C</fnm></au><au><snm>Morris</snm><fnm>Q</fnm></au></aug><source>Genome Biol</source><pubdate>2008</pubdate><volume>9</volume><issue>Suppl 1</issue><fpage>S4</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2008-9-s1-s4</pubid><pubid idtype="pmcid">2559988</pubid><pubid idtype="pmpid" link="fulltext">18834495</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Cytoscape: software for visualization and analysis of biological networks</p></title><aug><au><snm>Kohl</snm><fnm>M</fnm></au><au><snm>Wiese</snm><fnm>S</fnm></au><au><snm>Warscheid</snm><fnm>B</fnm></au></aug><source>Methods Mol Biol</source><pubdate>2011</pubdate><volume>696</volume><fpage>291</fpage><lpage>303</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/978-1-60761-987-1_18</pubid><pubid idtype="pmpid" link="fulltext">21063955</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Targeting of GFP to newborn rods by Nrl promoter and temporal expression profiling of flow-sorted photoreceptors</p></title><aug><au><snm>Akimoto</snm><fnm>M</fnm></au><au><snm>Cheng</snm><fnm>H</fnm></au><au><snm>Zhu</snm><fnm>D</fnm></au><au><snm>Brzezinski</snm><fnm>JA</fnm></au><au><snm>Khanna</snm><fnm>R</fnm></au><au><snm>Filippova</snm><fnm>E</fnm></au><au><snm>Oh</snm><fnm>EC</fnm></au><au><snm>Jing</snm><fnm>Y</fnm></au><au><snm>Linares</snm><fnm>JL</fnm></au><au><snm>Brooks</snm><fnm>M</fnm></au><etal/></aug><source>Proc Natl Acad Sci USA</source><pubdate>2006</pubdate><volume>103</volume><issue>10</issue><fpage>3890</fpage><lpage>3895</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0508214103</pubid><pubid idtype="pmcid">1383502</pubid><pubid idtype="pmpid" link="fulltext">16505381</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Global gene expression analysis of the developing postnatal mouse retina</p></title><aug><au><snm>Dorrell</snm><fnm>MI</fnm></au><au><snm>Aguilar</snm><fnm>E</fnm></au><au><snm>Weber</snm><fnm>C</fnm></au><au><snm>Friedlander</snm><fnm>M</fnm></au></aug><source>Invest Ophthalmol Vis Sci</source><pubdate>2004</pubdate><volume>45</volume><issue>3</issue><fpage>1009</fpage><lpage>1019</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1167/iovs.03-0806</pubid><pubid idtype="pmpid" link="fulltext">14985324</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Gene expression profiles of mouse retinas during the second and third postnatal weeks</p></title><aug><au><snm>Liu</snm><fnm>J</fnm></au><au><snm>Wang</snm><fnm>J</fnm></au><au><snm>Huang</snm><fnm>Q</fnm></au><au><snm>Higdon</snm><fnm>J</fnm></au><au><snm>Magdaleno</snm><fnm>S</fnm></au><au><snm>Curran</snm><fnm>T</fnm></au><au><snm>Zuo</snm><fnm>J</fnm></au></aug><source>Brain Res</source><pubdate>2006</pubdate><volume>1098</volume><issue>1</issue><fpage>113</fpage><lpage>125</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.brainres.2006.04.086</pubid><pubid idtype="pmpid" link="fulltext">16777074</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Molecular heterogeneity of developing retinal ganglion and amacrine cells revealed through single cell gene expression profiling</p></title><aug><au><snm>Trimarchi</snm><fnm>JM</fnm></au><au><snm>Stadler</snm><fnm>MB</fnm></au><au><snm>Roska</snm><fnm>B</fnm></au><au><snm>Billings</snm><fnm>N</fnm></au><au><snm>Sun</snm><fnm>B</fnm></au><au><snm>Bartch</snm><fnm>B</fnm></au><au><snm>Cepko</snm><fnm>CL</fnm></au></aug><source>J Comp Neurol</source><pubdate>2007</pubdate><volume>502</volume><issue>6</issue><fpage>1047</fpage><lpage>1065</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/cne.21368</pubid><pubid idtype="pmpid" link="fulltext">17444492</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Gene expression profiling of vasoregression in the retina--involvement of microglial cells</p></title><aug><au><snm>Feng</snm><fnm>Y</fnm></au><au><snm>Wang</snm><fnm>Y</fnm></au><au><snm>Li</snm><fnm>L</fnm></au><au><snm>Wu</snm><fnm>L</fnm></au><au><snm>Hoffmann</snm><fnm>S</fnm></au><au><snm>Gretz</snm><fnm>N</fnm></au><au><snm>Hammes</snm><fnm>HP</fnm></au></aug><source>PLoS One</source><pubdate>2011</pubdate><volume>6</volume><issue>2</issue><fpage>e16865</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0016865</pubid><pubid idtype="pmcid">3040753</pubid><pubid idtype="pmpid" link="fulltext">21379381</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Gene expression profiling of zebrafish embryonic retina</p></title><aug><au><snm>Leung</snm><fnm>YF</fnm></au><au><snm>Dowling</snm><fnm>JE</fnm></au></aug><source>Zebrafish</source><pubdate>2005</pubdate><volume>2</volume><issue>4</issue><fpage>269</fpage><lpage>283</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1089/zeb.2005.2.269</pubid><pubid idtype="pmpid" link="fulltext">18248185</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Global gene expression profiling of ischemic preconditioning in the rat retina</p></title><aug><au><snm>Kamphuis</snm><fnm>W</fnm></au><au><snm>Dijk</snm><fnm>F</fnm></au><au><snm>van Soest</snm><fnm>S</fnm></au><au><snm>Bergen</snm><fnm>AA</fnm></au></aug><source>Mol Vis</source><pubdate>2007</pubdate><volume>13</volume><fpage>1020</fpage><lpage>1030</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2776543</pubid><pubid idtype="pmpid" link="fulltext">17653046</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>The basic motif-leucine zipper transcription factor Nrl can positively regulate rhodopsin gene expression</p></title><aug><au><snm>Rehemtulla</snm><fnm>A</fnm></au><au><snm>Warwar</snm><fnm>R</fnm></au><au><snm>Kumar</snm><fnm>R</fnm></au><au><snm>Ji</snm><fnm>X</fnm></au><au><snm>Zack</snm><fnm>DJ</fnm></au><au><snm>Swaroop</snm><fnm>A</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1996</pubdate><volume>93</volume><issue>1</issue><fpage>191</fpage><lpage>195</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.93.1.191</pubid><pubid idtype="pmcid">40204</pubid><pubid idtype="pmpid" link="fulltext">8552602</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Nrl is required for rod photoreceptor development</p></title><aug><au><snm>Mears</snm><fnm>AJ</fnm></au><au><snm>Kondo</snm><fnm>M</fnm></au><au><snm>Swain</snm><fnm>PK</fnm></au><au><snm>Takada</snm><fnm>Y</fnm></au><au><snm>Bush</snm><fnm>RA</fnm></au><au><snm>Saunders</snm><fnm>TL</fnm></au><au><snm>Sieving</snm><fnm>PA</fnm></au><au><snm>Swaroop</snm><fnm>A</fnm></au></aug><source>Nat Genet</source><pubdate>2001</pubdate><volume>29</volume><issue>4</issue><fpage>447</fpage><lpage>452</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng774</pubid><pubid idtype="pmpid" link="fulltext">11694879</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>In vivo function of the orphan nuclear receptor NR2E3 in establishing photoreceptor identity during mammalian retinal development</p></title><aug><au><snm>Cheng</snm><fnm>H</fnm></au><au><snm>Aleman</snm><fnm>TS</fnm></au><au><snm>Cideciyan</snm><fnm>AV</fnm></au><au><snm>Khanna</snm><fnm>R</fnm></au><au><snm>Jacobson</snm><fnm>SG</fnm></au><au><snm>Swaroop</snm><fnm>A</fnm></au></aug><source>Hum Mol Genet</source><pubdate>2006</pubdate><volume>15</volume><issue>17</issue><fpage>2588</fpage><lpage>2602</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/hmg/ddl185</pubid><pubid idtype="pmcid">1592580</pubid><pubid idtype="pmpid" link="fulltext">16868010</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Rod differentiation factor NRL activates the expression of nuclear receptor NR2E3 to suppress the development of cone photoreceptors</p></title><aug><au><snm>Oh</snm><fnm>EC</fnm></au><au><snm>Cheng</snm><fnm>H</fnm></au><au><snm>Hao</snm><fnm>H</fnm></au><au><snm>Jia</snm><fnm>L</fnm></au><au><snm>Khan</snm><fnm>NW</fnm></au><au><snm>Swaroop</snm><fnm>A</fnm></au></aug><source>Brain Res</source><pubdate>2008</pubdate><volume>1236</volume><fpage>16</fpage><lpage>29</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2660138</pubid><pubid idtype="pmpid" link="fulltext">18294621</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Enhanced S cone syndrome: evidence for an abnormally large number of S cones</p></title><aug><au><snm>Hood</snm><fnm>DC</fnm></au><au><snm>Cideciyan</snm><fnm>AV</fnm></au><au><snm>Roman</snm><fnm>AJ</fnm></au><au><snm>Jacobson</snm><fnm>SG</fnm></au></aug><source>Vision Res</source><pubdate>1995</pubdate><volume>35</volume><issue>10</issue><fpage>1473</fpage><lpage>1481</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0042-6989(95)98727-Q</pubid><pubid idtype="pmpid" link="fulltext">7645276</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Cone-like morphological, molecular, and electrophysiological features of the photoreceptors of the Nrl knockout mouse</p></title><aug><au><snm>Daniele</snm><fnm>LL</fnm></au><au><snm>Lillo</snm><fnm>C</fnm></au><au><snm>Lyubarsky</snm><fnm>AL</fnm></au><au><snm>Nikonov</snm><fnm>SS</fnm></au><au><snm>Philp</snm><fnm>N</fnm></au><au><snm>Mears</snm><fnm>AJ</fnm></au><au><snm>Swaroop</snm><fnm>A</fnm></au><au><snm>Williams</snm><fnm>DS</fnm></au><au><snm>Pugh</snm><fnm>EN</fnm><suf>Jr</suf></au></aug><source>Invest Ophthalmol Vis Sci</source><pubdate>2005</pubdate><volume>46</volume><issue>6</issue><fpage>2156</fpage><lpage>2167</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1167/iovs.04-1427</pubid><pubid idtype="pmpid" link="fulltext">15914637</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Mouse retinal development: a dark horse model for systems biology research</p></title><aug><au><snm>Zhang</snm><fnm>X</fnm></au><au><snm>Serb</snm><fnm>JM</fnm></au><au><snm>Greenlee</snm><fnm>MH</fnm></au></aug><source>Bioinform Biol Insights</source><pubdate>2011</pubdate><volume>5</volume><fpage>99</fpage><lpage>113</lpage><xrefbib><pubidlist><pubid idtype="pmcid">3118678</pubid><pubid idtype="pmpid" link="fulltext">21698072</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Disease sequence from mutant rhodopsin allele to rod and cone photoreceptor degeneration in man</p></title><aug><au><snm>Cideciyan</snm><fnm>AV</fnm></au><au><snm>Hood</snm><fnm>DC</fnm></au><au><snm>Huang</snm><fnm>Y</fnm></au><au><snm>Banin</snm><fnm>E</fnm></au><au><snm>Li</snm><fnm>ZY</fnm></au><au><snm>Stone</snm><fnm>EM</fnm></au><au><snm>Milam</snm><fnm>AH</fnm></au><au><snm>Jacobson</snm><fnm>SG</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1998</pubdate><volume>95</volume><issue>12</issue><fpage>7103</fpage><lpage>7108</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.95.12.7103</pubid><pubid idtype="pmcid">22754</pubid><pubid idtype="pmpid" link="fulltext">9618546</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Novel dominant rhodopsin mutation triggers two mechanisms of retinal degeneration and photoreceptor desensitization</p></title><aug><au><snm>Iakhine</snm><fnm>R</fnm></au><au><snm>Chorna-Ornan</snm><fnm>I</fnm></au><au><snm>Zars</snm><fnm>T</fnm></au><au><snm>Elia</snm><fnm>N</fnm></au><au><snm>Cheng</snm><fnm>Y</fnm></au><au><snm>Selinger</snm><fnm>Z</fnm></au><au><snm>Minke</snm><fnm>B</fnm></au><au><snm>Hyde</snm><fnm>DR</fnm></au></aug><source>J Neurosci</source><pubdate>2004</pubdate><volume>24</volume><issue>10</issue><fpage>2516</fpage><lpage>2526</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1523/JNEUROSCI.5426-03.2004</pubid><pubid idtype="pmpid" link="fulltext">15014127</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>The relationship of photoreceptor degeneration to retinal vascular development and loss in mutant rhodopsin transgenic and RCS rats</p></title><aug><au><snm>Pennesi</snm><fnm>ME</fnm></au><au><snm>Nishikawa</snm><fnm>S</fnm></au><au><snm>Matthes</snm><fnm>MT</fnm></au><au><snm>Yasumura</snm><fnm>D</fnm></au><au><snm>LaVail</snm><fnm>MM</fnm></au></aug><source>Exp Eye Res</source><pubdate>2008</pubdate><volume>87</volume><issue>6</issue><fpage>561</fpage><lpage>570</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.exer.2008.09.004</pubid><pubid idtype="pmcid">3541029</pubid><pubid idtype="pmpid" link="fulltext">18848932</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>The rod photoreceptor-specific nuclear receptor Nr2e3 represses transcription of multiple cone-specific genes</p></title><aug><au><snm>Chen</snm><fnm>J</fnm></au><au><snm>Rattner</snm><fnm>A</fnm></au><au><snm>Nathans</snm><fnm>J</fnm></au></aug><source>J Neurosci</source><pubdate>2005</pubdate><volume>25</volume><issue>1</issue><fpage>118</fpage><lpage>129</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1523/JNEUROSCI.3571-04.2005</pubid><pubid idtype="pmpid" link="fulltext">15634773</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>The photoreceptor-specific nuclear receptor Nr2e3 interacts with Crx and exerts opposing effects on the transcription of rod versus cone genes</p></title><aug><au><snm>Peng</snm><fnm>GH</fnm></au><au><snm>Ahmad</snm><fnm>O</fnm></au><au><snm>Ahmad</snm><fnm>F</fnm></au><au><snm>Liu</snm><fnm>J</fnm></au><au><snm>Chen</snm><fnm>S</fnm></au></aug><source>Hum Mol Genet</source><pubdate>2005</pubdate><volume>14</volume><issue>6</issue><fpage>747</fpage><lpage>764</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/hmg/ddi070</pubid><pubid idtype="pmpid" link="fulltext">15689355</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>A hybrid photoreceptor expressing both rod and cone genes in a mouse model of enhanced S-cone syndrome</p></title><aug><au><snm>Corbo</snm><fnm>JC</fnm></au><au><snm>Cepko</snm><fnm>CL</fnm></au></aug><source>PLoS Genet</source><pubdate>2005</pubdate><volume>1</volume><issue>2</issue><fpage>e11</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.0010011</pubid><pubid idtype="pmcid">1186732</pubid><pubid idtype="pmpid" link="fulltext">16110338</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>shRNA knockdown of guanylate cyclase 2e or cyclic nucleotide gated channel alpha 1 increases photoreceptor survival in a cGMP phosphodiesterase mouse model of retinitis pigmentosa</p></title><aug><au><snm>Tosi</snm><fnm>J</fnm></au><au><snm>Davis</snm><fnm>RJ</fnm></au><au><snm>Wang</snm><fnm>NK</fnm></au><au><snm>Naumann</snm><fnm>M</fnm></au><au><snm>Lin</snm><fnm>CS</fnm></au><au><snm>Tsang</snm><fnm>SH</fnm></au></aug><source>J Cell Mol Med</source><pubdate>2011</pubdate><volume>15</volume><issue>8</issue><fpage>1778</fpage><lpage>1787</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1582-4934.2010.01201.x</pubid><pubid idtype="pmcid">3071858</pubid><pubid idtype="pmpid" link="fulltext">20950332</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Genotype-phenotype correlation of mouse pde6b mutations</p></title><aug><au><snm>Hart</snm><fnm>AW</fnm></au><au><snm>McKie</snm><fnm>L</fnm></au><au><snm>Morgan</snm><fnm>JE</fnm></au><au><snm>Gautier</snm><fnm>P</fnm></au><au><snm>West</snm><fnm>K</fnm></au><au><snm>Jackson</snm><fnm>IJ</fnm></au><au><snm>Cross</snm><fnm>SH</fnm></au></aug><source>Invest Ophthalmol Vis Sci</source><pubdate>2005</pubdate><volume>46</volume><issue>9</issue><fpage>3443</fpage><lpage>3450</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1167/iovs.05-0254</pubid><pubid idtype="pmpid" link="fulltext">16123450</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>Evidence for retinal remodelling in retinitis pigmentosa caused by PDE6B mutation</p></title><aug><au><snm>Jacobson</snm><fnm>SG</fnm></au><au><snm>Sumaroka</snm><fnm>A</fnm></au><au><snm>Aleman</snm><fnm>TS</fnm></au><au><snm>Cideciyan</snm><fnm>AV</fnm></au><au><snm>Danciger</snm><fnm>M</fnm></au><au><snm>Farber</snm><fnm>DB</fnm></au></aug><source>Br J Ophthalmol</source><pubdate>2007</pubdate><volume>91</volume><issue>5</issue><fpage>699</fpage><lpage>701</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1954784</pubid><pubid idtype="pmpid" link="fulltext">17446518</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>Transgenic mice carrying the H258N mutation in the gene encoding the beta-subunit of phosphodiesterase-6 (PDE6B) provide a model for human congenital stationary night blindness</p></title><aug><au><snm>Tsang</snm><fnm>SH</fnm></au><au><snm>Woodruff</snm><fnm>ML</fnm></au><au><snm>Jun</snm><fnm>L</fnm></au><au><snm>Mahajan</snm><fnm>V</fnm></au><au><snm>Yamashita</snm><fnm>CK</fnm></au><au><snm>Pedersen</snm><fnm>R</fnm></au><au><snm>Lin</snm><fnm>CS</fnm></au><au><snm>Goff</snm><fnm>SP</fnm></au><au><snm>Rosenberg</snm><fnm>T</fnm></au><au><snm>Larsen</snm><fnm>M</fnm></au><etal/></aug><source>Hum Mutat</source><pubdate>2007</pubdate><volume>28</volume><issue>3</issue><fpage>243</fpage><lpage>254</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/humu.20425</pubid><pubid idtype="pmcid">2753261</pubid><pubid idtype="pmpid" link="fulltext">17044014</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>A screen for mutations in the transducin gene GNB1 in patients with autosomal dominant retinitis pigmentosa</p></title><aug><au><snm>Mylvaganam</snm><fnm>GH</fnm></au><au><snm>McGee</snm><fnm>TL</fnm></au><au><snm>Berson</snm><fnm>EL</fnm></au><au><snm>Dryja</snm><fnm>TP</fnm></au></aug><source>Mol Vis</source><pubdate>2006</pubdate><volume>12</volume><fpage>1496</fpage><lpage>1498</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">17167406</pubid></xrefbib></bibl><bibl id="B42"><title><p>Mutation in the gene GUCA1A, encoding guanylate cyclase-activating protein 1, causes cone, cone-rod, and macular dystrophy</p></title><aug><au><snm>Michaelides</snm><fnm>M</fnm></au><au><snm>Wilkie</snm><fnm>SE</fnm></au><au><snm>Jenkins</snm><fnm>S</fnm></au><au><snm>Holder</snm><fnm>GE</fnm></au><au><snm>Hunt</snm><fnm>DM</fnm></au><au><snm>Moore</snm><fnm>AT</fnm></au><au><snm>Webster</snm><fnm>AR</fnm></au></aug><source>Ophthalmology</source><pubdate>2005</pubdate><volume>112</volume><issue>8</issue><fpage>1442</fpage><lpage>1447</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ophtha.2005.02.024</pubid><pubid idtype="pmpid" link="fulltext">15953638</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>Dominant cone-rod dystrophy: a mouse model generated by gene targeting of the GCAP1/Guca1a gene</p></title><aug><au><snm>Buch</snm><fnm>PK</fnm></au><au><snm>Mihelec</snm><fnm>M</fnm></au><au><snm>Cottrill</snm><fnm>P</fnm></au><au><snm>Wilkie</snm><fnm>SE</fnm></au><au><snm>Pearson</snm><fnm>RA</fnm></au><au><snm>Duran</snm><fnm>Y</fnm></au><au><snm>West</snm><fnm>EL</fnm></au><au><snm>Michaelides</snm><fnm>M</fnm></au><au><snm>Ali</snm><fnm>RR</fnm></au><au><snm>Hunt</snm><fnm>DM</fnm></au></aug><source>PLoS One</source><pubdate>2011</pubdate><volume>6</volume><issue>3</issue><fpage>e18089</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0018089</pubid><pubid idtype="pmcid">3065489</pubid><pubid idtype="pmpid" link="fulltext">21464903</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Mutations in the GUCA1A gene involved in hereditary cone dystrophies impair calcium-mediated regulation of guanylate cyclase</p></title><aug><au><snm>Kitiratschky</snm><fnm>VB</fnm></au><au><snm>Behnen</snm><fnm>P</fnm></au><au><snm>Kellner</snm><fnm>U</fnm></au><au><snm>Heckenlively</snm><fnm>JR</fnm></au><au><snm>Zrenner</snm><fnm>E</fnm></au><au><snm>Jagle</snm><fnm>H</fnm></au><au><snm>Kohl</snm><fnm>S</fnm></au><au><snm>Wissinger</snm><fnm>B</fnm></au><au><snm>Koch</snm><fnm>KW</fnm></au></aug><source>Hum Mutat</source><pubdate>2009</pubdate><volume>30</volume><issue>8</issue><fpage>E782</fpage><lpage>796</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/humu.21055</pubid><pubid idtype="pmpid" link="fulltext">19459154</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>Autosomal dominant cone dystrophy caused by a novel mutation in the GCAP1 gene (GUCA1A)</p></title><aug><au><snm>Jiang</snm><fnm>L</fnm></au><au><snm>Katz</snm><fnm>BJ</fnm></au><au><snm>Yang</snm><fnm>Z</fnm></au><au><snm>Zhao</snm><fnm>Y</fnm></au><au><snm>Faulkner</snm><fnm>N</fnm></au><au><snm>Hu</snm><fnm>J</fnm></au><au><snm>Baird</snm><fnm>J</fnm></au><au><snm>Baehr</snm><fnm>W</fnm></au><au><snm>Creel</snm><fnm>DJ</fnm></au><au><snm>Zhang</snm><fnm>K</fnm></au></aug><source>Mol Vis</source><pubdate>2005</pubdate><volume>11</volume><fpage>143</fpage><lpage>151</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">15735604</pubid></xrefbib></bibl><bibl id="B46"><title><p>Mutations in the gene encoding the alpha subunit of the rod cGMP-gated channel in autosomal recessive retinitis pigmentosa</p></title><aug><au><snm>Dryja</snm><fnm>TP</fnm></au><au><snm>Finn</snm><fnm>JT</fnm></au><au><snm>Peng</snm><fnm>YW</fnm></au><au><snm>McGee</snm><fnm>TL</fnm></au><au><snm>Berson</snm><fnm>EL</fnm></au><au><snm>Yau</snm><fnm>KW</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1995</pubdate><volume>92</volume><issue>22</issue><fpage>10177</fpage><lpage>10181</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.92.22.10177</pubid><pubid idtype="pmcid">40759</pubid><pubid idtype="pmpid" link="fulltext">7479749</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>Whole genome assessment of the retinal response to diabetes reveals a progressive neurovascular inflammatory response</p></title><aug><au><snm>Brucklacher</snm><fnm>RM</fnm></au><au><snm>Patel</snm><fnm>KM</fnm></au><au><snm>VanGuilder</snm><fnm>HD</fnm></au><au><snm>Bixler</snm><fnm>GV</fnm></au><au><snm>Barber</snm><fnm>AJ</fnm></au><au><snm>Antonetti</snm><fnm>DA</fnm></au><au><snm>Lin</snm><fnm>CM</fnm></au><au><snm>LaNoue</snm><fnm>KF</fnm></au><au><snm>Gardner</snm><fnm>TW</fnm></au><au><snm>Bronson</snm><fnm>SK</fnm></au><etal/></aug><source>BMC Med Genomics</source><pubdate>2008</pubdate><volume>1</volume><fpage>26</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1755-8794-1-26</pubid><pubid idtype="pmcid">2442612</pubid><pubid idtype="pmpid" link="fulltext">18554398</pubid></pubidlist></xrefbib></bibl></refgrp>
	</bm>
</art>