<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-10-42</ui>
   <ji>1471-2164</ji>
   <fm>
		<dochead>Research article</dochead>
		<bibl>
			<title>
				<p>Positive correlation between gene coexpression and positional clustering in the zebrafish genome</p>
			</title>
			<aug>
				<au id="A1">
					<snm>Ng</snm>
					<fnm>Yen Kaow</fnm>
					<insr iid="I1"/>
					<email>matnyk@nus.edu.sg</email>
				</au>
				<au id="A2">
					<snm>Wu</snm>
					<fnm>Wei</fnm>
					<insr iid="I2"/>
					<email>wwu@imcb.a-star.edu.sg</email>
				</au>
				<au id="A3" ca="yes">
					<snm>Zhang</snm>
					<fnm>Louxin</fnm>
					<insr iid="I1"/>
					<email>matzlx@nus.edu.sg</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Department of Mathematics, National University of Singapore, 2 Science Drive 2, Singapore 117543, Singapore</p>
				</ins>
				<ins id="I2">
					<p>Institute of Molecular and Cell Biology, Singapore 138673, Singapore</p>
				</ins>
			</insg>
			<source>BMC Genomics</source>
			<issn>1471-2164</issn>
			<pubdate>2009</pubdate>
			<volume>10</volume>
			<issue>1</issue>
			<fpage>42</fpage>
			<url>http://www.biomedcentral.com/1471-2164/10/42</url>
			<xrefbib>
				<pubidlist>
					<pubid idtype="pmpid">19159490</pubid>
					<pubid idtype="doi">10.1186/1471-2164-10-42</pubid>
				</pubidlist>
			</xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>04</day>
					<month>7</month>
					<year>2008</year>
				</date>
			</rec>
			<acc>
				<date>
					<day>22</day>
					<month>1</month>
					<year>2009</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>22</day>
					<month>1</month>
					<year>2009</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2009</year>
			<collab>Ng et al; licensee BioMed Central Ltd.</collab>
			<note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>Co-expressing genes tend to cluster in eukaryotic genomes. This paper analyzes correlation between the proximity of eukaryotic genes and their transcriptional expression pattern in the zebrafish (<it>Danio rerio</it>) genome using available microarray data and gene annotation.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>The analyses show that neighbouring genes are significantly coexpressed in the zebrafish genome, and the coexpression level is influenced by the intergenic distance and transcription orientation. This fact is further supported by examining the coexpression level of genes within positional clusters in the neighbourhood model. There is a positive correlation between gene coexpression and positional clustering in the zebrafish genome.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusion</p>
					</st>
					<p>The study provides another piece of evidence for the hypothesis that coexpressed genes do cluster in the eukaryotic genomes.</p>
				</sec>
			</sec>
		</abs>
	</fm>
   <bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>In most eukaryotes, the transcription factor mechanism seems sufficient to ensure coregulation of genes, and hence co-localization of genes is not critical. Accordingly, there should be no selection pressure for coregulated genes to line up next to each other in an eukaryotic genome. However, genes are not randomly distributed in the genome as they were thought to be even after tandem genes are excluded <abbrgrp>
					<abbr bid="B1">1</abbr>
					<abbr bid="B2">2</abbr>
				</abbrgrp>. The coexpression of clustered genes has been reported in <it>Homo sapiens </it>
				<abbrgrp>
					<abbr bid="B3">3</abbr>
				</abbrgrp>, <it>Caenorhabditis elegans </it>
				<abbrgrp>
					<abbr bid="B4">4</abbr>
					<abbr bid="B5">5</abbr>
					<abbr bid="B6">6</abbr>
					<abbr bid="B7">7</abbr>
				</abbrgrp>, <it>Drosophila melanogaster </it>
				<abbrgrp>
					<abbr bid="B8">8</abbr>
					<abbr bid="B9">9</abbr>
					<abbr bid="B10">10</abbr>
					<abbr bid="B11">11</abbr>
					<abbr bid="B12">12</abbr>
				</abbrgrp>, <it>Saccharomyces cerevisiae </it>
				<abbrgrp>
					<abbr bid="B8">8</abbr>
					<abbr bid="B13">13</abbr>
					<abbr bid="B14">14</abbr>
				</abbrgrp>, and <it>Arabidopsis thaliana </it>
				<abbrgrp>
					<abbr bid="B15">15</abbr>
				</abbrgrp>. Moreover, positional clustering of genes that are highly expressed in a specific tissue or a pathway has also been revealed in different genomes <abbrgrp>
					<abbr bid="B16">16</abbr>
					<abbr bid="B17">17</abbr>
					<abbr bid="B18">18</abbr>
					<abbr bid="B19">19</abbr>
				</abbrgrp>.</p>
			<p>These mentioned studies on the coexpression of proximate genes and positional clustering of coexpressed genes are based on expression data obtained from biotechnologies such as SAGE data, DNA microarray, together with gene annotations. There are several reasons for proximate genes to be coexpressed to a certain degree. There are operons in <it>C. elegans </it>
				<abbrgrp>
					<abbr bid="B4">4</abbr>
					<abbr bid="B20">20</abbr>
				</abbrgrp>. Adjacent gene pairs can share <it>cis</it>-regulatory elements <abbrgrp>
					<abbr bid="B21">21</abbr>
				</abbrgrp>. There could be some selection force that keeps coregulated genes in the same region, for example, to make transcription more efficient as a group <abbrgrp>
					<abbr bid="B10">10</abbr>
				</abbrgrp>.</p>
			<p>Here we present a genome-wide analysis of clustering of coexpressed genes in the zebrafish genome using available microarray data. As a representative of the bony fishes, the zebrafish has become a well-established model organism in a variety of studies in developmental biology and drug discovery. It has made important contributions to the identification of genes involved in development, behaviour and disease. The zebrafish genome is about 1.9 billion base pairs long and contains approximately 20,000 to 30,000 genes on 25 chromosomes. We first used a method proposed in William and Bowles <abbrgrp>
					<abbr bid="B15">15</abbr>
				</abbrgrp> to examine the degree of coexpression among proximity genes. We investigated the effect of intergenic distance and transcription orientation on the level of coexpression of neighboring genes. To further investigate the coexpression of proximate genes, we investigated the level of coexpression of genes in positional clusters identified in the neighbourhood model. Our bioinformatics analyses suggest that a positive correlation exists between the significance of positional gene clusters with the degree of coexpression of genes in the clusters.</p>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<sec>
				<st>
					<p>Proximate genes are coexpressed in the zebrafish genome</p>
				</st>
				<p>In order to study the coexpression of proximate genes, we analyzed 100 expression datasets derived from Affymetrix microarray experiments. We use the Pearson correlation coefficient (<it>R</it>) of two genes to measure the level of their coexpression. The mean <it>R </it>of all the neighbouring gene pairs in our dataset is 0.07468 (with standard error 0.00424). This mean value is statistically significant (with <it>p</it>-value 0.0001) as it is +11.4 standard deviations from the mean <it>R </it>in a randomized genome. In a randomized genome with the same genes and expression values, the mean <it>R </it>is only 0.03086 (with standard deviation 0.00384) (Figure <figr fid="F1">1A</figr>). Tandem duplicated genes have identical functions and hence are often highly coexpressed. To eliminate the effects of tandem duplicates on this coexpression study, we removed all members except one in each tandem gene cluster and redid the analysis. After removal of tandem duplicates, the mean <it>R </it>became 0.06844 (with standard error 0.00426). It is slightly smaller than the value when all genes are included in the analysis, but still significant (with <it>p</it>-value 0.0001, +9.7 standard deviations from the random mean) (Figure <figr fid="F1">1B</figr>).</p>
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>Distribution of 10,000 mean <it>R </it>values calculated from randomized genome</p>
					</caption>
					<text>
						<p>
							<b>Distribution of 10,000 mean <it>R </it>values calculated from randomized genome</b>. Each plot shows the distribution of 10,000 mean <it>R </it>values. Each mean <it>R </it>value is calculated by first randomly permuting the gene order of the genome, and then averaging the <it>R </it>values for every pair of neighboring genes in the resulting gene order. The mean R value in the real genome is shown as a single line on each plot. Both plots are based on the same gene expression dataset: (A) the results on the original dataset (average of mean <it>R </it>= 0.03086, &#963; = 0.00384); (B) the results after tandem duplicates are removed (average of mean <it>R </it>= 0.03071, &#963; = 0.00389).</p>
					</text>
					<graphic file="1471-2164-10-42-1"/>
				</fig>
				<p>Surprisingly, the mean <it>R </it>is almost identical for zebrafish and <it>Arabidopsis </it>genomes <abbrgrp>
						<abbr bid="B15">15</abbr>
					</abbrgrp> when the gene order is randomized, which is about 0.03 with standard deviation 0.004). The underlying cause behind this is unclear. The positive mean value rather than zero could be an effect of the coexpression of the housekeeping genes as suggested in Williams and Bowles <abbrgrp>
						<abbr bid="B15">15</abbr>
					</abbrgrp>. The reason for the identical mean <it>R </it>when these two genomes are randomized is probably either that the housekeeping genes show common patterns of expression in different genomes, dominating the mean <it>R </it>value, or alternatively, that there is a constant bias or weak autocorrection between all genes in each microarray dataset (see Additional file <supplr sid="S1">1</supplr> for detailed discussion and also <abbrgrp>
						<abbr bid="B11">11</abbr>
					</abbrgrp>).</p>
				<suppl id="S1">
					<title>
						<p>Additional file 1</p>
					</title>
					<text>
						<p>
							<b>A</b>
							<b>nalysis of non-zero positive mean <it>R</it> value in randomized genome and other discussions.</b> It contains the analysis of mean <it>R</it> in randomized genome and other studies relevant to, but not written in this manuscript.</p>
					</text>
					<file name="1471-2164-10-42-S1.pdf">
						<p>Click here for file</p>
					</file>
				</suppl>
				<p>To further explore the coexpression of proximate genes, we partitioned the genes into non-overlapping blocks of <it>k </it>(3 &#8804; <it>k </it>&#8804; 20) physically adjacent genes according to their start position. For each gene block, we calculated a mean <it>R </it>of the coexpression values; then the mean of all the mean <it>R</it>s is calculated and plotted in Figure <figr fid="F2">2</figr>. The degree of coexpression first decreases and then becomes stable when the block size <it>k </it>increases from 3 to 20. To verify the significance of the coexpression degree of the genes for each block size, we compared them to what would have been obtained if the genes had been rearranged in each of the following three ways: (1) randomly permuting the gene order over the entire genome; (2) randomly permuting the order of genes within each chromosome; and (3) randomly permuting the order of non-overlapping blocks of 3 consecutive genes. The last rearrangement is used to examine whether the coexpression degree in the larger blocks are dominated mainly by genes that are separated by only one or two genes. The analyses show that there is a significant difference in degree of coexpression between actual and randomized genome (Figure <figr fid="F2">2</figr>). Finally, we remark that the start point for partitioning the genes into non-overlapping blocks has little effect on the analysis presented above because of the way we calculate the mean coexpression value of genes within a block.</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>Mean of pair-wise <it>R </it>values in blocks of size 3 to 20 (&#9650;), shown with standard error</p>
					</caption>
					<text>
						<p>
							<b>Mean of pair-wise <it>R </it>values in blocks of size 3 to 20 (&#9650;), shown with standard error</b>. This is compared to the mean of 100 values obtained similarly, each from the same analysis after a random permutation of: (1) the gene order of the entire genome (&#9651;); (2) the order of genes in each chromosome (&#9633;); (3) the order of non-overlapping blocks of 3 consecutive genes (&#9632;). Plots (&#9651;), (&#9633;) and (&#9632;) are shown with standard deviations. The points in (<b>A</b>) are from analyses with the full dataset, while (<b>B</b>) are from analyses after tandem duplicates are removed.</p>
					</text>
					<graphic file="1471-2164-10-42-2"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Coexpression of genes within GO classes</p>
				</st>
				<p>Genes are classified into different classes according to the biological processes they are involved in the GO database <abbrgrp>
						<abbr bid="B22">22</abbr>
					</abbrgrp>. We evaluated the mean <it>R </it>of the coexpression value of a pair of genes within a GO class that contains 20 or less genes. There are 853 such GO classes. The mean <it>R </it>is higher than 0.13, 0.30 and 0.50 in 50%, 25% and 10% of these classes, respectively (see Additional file <supplr sid="S2">2</supplr>). Thus, genes within a GO class are highly coexpressed as reported in other genomes.</p>
				<suppl id="S2">
					<title>
						<p>Additional file 2</p>
					</title>
					<text>
						<p>
							<b>Coexpression analysis of genes with GO classes.</b> It contains spreadsheet data from analysis of coexpression of genes within GO classes.</p>
					</text>
					<file name="1471-2164-10-42-S2.xls">
						<p>Click here for file</p>
					</file>
				</suppl>
			</sec>
			<sec>
				<st>
					<p>Distance and coexpression</p>
				</st>
				<p>In this analysis, the genes within 50 kilo-base pairs (kbp) from each other are collected, mean <it>R </it>values calculated, and the distance between them rounded to the nearest number in 0, 2000, 4000..., 50000. Figure <figr fid="F3">3</figr> plots the mean of these <it>R </it>values against their intergenic distance. There is a clear negative correlation between intergenic distance and degree of coexpression in both the full dataset (regression line <it>R</it>
					<sup>2 </sup>= 0.50) or after removal of tandem duplicates (regression line <it>R</it>
					<sup>2 </sup>= 0.46). The correlation is most significant for the gene pairs of distances between 10 kbp to 40 kbp (<it>R</it>
					<sup>2 </sup>= 0.79, <it>p </it>&lt; 0.005 for the full dataset, <it>R</it>
					<sup>2 </sup>= 0.83, <it>p </it>&lt; 0.005 for the dataset without tandem duplicates). Outside of this range, this correlation seems to be absent.</p>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Mean <it>R </it>of gene pairs of up to 50 kbp apart</p>
					</caption>
					<text>
						<p>
							<b>Mean <it>R </it>of gene pairs of up to 50 kbp apart</b>. Gene pairs of up to 50 kbp apart were binned according to their intergenic distance, shown with regression lines. (A) is from the full dataset, whereas (B) from the resulting dataset after tandem duplicates are removed.</p>
					</text>
					<graphic file="1471-2164-10-42-3"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Gene orientation and coexpression</p>
				</st>
				<p>Genes can be transcripted in two directions, denoted by (&#8594;) and (&#8592;). Thus, two genes can have: divergent transcription (&#8592; &#8594;), convergent transcription (&#8594; &#8592;), or parallel transcription (&#8592; &#8592; or &#8594; &#8594;) orientation. We partitioned all the gene pairs into three groups according to transcription orientation. For each orientation class of gene pairs, we calculated the mean <it>R </it>value. Regardless of whether tandem duplicates are removed or not, gene pairs with parallel orientation showed the highest degree of coexpression, in both average as well as median values, while gene pairs with convergent orientation showed the lowest degree of coexpression (Table <tblr tid="T1">1</tblr>). Kruskal-Wallis tests confirmed this effect of orientation (<it>p </it>= 0.0267) for the entire dataset, but the effect is insignificant after the removal of tandem duplicates (<it>p </it>= 0.2894). This is presumably because most tandem gene pairs have parallel orientation.</p>
				<tbl id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Descriptive statistics for pair-wise comparison of neighboring genes according to orientation of transcription</p>
					</caption>
					<tblbdy cols="7">
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c cspan="2" ca="center">
								<p>
									<b>Pearson's correlation coefficient (R)</b>
								</p>
							</c>
							<c cspan="2" ca="center">
								<p>
									<b>Intergenic distance (bp)</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Orientation</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>N</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Mean R &#177; se</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Median R</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Mean bp &#177; se</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Median bp</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<b>Analyzed with full dataset</b>
								</p>
							</c>
							<c ca="center">
								<p>&#8592; &#8594;</p>
							</c>
							<c ca="center">
								<p>1681</p>
							</c>
							<c ca="center">
								<p>0.0637 &#177; 0.0084</p>
							</c>
							<c ca="center">
								<p>0.0238</p>
							</c>
							<c ca="center">
								<p>215221.3 &#177; 7887.2</p>
							</c>
							<c ca="center">
								<p>88957</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>&#8594; &#8594;/&#8592; &#8592;</p>
							</c>
							<c ca="center">
								<p>3418</p>
							</c>
							<c ca="center">
								<p>0.0877 &#177; 0.0061</p>
							</c>
							<c ca="center">
								<p>0.0547</p>
							</c>
							<c ca="center">
								<p>201669.6 &#177; 5611.0</p>
							</c>
							<c ca="center">
								<p>75199</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>&#8594; &#8592;</p>
							</c>
							<c ca="center">
								<p>1678</p>
							</c>
							<c ca="center">
								<p>0.0592 &#177; 0.0082</p>
							</c>
							<c ca="center">
								<p>0.0238</p>
							</c>
							<c ca="center">
								<p>207196.8 &#177; 8385.1</p>
							</c>
							<c ca="center">
								<p>75805</p>
							</c>
						</r>
						<r>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<b>Analyzed w/o tandem duplicates</b>
								</p>
							</c>
							<c ca="center">
								<p>&#8592; &#8594;</p>
							</c>
							<c ca="center">
								<p>1635</p>
							</c>
							<c ca="center">
								<p>0.0618 &#177; 0.0086</p>
							</c>
							<c ca="center">
								<p>0.0220</p>
							</c>
							<c ca="center">
								<p>219869.4 &#177; 8382.6</p>
							</c>
							<c ca="center">
								<p>91546</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>&#8594; &#8594;/&#8592; &#8592;</p>
							</c>
							<c ca="center">
								<p>3295</p>
							</c>
							<c ca="center">
								<p>0.0762 &#177; 0.0061</p>
							</c>
							<c ca="center">
								<p>0.0438</p>
							</c>
							<c ca="center">
								<p>209041.5 &#177; 5714.5</p>
							</c>
							<c ca="center">
								<p>82818</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>&#8594; &#8592;</p>
							</c>
							<c ca="center">
								<p>1632</p>
							</c>
							<c ca="center">
								<p>0.0594 &#177; 0.0083</p>
							</c>
							<c ca="center">
								<p>0.0250</p>
							</c>
							<c ca="center">
								<p>216217.3 &#177; 8729.1</p>
							</c>
							<c ca="center">
								<p>80961</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>Positional clustering and the level of coexpression</p>
				</st>
				<p>We have seen from the analyses that a correlation exists between intergenic distance and degree of coexpression. To further the study in this direction, we examined coexpression among the genes in a positional cluster. We adopted the neighbourhood model to identify positional gene clusters and evaluated positional clustering using a method proposed by Li, Lee and Zhang <abbrgrp>
						<abbr bid="B19">19</abbr>
					</abbrgrp>. In the neighbourhood model, two genes are in a cluster if and only if there is a series of genes between them such that the distance between two adjacent genes in the series is less than a specified distance (<it>D</it>).</p>
				<p>The significance of a positional cluster depends on the value of <it>D</it>, the number of genes it contains, and the gene density of its vicinity. In this study, we set <it>D </it>to be 25K, which is one-eighth of the average distance between genes (206K for the full dataset, 213K after removal of tandem duplicates. c.f. Table <tblr tid="T1">1</tblr>). These clusters are described in Additional file <supplr sid="S3">3</supplr>.</p>
				<suppl id="S3">
					<title>
						<p>Additional file 3</p>
					</title>
					<text>
						<p>
							<b>Positional gene clusters together with their <it>p</it>-values and co expression mean <it>R</it>.</b> It contains the analysis of positional gene clusters for <it>D</it> = 25K with/without tandem duplicated genes.</p>
					</text>
					<file name="1471-2164-10-42-S3.xls">
						<p>Click here for file</p>
					</file>
				</suppl>
				<p>Neighbouring genes within the clusters we identified tend to show a higher degree of coexpression than neighbouring pairs that are not. This observation is consistent with the observation that is mentioned above: a pair of genes within shorter intergenic distance tends to be more coexpressed. Table <tblr tid="T2">2</tblr> shows that with only one exception, the mean <it>R </it>values for clusters of all sizes are higher than 0.07468, the value of mean <it>R </it>of all neighbouring gene pairs in the whole dataset.</p>
				<tbl id="T2">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>Mean of <it>R </it>values for all neighboring gene pairs found within some cluster of size <it>d </it>(<it>d </it>= 2, 3 ..., 7, > 7).</p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c>
								<p/>
							</c>
							<c cspan="2" ca="center">
								<p>
									<b>Number of gene pairs</b>
								</p>
							</c>
							<c cspan="2" ca="center">
								<p>
									<b>Mean R</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>Cluster size (<it>d</it>)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Analyzed with full dataset</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Analyzed w/o tandem duplicates</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Analyzed with full dataset</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Analyzed w/o tandem duplicates</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>877</p>
							</c>
							<c ca="center">
								<p>867</p>
							</c>
							<c ca="center">
								<p>0.1051 &#177; 0.0122</p>
							</c>
							<c ca="center">
								<p>0.0930 &#177; 0.0121</p>
							</c>
						</r>
						<r>
							<c cspan="1">
								<hr/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>3</p>
							</c>
							<c ca="center">
								<p>560</p>
							</c>
							<c ca="center">
								<p>520</p>
							</c>
							<c ca="center">
								<p>0.0952 &#177; 0.0152</p>
							</c>
							<c ca="center">
								<p>0.0799 &#177; 0.0157</p>
							</c>
						</r>
						<r>
							<c cspan="1">
								<hr/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>315</p>
							</c>
							<c ca="center">
								<p>282</p>
							</c>
							<c ca="center">
								<p>0.1237 &#177; 0.0202</p>
							</c>
							<c ca="center">
								<p>0.1148 &#177; 0.0207</p>
							</c>
						</r>
						<r>
							<c cspan="1">
								<hr/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>140</p>
							</c>
							<c ca="center">
								<p>148</p>
							</c>
							<c ca="center">
								<p>0.0739 &#177; 0.0303</p>
							</c>
							<c ca="center">
								<p>0.0870 &#177; 0.0306</p>
							</c>
						</r>
						<r>
							<c cspan="1">
								<hr/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>6</p>
							</c>
							<c ca="center">
								<p>125</p>
							</c>
							<c ca="center">
								<p>80</p>
							</c>
							<c ca="center">
								<p>0.1302 &#177; 0.0328</p>
							</c>
							<c ca="center">
								<p>0.1330 &#177; 0.0385</p>
							</c>
						</r>
						<r>
							<c cspan="1">
								<hr/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>7</p>
							</c>
							<c ca="center">
								<p>60</p>
							</c>
							<c ca="center">
								<p>48</p>
							</c>
							<c ca="center">
								<p>0.2360 &#177; 0.0545</p>
							</c>
							<c ca="center">
								<p>0.2505 &#177; 0.0585</p>
							</c>
						</r>
						<r>
							<c cspan="1">
								<hr/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
							<c cspan="2">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>> 7</p>
							</c>
							<c ca="center">
								<p>88</p>
							</c>
							<c ca="center">
								<p>62</p>
							</c>
							<c ca="center">
								<p>0.3427 &#177; 0.0390</p>
							</c>
							<c ca="center">
								<p>0.2456 &#177; 0.0492</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<p>Among the identified positional clusters, there are ten highly significant clusters containing eight or more genes, listed in Table <tblr tid="T3">3</tblr>. They include hox gene clusters <it>hoxba </it>on <it>Chr3 </it>and hoxca on <it>Chr23 </it>
					<abbrgrp>
						<abbr bid="B23">23</abbr>
					</abbrgrp>, olfactory receptor gene cluster <it>or1 </it>on <it>Chr15</it>, <it>cytochrome P450 aromatase </it>gene cluster <it>cyp2j </it>on <it>Chr20</it>, and major histocompatibility complex (<it>Mhc</it>) gene cluster on <it>Chr19 </it>
					<abbrgrp>
						<abbr bid="B24">24</abbr>
						<abbr bid="B25">25</abbr>
					</abbrgrp>. The other five clusters contain a mix of genes from different GO classes. Although these genes have not yet been investigated, they are likely structurally and functionally unrelated. In each of the gene clusters except for three eight-gene clusters on <it>Chr4</it>, <it>Chr19 </it>and <it>Chr20</it>, genes have high level of coexpression.</p>
				<tbl id="T3">
					<title>
						<p>Table 3</p>
					</title>
					<caption>
						<p>The positional clusters which contain at least 8 genes. The <it>xxx </it>stands for genes with unknown functions.</p>
					</caption>
					<tblbdy cols="6">
						<r>
							<c ca="center">
								<p>
									<b>
										<it>Chr</it>
									</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>size</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>span</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<it>p</it>-value</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>mean <it>R</it>
									</b>
								</p>
							</c>
							<c ca="left">
								<p>
									<b>genes listed in order</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>3</p>
							</c>
							<c ca="center">
								<p>10</p>
							</c>
							<c ca="center">
								<p>100K</p>
							</c>
							<c ca="center">
								<p>1.24e-6</p>
							</c>
							<c ca="center">
								<p>0.289</p>
							</c>
							<c ca="left">
								<p>
									<it>hoxb1a, hoxb2a, hoxb3a, hoxb4a, hoxb5a, hoxb6a, hoxb7a, hoxb8a, hoxb9a, hoxb10a</it>
								</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>4</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>197K</p>
							</c>
							<c ca="center">
								<p>9.19e-7</p>
							</c>
							<c ca="center">
								<p>0.031</p>
							</c>
							<c ca="left">
								<p>
									<it>zgc:86611, psmc2, psmc2, smo, si:dkey-180p18.2, si:dkey-180p18.2, ube2h, nrf1</it>
								</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>5</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>89k</p>
							</c>
							<c ca="center">
								<p>1.36e-5</p>
							</c>
							<c ca="center">
								<p>0.213</p>
							</c>
							<c ca="left">
								<p>
									<it>ptges, usp20, zgc:103692, surf5, rpl7a, surf1, surf6l, ccbl1</it>
								</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>13</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>139K</p>
							</c>
							<c ca="center">
								<p>6.39e-5</p>
							</c>
							<c ca="center">
								<p>0.148</p>
							</c>
							<c ca="left">
								<p>
									<it>xxx, golga5, rtf1, zgc:113197, zgc:113197, tmem39b, xxx, zgc:123267</it>
								</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>15</p>
							</c>
							<c ca="center">
								<p>17</p>
							</c>
							<c ca="center">
								<p>174K</p>
							</c>
							<c ca="center">
								<p>2.69e-7</p>
							</c>
							<c ca="center">
								<p>0.371</p>
							</c>
							<c ca="left">
								<p>
									<it>xxx, or7.1, or2.6, or2.4, xxx, or2.7, xxx, or2.5, or2.1, or2.10, or2.8, xxx, or13.4, or5.1, or5.3, or5.4, or5.2</it>
								</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>19</p>
							</c>
							<c ca="center">
								<p>13</p>
							</c>
							<c ca="center">
								<p>189K</p>
							</c>
							<c ca="center">
								<p>4.12e-7</p>
							</c>
							<c ca="center">
								<p>0.229</p>
							</c>
							<c ca="left">
								<p>
									<it>Kifc1, zbtb22, daxx, tpsn, xxx, xxx, xxx, psmb10, psmb11, psmb9a, xxx, brd2, fabgl</it>
								</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>159K</p>
							</c>
							<c ca="center">
								<p>3.04e-5</p>
							</c>
							<c ca="center">
								<p>-0.026</p>
							</c>
							<c ca="left">
								<p>
									<it>stk3, zgc:92739, rpl30, laptm4b, lyricl, rrm2b, azin1, atp6v1c1l</it>
								</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>20</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>168K</p>
							</c>
							<c ca="center">
								<p>1.77e-6</p>
							</c>
							<c ca="center">
								<p>-0.075</p>
							</c>
							<c ca="left">
								<p>
									<it>zgc:55404, si:dkeyp-55f12.3, itpk1, chga, si:dkey-177p2.3, ahsa1, si:dkey-177p2.6, zgc:92217</it>
								</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>110K</p>
							</c>
							<c ca="center">
								<p>2.61e-4</p>
							</c>
							<c ca="center">
								<p>0.259</p>
							</c>
							<c ca="left">
								<p>
									<it>xxx, xxx, paics, cyp2j21, cyp2j22, cyp2j25, cyp2j26, cyp2j28</it>
								</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>23</p>
							</c>
							<c ca="center">
								<p>10</p>
							</c>
							<c ca="center">
								<p>96K</p>
							</c>
							<c ca="center">
								<p>2.12e-5</p>
							</c>
							<c ca="center">
								<p>0.606</p>
							</c>
							<c ca="left">
								<p>
									<it>hoxc13a, xxx, hoxc11a, hoxc10a, hoxc9a, hoxc8a, hoxc6a, hoxc5a, hoxc4a, hoxc3a</it>
								</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<p>It is proposed that evolutionary selection organizes genes according to their biological function so that their expression can be co-ordinately regulated. To test this hypothesis, we use the GO database <abbrgrp>
						<abbr bid="B22">22</abbr>
					</abbrgrp> as a source of annotations of biological processes. The number of clusters formed by genes in some GO class is listed in Table <tblr tid="T4">4</tblr> (see Additional file <supplr sid="S4">4</supplr> for details). The number is very much reduced when compared to the total number of clusters for each size. Thus, most positional gene clusters we observe are likely not composed of genes with similar biological functions. As suggested by Spellman and Rubin <abbrgrp>
						<abbr bid="B10">10</abbr>
					</abbrgrp>, the above hypothesis is not supported.</p>
				<suppl id="S4">
					<title>
						<p>Additional file 4</p>
					</title>
					<text>
						<p>
							<b>Positional clusters formed by genes with GO classes.</b> It contains spreadsheet data from analysis of positional clusters that are composed of genes within GO classes.</p>
					</text>
					<file name="1471-2164-10-42-S4.xls">
						<p>Click here for file</p>
					</file>
				</suppl>
				<tbl id="T4">
					<title>
						<p>Table 4</p>
					</title>
					<caption>
						<p>Number of positional gene clusters found with intergenic distance <it>D </it>= <it>25K</it>.</p>
					</caption>
					<tblbdy cols="9">
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c cspan="7" ca="center">
								<p>
									<b>Size of gene clusters</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>2</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>3</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>4</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>5</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>6</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>7</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>> = 8</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="9">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>Complete data</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Number of clusters formed by genes in some GO class</b>
								</p>
							</c>
							<c ca="center">
								<p>328</p>
							</c>
							<c ca="center">
								<p>48</p>
							</c>
							<c ca="center">
								<p>12</p>
							</c>
							<c ca="center">
								<p>3</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>3</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="8">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Total number of clusters</b>
								</p>
							</c>
							<c ca="center">
								<p>847</p>
							</c>
							<c ca="center">
								<p>280</p>
							</c>
							<c ca="center">
								<p>105</p>
							</c>
							<c ca="center">
								<p>35</p>
							</c>
							<c ca="center">
								<p>25</p>
							</c>
							<c ca="center">
								<p>10</p>
							</c>
							<c ca="center">
								<p>10</p>
							</c>
						</r>
						<r>
							<c cspan="9">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>Data without tandem duplicates</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Number of clusters formed by genes in some GO class</b>
								</p>
							</c>
							<c ca="center">
								<p>322</p>
							</c>
							<c ca="center">
								<p>33</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>2</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="8">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Total number of clusters</b>
								</p>
							</c>
							<c ca="center">
								<p>867</p>
							</c>
							<c ca="center">
								<p>260</p>
							</c>
							<c ca="center">
								<p>94</p>
							</c>
							<c ca="center">
								<p>37</p>
							</c>
							<c ca="center">
								<p>16</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
							<c ca="center">
								<p>8</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<p>Finally, we investigated whether there is a correlation between the mean <it>R </it>of gene pairs and <it>p</it>-value for a positional cluster. With <it>D </it>= 25K, we considered all the pairs of the neighbouring genes in the same cluster. We divided the gene pairs into seven categories according to the <it>p</it>-value of the clusters to which the gene pairs belong. These seven categories correspond one-to-one to the following intervals of <it>p</it>-values: 0~10<sup>-6</sup>, 10<sup>-6</sup>~10<sup>-5</sup>, 10<sup>-5</sup>~10<sup>-4</sup>, 10<sup>-4</sup>~10<sup>-3</sup>, 10<sup>-3</sup>~10<sup>-2</sup>, 10<sup>-2</sup>~10<sup>-1</sup>, 10<sup>-1</sup>~1. To simplify presentation we consider (base 10 logarithm) -lg <it>p</it>-value instead of <it>p</it>-value, and use the intervals: 0~1, 1~2, 2~3, 3~4, 4~5, 5~6, > 6. We calculated the mean <it>R</it> of the neighbouring gene pairs in each category and observed a significant correlation between -lg <it>p</it>-value and the degree of coexpression of neighbouring gene pairs, either using the complete dataset (Figure <figr fid="F4">4</figr>) or the dataset after tandem duplicates are removed (Figure <figr fid="F5">5</figr>). This correlation is extremely significant for gene pairs that are transcripted in the parallel orientation. The mean <it>R</it> value is as high as 0.5088 (with standard error 0.0642) for the complete dataset and 0.3228 (with standard error 0.1330) even after tandem duplicates are removed. We also observed that at low <it>p</it>-value (high -lg <it>p</it>-value), more gene pairs in the identified clusters are transcribed in the parallel orientation, even with tandem duplicates (Table <tblr tid="T5">5</tblr>). We examined a correlation between -lg <it>p</it>-value and neighbouring gene distance to find if such a high correlation can be explained with intergenic distance. No such correlation was found (Figure <figr fid="F6">6</figr>).</p>
				<fig id="F4">
					<title>
						<p>Figure 4</p>
					</title>
					<caption>
						<p>Mean <it>R </it>of neighboring gene pairs in different -lg <it>p</it>-value intervals</p>
					</caption>
					<text>
						<p>
							<b>Mean <it>R </it>of neighboring gene pairs in different -lg <it>p</it>-value intervals</b>. Mean <it>R </it>values of neighboring gene pairs in -lg <it>p </it>intervals. All <it>p</it>-values were calculated with <it>D </it>= <it>25K</it>. Gene pairs grouped into parallel, divergent, and convergent orientations are plotted similarly. There is only one gene pair has -lg <it>p</it>-value in the interval 5~6, for both the divergent and convergent cases. They are hence omitted from the plot.</p>
					</text>
					<graphic file="1471-2164-10-42-4"/>
				</fig>
				<fig id="F5">
					<title>
						<p>Figure 5</p>
					</title>
					<caption>
						<p>Results from the same analysis as in Figure 4 after tandem duplicates are removed</p>
					</caption>
					<text>
						<p>
							<b>Results from the same analysis as in Figure </b>
							<figr fid="F4">4</figr>
							<b> after tandem duplicates are removed</b>.</p>
					</text>
					<graphic file="1471-2164-10-42-5"/>
				</fig>
				<fig id="F6">
					<title>
						<p>Figure 6</p>
					</title>
					<caption>
						<p>Average distance between neighboring gene pairs in different -lg <it>p</it>-value intervals</p>
					</caption>
					<text>
						<p>
							<b>Average distance between neighboring gene pairs in different -lg <it>p</it>-value intervals</b>.</p>
					</text>
					<graphic file="1471-2164-10-42-6"/>
				</fig>
				<tbl id="T5">
					<title>
						<p>Table 5</p>
					</title>
					<caption>
						<p>Number of neighboring gene pairs found in clusters in different -lg <it>p</it>-value intervals (<it>D </it>= <it>25K</it>).</p>
					</caption>
					<tblbdy cols="9">
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Transcription orientation</b>
								</p>
							</c>
							<c cspan="7" ca="center">
								<p>
									<b>Number of gene pairs within -lg <it>p</it>-value range</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>0~1</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>1~2</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>2~3</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>3~4</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>4~5</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>5~6</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>6~</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="9">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>Analyzed with full dataset</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>All orientations</b>
								</p>
							</c>
							<c ca="center">
								<p>64</p>
							</c>
							<c ca="center">
								<p>1101</p>
							</c>
							<c ca="center">
								<p>660</p>
							</c>
							<c ca="center">
								<p>210</p>
							</c>
							<c ca="center">
								<p>79</p>
							</c>
							<c ca="center">
								<p>16</p>
							</c>
							<c ca="center">
								<p>35</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="8">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Parallel</b>
								</p>
							</c>
							<c ca="center">
								<p>30</p>
							</c>
							<c ca="center">
								<p>541</p>
							</c>
							<c ca="center">
								<p>331</p>
							</c>
							<c ca="center">
								<p>107</p>
							</c>
							<c ca="center">
								<p>48</p>
							</c>
							<c ca="center">
								<p>14</p>
							</c>
							<c ca="center">
								<p>23</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="8">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Divergent</b>
								</p>
							</c>
							<c ca="center">
								<p>18</p>
							</c>
							<c ca="center">
								<p>237</p>
							</c>
							<c ca="center">
								<p>167</p>
							</c>
							<c ca="center">
								<p>51</p>
							</c>
							<c ca="center">
								<p>17</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>6</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="8">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Convergent</b>
								</p>
							</c>
							<c ca="center">
								<p>16</p>
							</c>
							<c ca="center">
								<p>323</p>
							</c>
							<c ca="center">
								<p>162</p>
							</c>
							<c ca="center">
								<p>52</p>
							</c>
							<c ca="center">
								<p>14</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>6</p>
							</c>
						</r>
						<r>
							<c cspan="9">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>Analyzed w/o tandem duplicates</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>All orientations</b>
								</p>
							</c>
							<c ca="center">
								<p>63</p>
							</c>
							<c ca="center">
								<p>1073</p>
							</c>
							<c ca="center">
								<p>586</p>
							</c>
							<c ca="center">
								<p>169</p>
							</c>
							<c ca="center">
								<p>83</p>
							</c>
							<c ca="center">
								<p>14</p>
							</c>
							<c ca="center">
								<p>19</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="8">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Parallel</b>
								</p>
							</c>
							<c ca="center">
								<p>27</p>
							</c>
							<c ca="center">
								<p>518</p>
							</c>
							<c ca="center">
								<p>291</p>
							</c>
							<c ca="center">
								<p>89</p>
							</c>
							<c ca="center">
								<p>49</p>
							</c>
							<c ca="center">
								<p>12</p>
							</c>
							<c ca="center">
								<p>9</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="8">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Divergent</b>
								</p>
							</c>
							<c ca="center">
								<p>17</p>
							</c>
							<c ca="center">
								<p>244</p>
							</c>
							<c ca="center">
								<p>152</p>
							</c>
							<c ca="center">
								<p>41</p>
							</c>
							<c ca="center">
								<p>18</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="8">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Convergent</b>
								</p>
							</c>
							<c ca="center">
								<p>19</p>
							</c>
							<c ca="center">
								<p>311</p>
							</c>
							<c ca="center">
								<p>143</p>
							</c>
							<c ca="center">
								<p>39</p>
							</c>
							<c ca="center">
								<p>16</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
							<c ca="center">
								<p>5</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<p>As the zebrafish genome is almost completely sequenced, more and more information has been available for genome-wide analysis. Using public microarray datasets and gene annotation, we investigated, for the first time, the global gene expression patterns in the zebrafish genome. Our results have several implications.</p>
			<p>First, proximate genes in the zebrafish genome tend to coexpress at a significant level. This is partially due to tandem genes, which often have high degree of coexpression. The coexpression level decreases with these tandem genes excluded, but still remain significant. These observations are in general comparable &#8211; in both effects and magnitude &#8211; to the other studies surveyed in <abbrgrp>
					<abbr bid="B1">1</abbr>
				</abbrgrp> and <abbrgrp>
					<abbr bid="B2">2</abbr>
				</abbrgrp>. We measured the degree of coexpression not only between two neighbouring genes, but also among the genes in blocks of sizes from 3 to 20. In each case, the degree of coexpression is significant compared with that of a random genome in which the genes are randomly rearranged.</p>
			<p>As shown in other genomes <abbrgrp>
					<abbr bid="B14">14</abbr>
					<abbr bid="B15">15</abbr>
					<abbr bid="B26">26</abbr>
				</abbrgrp>, the degree of coexpression is to some extend influenced by the intergenic distance, and there is evidence of clusters that span 20 coexpressing genes. To investigate this fact further, we examined whether genes in positional clusters of arbitrary span have more significant level of coexpression or not. The average intergenic distance is about 200 kb with or without tandem genes. Using the statistical method proposed by Li, Lee and Zhang <abbrgrp>
					<abbr bid="B19">19</abbr>
				</abbrgrp>, we examined the positional gene clusters within which the intergenic distance is less than 25 kb. As shown in Table <tblr tid="T3">3</tblr>, the genes in these positional clusters usually have higher degrees of coexpression although most of these clusters are composed of genes in different GO classes. We observe ten large positional clusters each having eight or more genes. One of these statistically significant clusters contains 13 highly co-expressed genes in the <it>Mhc </it>class I region. Interestingly, Murray et al. <abbrgrp>
					<abbr bid="B24">24</abbr>
				</abbrgrp> noted that the <it>psmb </it>genes on the zebrafish <it>Mhc </it>class I region concurs with Hughes' hypothesis of a selective advantage to the clustering of genes with similar expression patterns <abbrgrp>
					<abbr bid="B27">27</abbr>
				</abbrgrp>. Moreover, five clusters listed in Table <tblr tid="T3">3</tblr> have not been investigated yet. The genes in these newly identified clusters are probably worthy to be investigated biologically.</p>
			<p>We also investigated the effect of transcript orientation on the level of coexpression. In yeast <abbrgrp>
					<abbr bid="B14">14</abbr>
				</abbrgrp>, human <abbrgrp>
					<abbr bid="B26">26</abbr>
				</abbrgrp>, <it>Arabidopsis </it>
				<abbrgrp>
					<abbr bid="B15">15</abbr>
				</abbrgrp>, and <it>C. elegans </it>
				<abbrgrp>
					<abbr bid="B6">6</abbr>
				</abbrgrp>, it is observed that transcript-divergent neighbouring genes have higher coexpression level than transcript-parallel or transcript-convergent neighbouring genes. Our study finds that transcript-parallel or transcript-divergent neighbouring genes have higher coexpression level than transcript-convergent genes in the zebrafish genome. The fact that the transcript-convergent neighbouring genes have the lowest level of coexpression is consistent with the studies that are just mentioned. This fact is likely related to 5' <it>cis</it>-regulatory elements <abbrgrp>
					<abbr bid="B6">6</abbr>
				</abbrgrp>. Only transcript-parallel or -divergent gene pairs can be driven by a 5' cis-regulatory element. For example, the genes in the identified positional cluster in the <it>Mhc </it>class I region on Ch19 are believed to be coregulated from shared bidirectional promoters <abbrgrp>
					<abbr bid="B24">24</abbr>
				</abbrgrp>. However, that the transcript-parallel neighbouring genes have a higher level of coexpression than the transcript-divergent neighbouring genes could be special to the zebrafish genome as such a fact has not been reported in other genomes to our best knowledge. The underlying cause for it could be that there are less bidirectional promoters in the zebrafish genome than in mammalian genomes; it may also be due to the fact that the tandem neighbouring genes that have parallel orientation are strongly coexpressed in the zebrafish genome, which may result from our analysis being done on a partial list of genes and incomplete positional information. When the zebrafish genome is completely sequenced in the near future, repeating our analysis will definitely give a better picture of the influence of transcript orientation on the coexpression level of zebrafish genes.</p>
		</sec>
		<sec>
			<st>
				<p>Conclusion</p>
			</st>
			<p>In summary, we have observed that gene order of the zebrafish is non-random. In addition, the statistical significance of genes' positional clustering is positively correlated to coexpression degree. These facts suggest that the clustering of genes may be subjected to selection forces that favour having coexpressing genes in close proximity.</p>
		</sec>
		<sec>
			<st>
				<p>Methods</p>
			</st>
			<sec>
				<st>
					<p>Microarray data sources</p>
				</st>
				<p>Our gene-expression datasets are compiled from several previous studies with the Affymetrix GeneChip<sup>&#174; </sup>Zebrafish Genome Array (GeneChip 430), which contains 39,000 <it>Danio rerio </it>transcripts. These microarray data were used to study the transcriptional changes of genes in embryonic development and divided into the following four groups:</p>
				<p>(i) Nine expression datasets based on experiments on zebrafish embryonic fibroblast cell lines ZF4 and PAC2 (with accession id E-MEXP-736 in ArrayExpress) <abbrgrp>
						<abbr bid="B28">28</abbr>
					</abbrgrp>. They are the gene expression profiles of these two cell lines in cultures with and without the presence of serum.</p>
				<p>(ii) Two expression datasets (id E-MEXP-737) derived from experiments on the 24-hour embryos from the Tuebingen cell line <abbrgrp>
						<abbr bid="B28">28</abbr>
					</abbrgrp>.</p>
				<p>(iii) Forty-two expression datasets (id E-MEXP-758) from analysis of the transcriptional response to TCDD at different stages <abbrgrp>
						<abbr bid="B29">29</abbr>
					</abbrgrp>. They were used to identify the gene expression changes in the heart and other tissues of zebrafish larvae at 1 h, 2 h, 4 h and 12 h after exposure to TCDD beginning at 72 h fertilization, and</p>
				<p>(iv) Forty-one expression datasets collected at the Lab of Functional Genomics, the Institute of Molecular and Cell Biology, Singapore <abbrgrp>
						<abbr bid="B30">30</abbr>
						<abbr bid="B31">31</abbr>
						<abbr bid="B32">32</abbr>
					</abbrgrp>. The microarray data in <abbrgrp>
						<abbr bid="B30">30</abbr>
					</abbrgrp> were collected in the experiments with RNA extracted from wild type AB and <it>def</it>
					<sup>
						<it>hi</it>429 </sup>mutant embryos after 5-day fertilization. In <abbrgrp>
						<abbr bid="B31">31</abbr>
					</abbrgrp>, microarray gene expression profiles of the liver and the remaining liver-free body of adult zebrafish (wild type AB strain) were used to study the regulation mechanism of liver-enriched genes. In <abbrgrp>
						<abbr bid="B32">32</abbr>
					</abbrgrp>, microarray data were obtained from the experiments with RNA samples extracted from five embryos for gene expression profiling of the 18-somite zebrafish cloche mutant, in which development of hematopoietic lineage is severely impaired.</p>
				<p>Microarray experiment database ArrayExpress is available at EMBL-EBI <abbrgrp>
						<abbr bid="B33">33</abbr>
					</abbrgrp>. These available datasets were preprocessed by using the invariant set normalization method <abbrgrp>
						<abbr bid="B34">34</abbr>
					</abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Locating expressed genes in Zebrafish genome</p>
				</st>
				<p>Expressed genes were identified using the Ensembl database. The clone sequences in Affymetrix Zebrafish Genome Array were aligned back to the zebrafish genomic sequence (available from <abbrgrp>
						<abbr bid="B35">35</abbr>
					</abbrgrp>) using BLAST program. This results in 6802 expressed zebrafish genes. The positional information of these genes was then extracted to arrange the genes for analysis.</p>
			</sec>
			<sec>
				<st>
					<p>Identification of biological processes genes are involved in</p>
				</st>
				<p>The zebrafish GO terms are obtained along with the genomic sequences from the Ensembl database <abbrgrp>
						<abbr bid="B35">35</abbr>
					</abbrgrp>. It annotates the 6802 genes into 1722 GO classes. The correspondence between gene IDs and Affymetrix Zebrafish DB IDs is obtained similarly.</p>
			</sec>
			<sec>
				<st>
					<p>Removal of tandem genes</p>
				</st>
				<p>We used the same criterion to remove tandem duplicates as in <abbrgrp>
						<abbr bid="B3">3</abbr>
					</abbrgrp>. Two genes are considered as tandem duplicate if they are within 100 genes from each other and aligned by BLAST with E-value less than 0.2. After a tandem gene cluster was detected, we removed all but one gene from the analysis. After removal of 215 tandem genes, 6587 genes remained for further analysis.</p>
			</sec>
			<sec>
				<st>
					<p>Measuring the level of coexpression</p>
				</st>
				<p>Pearson's correlation coefficient (<it>R</it>) is used to measure the level of coexpression between two genes. For two genes <it>X </it>and <it>Y </it>with expression values (<it>x</it>
					<sub>1</sub>, <it>x</it>
					<sub>2</sub>,... <it>x</it>
					<sub>
						<it>n</it>
					</sub>) and (<it>y</it>
					<sub>1</sub>, <it>y</it>
					<sub>2</sub>,..., <it>y</it>
					<sub>
						<it>n</it>
					</sub>) in <it>n </it>microarray experiments respectively, <it>R</it>(<it>X</it>, <it>Y</it>) is computed as</p>
				<p>
					<display-formula>
						<m:math name="1471-2164-10-42-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:mfrac>
										<m:mrow>
											<m:mstyle displaystyle="true">
												<m:msub>
													<m:mo>&#8721;</m:mo>
													<m:mi>i</m:mi>
												</m:msub>
												<m:mrow>
													<m:mo stretchy="false">(</m:mo>
													<m:msub>
														<m:mi>x</m:mi>
														<m:mi>i</m:mi>
													</m:msub>
													<m:mo>&#8722;</m:mo>
													<m:mover accent="true">
														<m:mi>x</m:mi>
														<m:mo>&#175;</m:mo>
													</m:mover>
													<m:mo stretchy="false">)</m:mo>
													<m:mo stretchy="false">(</m:mo>
													<m:msub>
														<m:mi>y</m:mi>
														<m:mi>i</m:mi>
													</m:msub>
													<m:mo>&#8722;</m:mo>
													<m:mover accent="true">
														<m:mi>y</m:mi>
														<m:mo>&#175;</m:mo>
													</m:mover>
													<m:mo stretchy="false">)</m:mo>
												</m:mrow>
											</m:mstyle>
										</m:mrow>
										<m:mrow>
											<m:msqrt>
												<m:mrow>
													<m:mstyle displaystyle="true">
														<m:msub>
															<m:mo>&#8721;</m:mo>
															<m:mi>i</m:mi>
														</m:msub>
														<m:mrow>
															<m:msup>
																<m:mrow>
																	<m:mo stretchy="false">(</m:mo>
																	<m:msub>
																		<m:mi>x</m:mi>
																		<m:mi>i</m:mi>
																	</m:msub>
																	<m:mo>&#8722;</m:mo>
																	<m:mover accent="true">
																		<m:mi>x</m:mi>
																		<m:mo>&#175;</m:mo>
																	</m:mover>
																	<m:mo stretchy="false">)</m:mo>
																</m:mrow>
																<m:mn>2</m:mn>
															</m:msup>
														</m:mrow>
													</m:mstyle>
													<m:mstyle displaystyle="true">
														<m:msub>
															<m:mo>&#8721;</m:mo>
															<m:mi>i</m:mi>
														</m:msub>
														<m:mrow>
															<m:msup>
																<m:mrow>
																	<m:mo stretchy="false">(</m:mo>
																	<m:msub>
																		<m:mi>y</m:mi>
																		<m:mi>i</m:mi>
																	</m:msub>
																	<m:mo>&#8722;</m:mo>
																	<m:mover accent="true">
																		<m:mi>y</m:mi>
																		<m:mo>&#175;</m:mo>
																	</m:mover>
																	<m:mo stretchy="false">)</m:mo>
																</m:mrow>
																<m:mn>2</m:mn>
															</m:msup>
														</m:mrow>
													</m:mstyle>
												</m:mrow>
											</m:msqrt>
										</m:mrow>
									</m:mfrac>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqcfa4aaSaaaeaadaaeqaqaaiabcIcaOiabdIha4naaBaaabaGaemyAaKgabeaacqGHsislcuWG4baEgaqeaiabcMcaPiabcIcaOiabdMha5naaBaaabaGaemyAaKgabeaacqGHsislcuWG5bqEgaqeaiabcMcaPaqaaiabdMgaPbqabiabggHiLdaabaWaaOaaaeaadaaeqaqaaiabcIcaOiabdIha4naaBaaabaGaemyAaKgabeaacqGHsislcuWG4baEgaqeaiabcMcaPmaaCaaabeqaaiabikdaYaaaaeaacqWGPbqAaeqacqGHris5amaaqababaGaeiikaGIaemyEaK3aaSbaaeaacqWGPbqAaeqaaiabgkHiTiqbdMha5zaaraGaeiykaKYaaWbaaeqabaGaeGOmaidaaaqaaiabdMgaPbqabiabggHiLdaabeaaaaaaaa@5507@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<p>where <inline-formula>
						<m:math name="1471-2164-10-42-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mover accent="true">
									<m:mi>x</m:mi>
									<m:mo>&#175;</m:mo>
								</m:mover>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafmiEaGNbaebaaaa@2D66@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula> and <inline-formula>
						<m:math name="1471-2164-10-42-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mover accent="true">
									<m:mi>y</m:mi>
									<m:mo>&#175;</m:mo>
								</m:mover>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafmyEaKNbaebaaaa@2D68@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula> are the mean expression value of <it>X </it>and <it>Y </it>in an experiment respectively.</p>
				<p>The significance of mean <it>R </it>calculated from the real data was estimated by comparing it with the mean <it>R</it>s for 10000 random genomes. In each random genome, genes were rearranged though a series of transposes.</p>
				<p>To analyze the level of coexpression among the multiple proximate genes, we divided the genes in the Zebrafish genome into non-overlapping blocks, each of <it>k </it>consecutive genes, where <it>k </it>is a fixed integer from 3 to 20. For a block of <it>k </it>genes, there are <it>k</it>(<it>k </it>- 1)/2 pairs of genes; the mean <it>R </it>of these pairs is used to measure the degree of coexpression among the genes in the block, called the block R. The mean block <it>R </it>was compared with mean block <it>R</it>s calculated from the randomized genome.</p>
			</sec>
			<sec>
				<st>
					<p>Analysis of positional gene clusters</p>
				</st>
				<p>We examined positional gene clusters in the neighbourhood model. In the neighbourhood model, two genes x and y are in a cluster if and only if the distance between any two adjacent genes locating between x and y is less than a fixed threshold (<it>D</it>). For a cluster of <it>n </it>genes, its <it>p</it>-value is equal to (1 - <it>e</it>
					<sup>-<it>&#945;D</it>
					</sup>)<sup>
						<it>n</it>
					</sup>, where <it>&#945; </it>is the gene density in a considered region <abbrgrp>
						<abbr bid="B19">19</abbr>
					</abbrgrp>. Since gene density varies in the zebrafish genome, we set &#945; to be the gene density in the 2 Mbp region around the gene cluster.</p>
				<p>This formula is derived under the assumption that the (start) position of a gene is uniformly distributed. This assumption is obviously invalid in a whole chromosome since there are gene dense and sparse regions in a genome. Thus, we focus on the 2 Mbp region centred on the gene cluster to be analyzed.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Abbreviations</p>
			</st>
			<p>SAGE: Serial Analysis of Gene Expression; GO: Gene Ontology; TCDD: tetrachlorodibenzo-<it>p</it>-dioxin.</p>
		</sec>
		<sec>
			<st>
				<p>Authors' contributions</p>
			</st>
			<p>WW prepared the microarray data and performed initial analyses. YKN performed the analyses herein. LXZ conceived of the study, coordinated the analysis. YKN and LXZ prepared the final manuscript. All authors read and approved the final manuscript.</p>
		</sec>
	</bdy>
   <bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>LX Zhang gratefully acknowledged the NUS ARF grant R-146-000-109-112. We thank the referees for helpful suggestions and providing more references related to the work.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>The evolutionary dynamics of eukaryotic gene order</p>
				</title>
				<aug>
					<au>
						<snm>Hurst</snm>
						<fnm>LD</fnm>
					</au>
					<au>
						<snm>Pal</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Lercher</snm>
						<fnm>MJ</fnm>
					</au>
				</aug>
				<source>Nat Rev Genet</source>
				<pubdate>2004</pubdate>
				<volume>5</volume>
				<fpage>299</fpage>
				<lpage>310</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrg1319</pubid>
						<pubid idtype="pmpid" link="fulltext">15131653</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Coexpression, coregulation, and cofunctionality of neighboring genes in eukaryotic genomes</p>
				</title>
				<aug>
					<au>
						<snm>Michalak</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Genomics</source>
				<pubdate>2008</pubdate>
				<volume>91</volume>
				<fpage>243</fpage>
				<lpage>248</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.ygeno.2007.11.002</pubid>
						<pubid idtype="pmpid" link="fulltext">18082363</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Clustering of housekeeping genes provides a unified model of gene order in the human genome</p>
				</title>
				<aug>
					<au>
						<snm>Lercher</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Urrutia</snm>
						<fnm>AO</fnm>
					</au>
					<au>
						<snm>Hurst</snm>
						<fnm>LD</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2002</pubdate>
				<volume>31</volume>
				<fpage>180</fpage>
				<lpage>183</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/ng887</pubid>
						<pubid idtype="pmpid" link="fulltext">11992122</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Gene clusters and polycistronic transcription in eukaryotes</p>
				</title>
				<aug>
					<au>
						<snm>Blumenthal</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>BioEssays</source>
				<pubdate>1998</pubdate>
				<volume>20</volume>
				<fpage>480</fpage>
				<lpage>487</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/(SICI)1521-1878(199806)20:6&lt;480::AID-BIES6>3.0.CO;2-Q</pubid>
						<pubid idtype="pmpid" link="fulltext">9699460</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Chromosomal clustering of muscle-expressed genes in <it>Caenorhabditis elegans</it>
					</p>
				</title>
				<aug>
					<au>
						<snm>Roy</snm>
						<fnm>PJ</fnm>
					</au>
					<au>
						<snm>Stuart</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Lund</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Kim</snm>
						<fnm>SK</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2002</pubdate>
				<volume>418</volume>
				<fpage>975</fpage>
				<lpage>979</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12214599</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Coexpression of neighboring genes in <it>Caenorhabditis elegans </it>is mostly due to operons and duplicate genes</p>
				</title>
				<aug>
					<au>
						<snm>Lercher</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Blumenthal</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Hurst</snm>
						<fnm>LD</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2003</pubdate>
				<volume>13</volume>
				<fpage>238</fpage>
				<lpage>243</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">420373</pubid>
						<pubid idtype="pmpid" link="fulltext">12566401</pubid>
						<pubid idtype="doi">10.1101/gr.553803</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Conservation and functional significance of gene topology in the genome of <it>Caenorhabditis elegans</it>
					</p>
				</title>
				<aug>
					<au>
						<snm>Chen</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Stein</snm>
						<fnm>LD</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2006</pubdate>
				<volume>16</volume>
				<fpage>606</fpage>
				<lpage>617</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1457050</pubid>
						<pubid idtype="pmpid" link="fulltext">16606698</pubid>
						<pubid idtype="doi">10.1101/gr.4515306</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>A computational analysis of whole-genome expression data reveals chromosomal domains of gene expression</p>
				</title>
				<aug>
					<au>
						<snm>Cohen</snm>
						<fnm>BA</fnm>
					</au>
					<au>
						<snm>Mitra</snm>
						<fnm>RD</fnm>
					</au>
					<au>
						<snm>Hughes</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Church</snm>
						<fnm>GM</fnm>
					</au>
				</aug>
				<source>Nature Genet</source>
				<pubdate>2000</pubdate>
				<volume>26</volume>
				<fpage>183</fpage>
				<lpage>186</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/79896</pubid>
						<pubid idtype="pmpid" link="fulltext">11017073</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Large clusters of coexpressed genes in the <it>Drosophila </it>genome</p>
				</title>
				<aug>
					<au>
						<snm>Boutanaev</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Kalmykova</snm>
						<fnm>AI</fnm>
					</au>
					<au>
						<snm>Shevelyov</snm>
						<fnm>YY</fnm>
					</au>
					<au>
						<snm>Nurminsky</snm>
						<fnm>DI</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2002</pubdate>
				<volume>420</volume>
				<fpage>666</fpage>
				<lpage>669</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature01216</pubid>
						<pubid idtype="pmpid" link="fulltext">12478293</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Evidence for large domains of similarly expressed genes in the <it>Drosophila </it>genome</p>
				</title>
				<aug>
					<au>
						<snm>Spellman</snm>
						<fnm>PT</fnm>
					</au>
					<au>
						<snm>Rubin</snm>
						<fnm>GM</fnm>
					</au>
				</aug>
				<source>J Biol</source>
				<pubdate>2002</pubdate>
				<volume>1</volume>
				<fpage>5</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">117248</pubid>
						<pubid idtype="pmpid" link="fulltext">12144710</pubid>
						<pubid idtype="doi">10.1186/1475-4924-1-5</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Modelling the correlation between the activities of adjacent genes in drosophila</p>
				</title>
				<aug>
					<au>
						<snm>Thygesen</snm>
						<fnm>HH</fnm>
					</au>
					<au>
						<snm>Zwinderman</snm>
						<fnm>AH</fnm>
					</au>
				</aug>
				<source>BMC Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<issue>10</issue>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">547897</pubid>
						<pubid idtype="pmpid" link="fulltext">15659243</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Coordinated evolution of co-expressed gene clusters in the <it>Drosophila </it>transcriptome</p>
				</title>
				<aug>
					<au>
						<snm>Mezey</snm>
						<fnm>JG</fnm>
					</au>
					<au>
						<snm>Nuzhdin</snm>
						<fnm>SV</fnm>
					</au>
					<au>
						<snm>Ye</snm>
						<fnm>FF</fnm>
					</au>
					<au>
						<snm>Jones</snm>
						<fnm>CD</fnm>
					</au>
				</aug>
				<source>BMC Evol Biol</source>
				<pubdate>2008</pubdate>
				<volume>8</volume>
				<issue>2</issue>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">2266709</pubid>
						<pubid idtype="pmpid" link="fulltext">18179715</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>A genome-wide transcriptional analysis of the mitotic cell cycle</p>
				</title>
				<aug>
					<au>
						<snm>Cho</snm>
						<fnm>RJ</fnm>
					</au>
					<au>
						<snm>Campbell</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Winzeler</snm>
						<fnm>EA</fnm>
					</au>
					<au>
						<snm>Steinmetz</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Conway</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Wodicka</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Wolfsberg</snm>
						<fnm>TG</fnm>
					</au>
					<au>
						<snm>Gabrielian</snm>
						<fnm>AE</fnm>
					</au>
					<au>
						<snm>Landsman</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Lockhart</snm>
						<fnm>DJ</fnm>
					</au>
					<au>
						<snm>Davis</snm>
						<fnm>RW</fnm>
					</au>
				</aug>
				<source>Mol Cell</source>
				<pubdate>1998</pubdate>
				<volume>2</volume>
				<fpage>65</fpage>
				<lpage>73</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S1097-2765(00)80114-8</pubid>
						<pubid idtype="pmpid" link="fulltext">9702192</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Regulation of adjacent yeast genes</p>
				</title>
				<aug>
					<au>
						<snm>Kruglyak</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Tang</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2000</pubdate>
				<volume>16</volume>
				<fpage>109</fpage>
				<lpage>111</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0168-9525(99)01941-1</pubid>
						<pubid idtype="pmpid" link="fulltext">10689350</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Coexpression of neighboring genes in the genome of <it>Arabidopsis thaliana</it>
					</p>
				</title>
				<aug>
					<au>
						<snm>Williams</snm>
						<fnm>EJ</fnm>
					</au>
					<au>
						<snm>Bowles</snm>
						<fnm>DJ</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2004</pubdate>
				<volume>14</volume>
				<fpage>1060</fpage>
				<lpage>1067</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">419784</pubid>
						<pubid idtype="pmpid" link="fulltext">15173112</pubid>
						<pubid idtype="doi">10.1101/gr.2131104</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Genome-wide mapping of unselected transcripts from extraembryonic tissue of 7.5-day mouse embryos reveals enrichment in the t-complex and under-representation on the X chromosome</p>
				</title>
				<aug>
					<au>
						<snm>Ko</snm>
						<fnm>MSH</fnm>
					</au>
					<au>
						<snm>Threat</snm>
						<fnm>TA</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>XQ</fnm>
					</au>
					<au>
						<snm>Horton</snm>
						<fnm>JH</fnm>
					</au>
					<au>
						<snm>Cui</snm>
						<fnm>YS</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>XH</fnm>
					</au>
					<au>
						<snm>Pryor</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Paris</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Wells-Smith</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Kitchen</snm>
						<fnm>JR</fnm>
					</au>
					<au>
						<snm>Rowe</snm>
						<fnm>LB</fnm>
					</au>
					<au>
						<snm>Eppig</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Satoh</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Brant</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Fujiwara</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Yotsumoto</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Nakashima</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Hum Mol Genet</source>
				<pubdate>1998</pubdate>
				<volume>7</volume>
				<fpage>1967</fpage>
				<lpage>1978</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/hmg/7.12.1967</pubid>
						<pubid idtype="pmpid" link="fulltext">9811942</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Partial genome scale analysis of gene expression in human adipose tissue using DNA array</p>
				</title>
				<aug>
					<au>
						<snm>Gabrielsson</snm>
						<fnm>BL</fnm>
					</au>
					<au>
						<snm>Carlsson</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Carlsson</snm>
						<fnm>LM</fnm>
					</au>
				</aug>
				<source>Obes Res</source>
				<pubdate>2000</pubdate>
				<volume>8</volume>
				<fpage>374</fpage>
				<lpage>384</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/oby.2000.45</pubid>
						<pubid idtype="pmpid" link="fulltext">10968729</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Organization of human cardiovascular-expressed genes on chromosomes 21 and 22</p>
				</title>
				<aug>
					<au>
						<snm>Dempsey</snm>
						<fnm>AA</fnm>
					</au>
					<au>
						<snm>Pabalan</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Tang</snm>
						<fnm>HC</fnm>
					</au>
					<au>
						<snm>Liew</snm>
						<fnm>CC</fnm>
					</au>
				</aug>
				<source>J Mol Cell Cardiol</source>
				<pubdate>2001</pubdate>
				<volume>33</volume>
				<issue>3</issue>
				<fpage>587</fpage>
				<lpage>591</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">11181026</pubid>
						<pubid idtype="doi">10.1006/jmcc.2000.1335</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Genome-scale analysis of positional clustering of mouse testis-specific genes</p>
				</title>
				<aug>
					<au>
						<snm>Li</snm>
						<fnm>Q</fnm>
					</au>
					<au>
						<snm>Lee</snm>
						<fnm>BTK</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>LX</fnm>
					</au>
				</aug>
				<source>BMC Genomics</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<issue>1</issue>
				<fpage>7</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">15656914</pubid>
						<pubid idtype="pmcid">548148</pubid>
						<pubid idtype="doi">10.1186/1471-2164-6-7</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>Operons in <it>C. elegans</it>: polycistronic mRNA precursors are processed by trans-splicing of SL2 to downstream coding regions</p>
				</title>
				<aug>
					<au>
						<snm>Spieth</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Brook</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Kuersten</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Lea</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Blumenthal</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Cell</source>
				<pubdate>1993</pubdate>
				<volume>73</volume>
				<fpage>521</fpage>
				<lpage>532</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0092-8674(93)90139-H</pubid>
						<pubid idtype="pmpid" link="fulltext">8098272</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Yeast "Operons"</p>
				</title>
				<aug>
					<au>
						<snm>Zhang</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>TF</fnm>
					</au>
				</aug>
				<source>Microb Comp Genomics</source>
				<pubdate>1998</pubdate>
				<volume>3</volume>
				<issue>2</issue>
				<fpage>133</fpage>
				<lpage>140</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid">9697097</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>The Gene Ontology Consortium</p>
				</title>
				<url>http://www.geneontology.org</url>
			</bibl>
			<bibl id="B23">
				<title>
					<p>Zebrafish hox clusters and vertebrate genome evolution</p>
				</title>
				<aug>
					<au>
						<snm>Amores</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Force</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Yan</snm>
						<fnm>Y-L</fnm>
					</au>
					<au>
						<snm>Amemiya</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Fritz</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Ho</snm>
						<fnm>RK</fnm>
					</au>
					<au>
						<snm>Joly</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Langeland</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Prince</snm>
						<fnm>VE</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>Y-L</fnm>
					</au>
					<au>
						<snm>Westerfield</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Ekker</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Postlethwait</snm>
						<fnm>JH</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>1998</pubdate>
				<volume>282</volume>
				<fpage>1711</fpage>
				<lpage>1714</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.282.5394.1711</pubid>
						<pubid idtype="pmpid" link="fulltext">9831563</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>Analysis of a 26-kb region linked to the <it>Mhc </it>in Zebrafish: genomic organization of the proteasome component &#946;/transporter associated with antigen processing-2 gene cluster and identification of five new proteasome &#946; subunit genes</p>
				</title>
				<aug>
					<au>
						<snm>Murray</snm>
						<fnm>BW</fnm>
					</au>
					<au>
						<snm>S&#252;ltmann</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Klein</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>J Immunol</source>
				<pubdate>1999</pubdate>
				<volume>163</volume>
				<fpage>2657</fpage>
				<lpage>2666</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">10453006</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>A contig map of the <it>Mhc </it>class 1 genomic region in the zebrafish region reveals ancient synteny</p>
				</title>
				<aug>
					<au>
						<snm>Michalova</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Murray</snm>
						<fnm>BW</fnm>
					</au>
					<au>
						<snm>S&#252;ltmann</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Klein</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>J Immunol</source>
				<pubdate>2000</pubdate>
				<volume>164</volume>
				<fpage>5296</fpage>
				<lpage>5305</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">10799891</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B26">
				<title>
					<p>Bidirectional gene organization: A common architectural feature of the human genome</p>
				</title>
				<aug>
					<au>
						<snm>Adachi</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Lieber</snm>
						<fnm>MR</fnm>
					</au>
				</aug>
				<source>Cell</source>
				<pubdate>2002</pubdate>
				<volume>109</volume>
				<fpage>807</fpage>
				<lpage>809</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0092-8674(02)00758-4</pubid>
						<pubid idtype="pmpid" link="fulltext">12110178</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>Phylogenetic tests of the hypothesis of block duplication of homologous genes on human chromosome 6, 9 and 1</p>
				</title>
				<aug>
					<au>
						<snm>Hughes</snm>
						<fnm>AL</fnm>
					</au>
				</aug>
				<source>Mol Biol Evol</source>
				<pubdate>1998</pubdate>
				<volume>15</volume>
				<fpage>854</fpage>
				<lpage>870</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">9656486</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Genetic and transcriptome characterization of model zebrafish cell lines</p>
				</title>
				<aug>
					<au>
						<snm>He</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Salas-Vidal</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Rueb</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Krens</snm>
						<fnm>SFG</fnm>
					</au>
					<au>
						<snm>Meijer</snm>
						<fnm>AH</fnm>
					</au>
					<au>
						<snm>Snaar-Jagalska</snm>
						<fnm>BE</fnm>
					</au>
					<au>
						<snm>Spaink</snm>
						<fnm>HP</fnm>
					</au>
				</aug>
				<source>Zebrafish</source>
				<pubdate>2006</pubdate>
				<volume>3</volume>
				<fpage>441</fpage>
				<lpage>453</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1089/zeb.2006.3.441</pubid>
						<pubid idtype="pmpid" link="fulltext">18377224</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Aryl hydrocarbon receptor activation produces heart-specific transcriptional and toxic responses in developing zebrafish</p>
				</title>
				<aug>
					<au>
						<snm>Carney</snm>
						<fnm>SA</fnm>
					</au>
					<au>
						<snm>Chen</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Burns</snm>
						<fnm>CG</fnm>
					</au>
					<au>
						<snm>Xiong</snm>
						<fnm>KM</fnm>
					</au>
					<au>
						<snm>Peterson</snm>
						<fnm>RE</fnm>
					</au>
					<au>
						<snm>Heideman</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>Mol Pharmacol</source>
				<pubdate>2006</pubdate>
				<volume>70</volume>
				<fpage>549</fpage>
				<lpage>561</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1124/mol.106.025304</pubid>
						<pubid idtype="pmpid" link="fulltext">16714409</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Loss of function of <it>def </it>selectively up-regulates &#916;<it>113p53 </it>expression to arrest expansion growth of digestive organs in zebrafish</p>
				</title>
				<aug>
					<au>
						<snm>Chen</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Ruan</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Ng</snm>
						<fnm>SM</fnm>
					</au>
					<au>
						<snm>Gao</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Soo</snm>
						<fnm>HM</fnm>
					</au>
					<au>
						<snm>Wu</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Wen</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Lane</snm>
						<fnm>DP</fnm>
					</au>
					<au>
						<snm>Peng</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Genes &amp; Dev</source>
				<pubdate>2005</pubdate>
				<volume>19</volume>
				<fpage>2900</fpage>
				<lpage>2911</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1315396</pubid>
						<pubid idtype="pmpid" link="fulltext">16322560</pubid>
						<pubid idtype="doi">10.1101/gad.1366405</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>HNF factors form a network to regulate liver-enriched genes in zebrafish</p>
				</title>
				<aug>
					<au>
						<snm>Cheng</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Guo</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Soo</snm>
						<fnm>HM</fnm>
					</au>
					<au>
						<snm>Wen</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Wu</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Peng</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Dev Biol</source>
				<pubdate>2006</pubdate>
				<volume>294</volume>
				<fpage>482</fpage>
				<lpage>496</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.ydbio.2006.03.018</pubid>
						<pubid idtype="pmpid" link="fulltext">16631158</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Microarray analysis of zebrafish <it>cloche </it>mutant using amplified cDNA and identification of potential downstream target genes</p>
				</title>
				<aug>
					<au>
						<snm>Qian</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Zhen</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Ong</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Jin</snm>
						<fnm>S-W</fnm>
					</au>
					<au>
						<snm>Soo</snm>
						<fnm>H-M</fnm>
					</au>
					<au>
						<snm>Stainier</snm>
						<fnm>DYR</fnm>
					</au>
					<au>
						<snm>Lin</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Peng</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Wen</snm>
						<fnm>Z</fnm>
					</au>
				</aug>
				<source>Dev Dyn</source>
				<pubdate>2005</pubdate>
				<volume>233</volume>
				<fpage>1163</fpage>
				<lpage>1172</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/dvdy.20444</pubid>
						<pubid idtype="pmpid" link="fulltext">15937927</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>ArrayExpress</p>
				</title>
				<url>http://www.ebi.ac.uk/microarray-as/aer/</url>
			</bibl>
			<bibl id="B34">
				<title>
					<p>Model-based analysis of oligonucleotide arrays: model validation, design issues and standard error application</p>
				</title>
				<aug>
					<au>
						<snm>Li</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Wong</snm>
						<fnm>WH</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2001</pubdate>
				<volume>2</volume>
				<issue>8</issue>
				<fpage>RESEARCH0032</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">11532216</pubid>
						<pubid idtype="pmcid">55329</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B35">
				<title>
					<p>Ensembl Zebrafish</p>
				</title>
				<url>http://www.ensembl.org/Danio_rerio/index.html</url>
			</bibl>
		</refgrp>
	</bm>
</art>

