<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>1471-2105-8-S10-S3</ui>
	<ji>1471-2105</ji>
	<fm>
		<dochead>Proceedings</dochead>
		<bibl>
			<title>
				<p>Semi-supervised learning for the identification of syn-expressed genes from fused microarray and <it>in situ </it>image data</p>
			</title>
			<aug>
				<au id="A1" ca="yes">
					<snm>Costa</snm>
					<mi>G</mi>
					<fnm>Ivan</fnm>
					<insr iid="I1"/>
					<email>ivan.filho@molgen.mpg.de</email>
				</au>
				<au id="A2">
					<snm>Krause</snm>
					<fnm>Roland</fnm>
					<insr iid="I1"/>
					<insr iid="I3"/>
					<email>roland.krause@molgen.mpg.de</email>
				</au>
				<au id="A3">
					<snm>Opitz</snm>
					<fnm>Lennart</fnm>
					<insr iid="I2"/>
					<email>lopitz@uni-goettingen.de</email>
				</au>
				<au id="A4" ca="yes">
					<snm>Schliep</snm>
					<fnm>Alexander</fnm>
					<insr iid="I1"/>
					<email>alexander.schliep@molgen.mpg.de</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Department Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany</p>
				</ins>
				<ins id="I2">
					<p>Abteilung Entwicklungsbiochemie, Universit&#228;t G&#246;ttingen, G&#246;ttingen, Germany</p>
				</ins>
				<ins id="I3">
					<p>Department Cellular Microbiology, Max Planck Institute for Infection Biology, Berlin, Germany</p>
				</ins>
			</insg>
			<source>BMC Bioinformatics</source>
			<supplement>
				<title>
					<p>Neural Information Processing Systems (NIPS) workshop on New Problems and Methods in Computational Biology</p>
				</title>
				<editor>Gal Chechik, Christina Leslie, William Stafford Noble, Gunnar R&#228;tsch, Quiad Morris and Koji Tsuda</editor>
				<note>Proceedings</note>
			</supplement>
			<conference>
				<title>
					<p>NIPS workshop on New Problems and Methods in Computational Biology</p>
				</title>
				<location>Whistler, Canada</location>
				<date-range>8 December 2006</date-range>
				<url>http://www.mlcb.org</url>
			</conference>
			<issn>1471-2105</issn>
			<pubdate>2007</pubdate>
			<volume>8</volume>
			<issue>Suppl 10</issue>
			<fpage>S3</fpage>
			<url>http://www.biomedcentral.com/1471-2105/8/S10/S3</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">18269697</pubid><pubid idtype="doi">10.1186/1471-2105-8-S10-S3</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<pub>
				<date>
					<day>21</day>
					<month>12</month>
					<year>2007</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2007</year>
			<collab>Costa et al; licensee BioMed Central Ltd.</collab>
			<note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>Gene expression measurements during the development of the fly <it>Drosophila melanogaster </it>are routinely used to find functional modules of <it>temporally </it>co-expressed genes. Complimentary large data sets of <it>in situ </it>RNA hybridization images for different stages of the fly embryo elucidate the <it>spatial </it>expression patterns.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>Using a semi-supervised approach, constrained clustering with mixture models, we can find clusters of genes exhibiting <it>spatio-temporal </it>similarities in expression, or <it>syn</it>-expression. The temporal gene expression measurements are taken as primary data for which pairwise constraints are computed in an automated fashion from raw <it>in situ </it>images without the need for manual annotation. We investigate the influence of these pairwise constraints in the clustering and discuss the biological relevance of our results.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusion</p>
					</st>
					<p>Spatial information contributes to a detailed, biological meaningful analysis of temporal gene expression data. Semi-supervised learning provides a flexible, robust and efficient framework for integrating data sources of differing quality and abundance.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>The study of embryonic development of the fly <it>Drosophila melanogaster </it>revealed many genes important to the development of other metazoans, including humans. Knowing the precise localization and time of gene expression is crucial in the elucidation of these dynamic cellular mechanisms. The advent of microarray technology has led to the generation of data sets measuring transcription or gene expression levels over the complete embryonic development of the fly <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. Under the assumption that genes with similar expression patterns have similar properties, the concept of <it>co-expressed genes </it>can be used to generate hypotheses about function, pathways and role of proteins that can be taken to the laboratory for further investigation. This reasoning, guilt-by-association, firmly installed the use of clustering algorithms in the analysis of data from DNA microarrays <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, as a cluster of genes ideally represent functional modules. For example, proteins expressed from co-regulated gene sets have been shown to physically interact <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>.</p>
			<p>Time-course data collected during the cell-cycle, development and differentiation, or in response to external factors required novel methods to cope with the temporal dependencies inherent to gene expression time-courses and with data quality issues; see <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> for a recent review, which identified mixture models to be preferable. Hence, mixture models and model-based clustering found wide-spread use in gene expression time-course analysis <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. The robustness of mixture models and their flexibility to capture dependencies in the data by use of Splines <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp> or Hidden Markov Models (HMM) <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp> as component models are the main reasons for the success of this technique in gene expression analysis. While the performance of some methods is quite impressive, the dimensionality of typical data sets &#8211; ten-thousands of genes, less than twenty time-points &#8211; implies that apparent co-expression of genes can be observed due to chance and that, hence, the value of information transfer between co-regulated genes is limited.</p>
			<sec>
				<st>
					<p>Semi-supervised learning and Heterogeneous Data</p>
				</st>
				<p>One way to strengthen the concept of co-expression for clustering algorithms is to augment the <it>primary </it>data, gene expression time-courses in our application, with <it>secondary</it>, external data in order to yield biologically more plausible solutions; recall that most clustering algorithms are only guaranteed to converge locally. The framework of choice which fits in nicely with the iterative knowledge acquisition process in biology is semi-supervised learning <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, partly clustering (unlabeled learning) and partly classification (labeled learning); sometimes this is also referred to as constrained clustering <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp> (see Fig. <figr fid="F1">1</figr>). One of the first applications in bioinformatics <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> shows, that less than 2% labels can drastically improve clustering quality. On real data, high-quality labels which indicate whether, for example, two genes are part of the same functional module can be obtained from the literature. Use of abundant annotations from the Gene Ontology (GO) <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> that are often used to validate clusterings <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> provide surprisingly little improvement, partly due to a mismatch between the semantics of GO and similarity of expression. In this work we use a formulation proposed in <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> that can be combined with mixture model estimation.</p>
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>Semi-supervised clustering</p>
					</caption>
					<text>
						<p><b>Semi-supervised clustering</b>. The effectiveness of pairwise constraints is shown by contrasting with the unsupervised setting (left). Assuming a two-dimensional space, the addition of positive pairwise constraints, depicted as red edges, and negative constraints depicted as blue edges (right), can indicate existence of two or more clusters and the cluster boundaries. Edge width corresponds to constraint magnitude.</p>
					</text>
					<graphic file="1471-2105-8-S10-S3-1"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Spatial expression patterns</p>
				</st>
				<p>Another important aspect of gene expression, its precise localization, has been studied in great detail in the fly. While the prime motivation for these sensitive experiments was to understand the role of individual genes in organ development, we can incorporate the spatial expression patterns for the generation of functional hypotheses.</p>
				<p>Genes that share the same temporal-spatial expression patterns are more likely to form a functional module. If they are synchronously co-expressed in <it>one tissue</it>, or in <it>multiple tissues </it>we speak of <it>syn-expression </it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp> and take in particular the latter case as a strong sign of functional similarity. The spatial expression patterns can be determined with <it>in situ </it>experiments where an mRNA-specific stain is produced by mRNA-binding oligonucleotides and a suitable dye <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Further processing, imaging and image analysis produces either 2D or 3D images of spatial patterns of gene expression; large-scale data sets are available for fly development and for other model organisms. Even though <it>Drosophila </it>embryos are morphologically simple, the image analysis is quite involved as <it>in-situ </it>images are taken of many different subjects with large fluctuations in shape. In addition, the staining intensity has higher, gene-specific error rates compared to DNA microarrays.</p>
			</sec>
			<sec>
				<st>
					<p>Prior work</p>
				</st>
				<p>Tomancak <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> performed a large scale study of gene expression in the fly embryos by <it>in situ </it>RNA hybridizations. The images were manually curated and annotated using a controlled vocabulary &#8211; ImaGO &#8211; following the example of the Gene Ontology <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. The final result was a hierarchical clustering of genes based on the manual annotations; the gene expression time-courses were not included in the analysis. Further work concentrated on mining the image database for genes with a spatial expression pattern similar to a query <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp> and on the extraction of features deemed peculiar and noteworthy <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, for example by clustering images based on an eigenvector based representation <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Recent smaller scale studies investigated pattern formation in Drosophila based on 3D <it>in situ </it>images <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp> for a small number of genes.</p>
			</sec>
			<sec>
				<st>
					<p>Our contribution</p>
				</st>
				<p>We obtain clusters of syn-expressed genes during the development of <it>Drosophila</it>. We propose to automatically infer positive constraints (spatial co-expression) and negative constraints (expression in distinct tissues) from the <it>in situ </it>image data and use them in a mixture model for the complementary, higher quality, DNA microarray time-course data as shown in Fig. <figr fid="F2">2</figr>.</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>Obtaining constraints from images</p>
					</caption>
					<text>
						<p><b>Obtaining constraints from images</b>. We depict the time course expression (top) and registered <it>in situ </it>images (middle) of genes <it>twi</it>, <it>G12177</it>, <it>Ef2 </it>and <it>RhoGAP71E</it>, which indicate their temporal and spatial expression patterns. From left to right, the embryo images are categorized into the time periods 0&#8211;3 h, 3&#8211;6 h, 6&#8211;9 h, 9&#8211;12 h, 12&#8211;15 h and 15&#8211;18 h. The microarray expression displays a similar expression pattern with maximal expression after 3 hours for all genes but weakly diverging at later time points. The <it>in situ </it>images indicates that <it>twi </it>and <it>CG12177 </it>have syn-expression at time periods 3&#8211;6, 6&#8211;9 and 9&#8211;12; while <it>Ef2 </it>and <it>RhoGAP71E </it>at periods 0&#8211;3, 3&#8211;6, 6&#8211;9, 9&#8211;12 and 15&#8211;18. At the bottom, we display how positive constraints are derived from <it>in situ </it>hybridization patterns. A heat-map displays the correlation coefficients between all pairs of <it>in situ </it>images of the corresponding time period (red values indicate positive correlations). After thresholding the correlation matrices, a constraint matrix for each time period is obtained. For example, constraint matrices from periods 3&#8211;6 and 6&#8211;9 indicates syn-expression of pairs (<it>twi</it>, <it>CG1217</it>) and (<it>Ef2</it>, <it>RhoGAP71E</it>), while the constraint matrix from period 9&#8211;12 also indicate that (<it>CG1217</it>, <it>RhoGAP71E</it>) are syn-expressed. The matrices are combined into one constraining genes that display syn-expression in at least three periods, as indicated in the matrix at the bottom.</p>
					</text>
					<graphic file="1471-2105-8-S10-S3-2"/>
				</fig>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Results and discussion</p>
			</st>
			<sec>
				<st>
					<p>Clustering of gene expression data using mixture of multivariate Gaussians</p>
				</st>
				<p>We cluster gene expression data using a mixture of multivariate Gaussians with diagonal covariance matrix and choose the number of components to be 28 as suggested by the Bayesian Information criterion (BIC) (see Section Evaluation, <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>).</p>
				<p>The gene expression time-courses cover the period from 1 to 12 hours of the embryo development and expression values are given as log-ratios (See Section Data for details). Overall, our clustering results reflect two typical classes (see Fig. <figr fid="F3">3</figr>), the maternal and zygotic transcripts <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Maternal genes appear strongly expressed in the first three hours, usually followed by a decline. The clusters 18 to 28 clearly follow a maternal pattern. These transcripts are deposited in the oocyte; typically the embryo does not transcribe these genes in early development. They are responsible for the determination of body axes and the first phases of the cell cycle and other functions. The period from 2 to 3 hours coincides with the cellularization and the formation of three germ layers following gastrulation, when primary tissues start to develop <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Conversely, genes actively transcribed in the embryo are not expressed in the early time points and expression rises to significant levels only in later stages (3 hours and later). Many of these genes are important to organogenesis. Transcripts in the clusters 1 to 4 and 8 to 11 follow the pattern of embryonic activation unambiguously. The functional association can be observed in the overrepresented Gene Ontology terms (see Supplementary Material <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>). For other clusters, shapes cannot be matched to such simple schemes. Several have maximal expression in the midst of embryonic development. Note that the clusters that show varying levels are less populated than the ones in the maternal and in the activated class.</p>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Clustering result: Mixture of Gaussians</p>
					</caption>
					<text>
						<p><b>Clustering result: Mixture of Gaussians</b>. The similarity of overall patterns in the clustering result of the MoG is explained by the developmental stages investigated. The major phenomena are depletion of maternal mRNA (maternal genes) and start of the embryonic transcriptional machinery during embryogenesis at time point 3 hours (zigotically expressed genes). In the clusters with zigotically expressed genes, we observe two main periods of activation: 3&#8211;4 hours for cluster U1 to U5, and 7&#8211;8 h for clusters U8 to U11. In the clusters with maternal genes, we observe under-expression of genes at several time periods: 3&#8211;4 h in clusters U21 to U28; 4&#8211;5 h for clusters U17 to U20; 6&#8211;7 h for cluster U16; 7&#8211;8 h for clusters U12 and U13; and 9&#8211;10 h for cluster U15.</p>
					</text>
					<graphic file="1471-2105-8-S10-S3-3"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Using images as Partial Information</p>
				</st>
				<p>We use semi-supervised learning to obtain better solutions for the maximum-likelihood estimation by constraining the mixture estimation with pairwise constraints between genes. The principle behind this is shown in Fig. <figr fid="F1">1</figr>. We choose a very simple approach to compare the images, which gives competitive results compared to a computationally more complex previous approach <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, combined with judicious filtering. The constraints are derived by measuring correlation between <it>in situ </it>images of pairs of genes. Pairs of genes, whose images are highly correlated in three or more time periods, are positively constrained, see Fig. <figr fid="F2">2</figr> for example. Negative constraints are derived similarly (see Section Constraints from <it>in situ </it>data for details). These constraints will, ideally, differentiate between genes showing co-expression due to chance and causal temporal co-expression also supported by spatial co-expression (syn-expression).</p>
				<p>As a previous study has shown <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, noisy constraints will be detrimental to the clustering quality; consequently few high quality constraints are preferable compared to many constraints of medium or low quality. The correlation coefficients of all pairwise image comparisons showed a bi-modal distribution (not shown) which allowed to select the strongest correlations with little ambiguity. Thus we arrived at a set of constraints derived from strongly positive and negative correlations.</p>
			</sec>
			<sec>
				<st>
					<p>Changes in the biological annotations</p>
				</st>
				<p>To investigate the effects of the constraints in the clustering, we compare the results of the mixture of Gaussians (MoG) against the mixture of Gaussians with pairwise constraints (cMoG) (see Fig. <figr fid="F4">4</figr> for clusters). As explained in Section Evaluation, we choose to use positive constraints, which are supported in at least three developmental stages, as they yield good recall of <it>in situ </it>image annotations.</p>
				<fig id="F4">
					<title>
						<p>Figure 4</p>
					</title>
					<caption>
						<p>Clustering result: Constrained Mixture of Gaussians</p>
					</caption>
					<text>
						<p><b>Clustering result: Constrained Mixture of Gaussians</b>. The 28 clusters from cMoG show tightly co-regulated patterns and a refinement of the clustering solution of MoG. In the clusters with zygotically expressed genes, we also observe two main periods of activation: 3&#8211;4 h for clusters c1 to c5, and 7&#8211;8 h hours for clusters c9 to c12. In the clusters with maternal genes, we observe under-expression of genes at several time periods: 3&#8211;4 h for clusters C18, C20 to C28; 4&#8211;5 h for clusters C15, C16, C19; 6&#8211;7 h for clusters C8, C13, C14 and C19; and 7&#8211;8 h hours for cluster C7.</p>
					</text>
					<graphic file="1471-2105-8-S10-S3-4"/>
				</fig>
				<p>As a sanity check, we inspect if cMoG is successful in constraining the clustering by counting the number of constraints, as derived from the images, met in the final solutions. With MoG, a sizeable proportion of the constraints are already satisfied (656 out of 1756 pairwise positive constraints), as the expression data partially agrees with the constraints as syn-expressed genes are co-expressed. With cMoG, 1127 out of 1756 pairwise positive constraints are met, nearly twice the number for MoG. This demonstrates that cMoG benefits from the constraints in deriving the clusters.</p>
				<p>Another helpful analysis is the comparison of enrichment of <it>in situ </it>image annotations (ImaGO), as described in Section Evaluation (see <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> for complete results). We display in Fig. <figr fid="F5">5</figr> a scatter plot with the <it>p</it>-values of all ImaGO terms, which had an enrichment <it>p</it>-value below 0.01 in one either cMoG or MoG clusters. In summary, cMoG has a higher enrichment in 67 out of 112 relevant ImaGO terms. A binomial test for testing the event of having 67 successes in 112 trials is rejected with a <it>p</it>-value of 0.0232, which indicates that the counts of ImaGO terms with higher enrichment for cMoG is significantly higher than expected by chance. Furthermore, if we take only ImaGO terms with a higher enrichment gain for one of the methods into account (points distant from the diagonal line in Fig. <figr fid="F5">5</figr>), the advantage of cMoG is even greater (see Fig. <figr fid="F6">6</figr> and Fig. <figr fid="F7">7</figr>). This indicates that even without direct use of the annotation information from ImaGO, cMoG has a greater sensitivity in grouping syn-expressed genes.</p>
				<fig id="F5">
					<title>
						<p>Figure 5</p>
					</title>
					<caption>
						<p>ImaGO term enrichment</p>
					</caption>
					<text>
						<p><b>ImaGO term enrichment</b>. We compare ImaGO term enrichment of MoG (<it>x</it>-axis) and cMoG (<it>y</it>-axis) in a scatter plot. We use -<it>log</it>(<it>p</it>)-values, thus larger values indicate a larger degree of enrichment. Points above the red line indicate a higher enrichment in cMoG clusters, and values below in MoG clusters. The distance from the diagonal is proportional to the increase in enrichment. For 67 out of 112 ImaGO terms we observe a higher degree of enrichment in cMoG clusters.</p>
					</text>
					<graphic file="1471-2105-8-S10-S3-5"/>
				</fig>
				<fig id="F6">
					<title>
						<p>Figure 6</p>
					</title>
					<caption>
						<p>Proportion of ImaGO term enrichment</p>
					</caption>
					<text>
						<p><b>Proportion of ImaGO term enrichment</b>. For each threshold <it>&#964; </it>(<it>x</it>-axis), we depict the proportion of ImaGO terms for which we observer a smaller <it>p</it>-value in cMoG than in MoG (<it>y</it>-axis). The threshold <it>&#964; </it>discards ImaGO terms, where the difference in the log of the <it>p</it>-value of cMoG and MoG in smaller then <it>&#964;</it>. As can be observed, the proportion is higher then 0.5 for all <it>&#964; </it>values, which indicates an advantage of cMoG. Furthermore, the proportion has an increasing tendency for higher <it>&#964; </it>values.</p>
					</text>
					<graphic file="1471-2105-8-S10-S3-6"/>
				</fig>
				<fig id="F7">
					<title>
						<p>Figure 7</p>
					</title>
					<caption>
						<p>ImaGO term enrichment for <it>&#964; </it>= 0.3</p>
					</caption>
					<text>
						<p><b>ImaGO term enrichment for <it>&#964; </it>= 0.3</b>. We compare ImaGO term enrichment of MoG (<it>x</it>-axis) and cMoG (<it>y</it>-axis) in a scatter plot for <it>&#964; </it>= 0.3. We use -<it>log</it>(<it>p</it>)-values, thus larger values indicate a larger degree of enrichment. Points above the red line indicate a higher enrichment in cMoG clusters, and values below in MoG clusters. Green points between the dotted lines represent ImaGO terms not satisfying the threshold <it>&#964; </it>= 0.3, where <it>&#964; </it>indicates the distance from the diagonal line to the dotted lines. We clearly observe a higher proportion of non-filtered ImaGO terms (points in blue) above the diagonal line (32 ImaGO terms) against (12 ImaGO terms) below the diagonal. A binomial test is rejected with a <it>p</it>-value of 0.0018, which indicates an significant advantage of cMoG.</p>
					</text>
					<graphic file="1471-2105-8-S10-S3-7"/>
				</fig>
				<p>Overall, the individual clusters of MoG and cMoG did not differ much; the cMoG clusters were better spread and the number of clusters with few genes assigned is smaller. One way to quantify the distinctions is to calculate the sensitivity and specificity of cMoG taking the results from MoG as the ground truth. These values are respectively 0.53 and 0.97, which indicates that cMoG has a tendency to subdivide clusters from MoG.</p>
			</sec>
			<sec>
				<st>
					<p>Functional annotations in constrained clusters</p>
				</st>
				<p>Even for a well characterized genome like <it>Drosophila </it>the high dimensionality in the annotation data provides only limited information for any single gene. Analyzing the obtained clusters is also challenging due to the necessity to identify the corresponding functional modules in the unconstrained and the constrained sets and by the requirement to show improvements rather than simple correct functional assignments. In the following, we will refer to the <it>i</it>th cluster from cMoG and MoG as C<it>i </it>and U<it>i </it>respectively. For some cases, the mapping from clusters of cMoG to MoG is simply one to one (e.g., C1 to U1, C5 to U5, C11 to U11 and C12 to U10). Most other clusters show larger differences. We focus our functional analysis on clusters with zygoticly expressed transcripts (i.e., C1 to C4 and C9 to C12 in Fig. <figr fid="F4">4</figr>).</p>
				<p>Cluster C2 represents a good example of the changes resulting from the introduction of constraints. It contains most of the genes from U2 (135 genes) and 16 genes from U3. Out of the seven genes, which show similar expression patterns and have co-location constraints (<it>CG6930</it>, <it>E2f</it>, <it>Iswi</it>, <it>neur</it>, <it>Set</it>, <it>RhoGAP771e</it>, <it>trx</it>), only four (<it>G6930</it>, <it>E2f</it>, <it>Iswi</it>, <it>trx</it>) are found in the U2. All these genes have ImaGO annotations related to <it>ventral nerve cord primordium </it>and related terms (see Fig. <figr fid="F8">8</figr> top for mean <it>in situ </it>images of these genes and <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> for complete ImaGO enrichment results). Related genes that have no constraints but are annotated as part of the <it>embryonic central nervous system </it>are included in C2 (<it>CG7372</it>, <it>CG14722</it>, <it>fzy</it>). The analysis of GO term enrichment indicates terms such as <it>nervous system development </it>(<it>p</it>-value of 3.38e-23) and <it>system development </it>(<it>p</it>-value of 9.54e-21) (similar term enrichment is found for cluster U2). It should be noted that the clusters U2 and U3 are similar overall and mainly differ in the average time when genes reach the plateau of maximal expression.</p>
				<fig id="F8">
					<title>
						<p>Figure 8</p>
					</title>
					<caption>
						<p>Averaged <it>in situ </it>images C2, C3 and C10</p>
					</caption>
					<text>
						<p><b>Averaged <it>in situ </it>images C2, C3 and C10</b>. Averaged <it>in situ </it>images of genes constrained in Cluster C2 (top), C3 (middle) and C10 (bottom) allow to visually assess homogeneity of spatial distribution. From left to right, we have embryos at hours 0&#8211;3, 3&#8211;6, 6&#8211;9, 9&#8211;12, 12&#8211;15 and 15&#8211;18. Top images represents dorsal views, bottom images lateral views; not all time periods have images in both views.</p>
					</text>
					<graphic file="1471-2105-8-S10-S3-8"/>
				</fig>
				<p>An example for larger changes is cluster C3, which is mainly composed of genes originally found in U3 (101 genes) and U8 (63 genes). C3 was constrained by three genes (<it>rhea</it>, <it>Rsf1 </it>and <it>vig</it>) of which <it>rhea </it>and <it>vig </it>come from cluster U8 and <it>Rsf1 </it>from U3 (see Fig. <figr fid="F8">8</figr> middle for mean <it>in situ </it>images of C3). This cluster presents smaller <it>p</it>-values for ImaGO terms related to <it>muscle primordium </it>(genes <it>CG5522</it>, <it>CG9253</it>, <it>Dg</it>, <it>Mef2</it>, <it>betaTub60D</it>, <it>htl</it>, <it>mbc</it>, <it>vig</it>) than U3 and U8. Furthermore, GO term analysis reveals that this cluster shows enrichment for <it>nervous system development </it>(<it>p</it>-value of 1.33e-11) and <it>axis specification </it>(9.31e-05). For the latter term, seven genes are originally from U3 (<it>Dfd</it>, <it>Lis-1</it>, <it>sti</it>, <it>Syx1A</it>, <it>sqd</it>, <it>Ras85Dm</it>, <it>tup</it>) and five from U8 (<it>baz</it>, <it>Dg</it>, <it>pnt</it>, <it>Rac2</it>, <it>tok</it>), demonstrating that the changes introduced increased the number of syn-expressed genes within C3.</p>
				<p>The cluster C9 represents only a subset of U8 (59 out of the 126 genes) but has no genes with constraints. It consists of genes from U8 that are not constrained to genes from C3 (see paragraph above). Still, it is enriched in the ImaGO term <it>embryonic central nervous system </it>and related terms (genes <it>HLHmbeta</it>, <it>NetB</it>, <it>Oli</it>, <it>lin-28</it>, <it>scrt</it>, <it>sd</it>, <it>tap</it>, <it>uzip </it>and <it>zfh2</it>). The cluster is also enriched in the terms <it>organ </it>(<it>p</it>-value 2.66e-05) and <it>ectoderm development </it>(<it>p</it>-values 8.54e-05), which were significantly enriched in U8. In other words, this cluster is a specialization of U8, whose genes are specific to <it>organ development</it>.</p>
				<p>C10 is formed by the addition of most genes in the U4 cluster (39 genes) to U10 (118 genes). There are seven genes constraining this cluster (<it>CG6751</it>, <it>CG18446</it>, <it>CG13912</it>, <it>CG10924</it>, <it>CG8745</it>, <it>dm</it>, <it>Klp61F</it>) (see Fig. <figr fid="F8">8</figr> bottom). ImaGO term enrichment relates this cluster to <it>yolk nuclei </it>and <it>amnioserosa</it>. It is also in enriched in the GO term <it>nervous system development </it>(<it>p</it>-value 1.06e-08), all of which were insignificant in the U10 cluster.</p>
				<p>It is also worthwhile to look at those few cases where MoG performed better. From Fig. <figr fid="F5">5</figr>, two ImaGO terms with higher enrichment increase in MoG are <it>maternal </it>and <it>procephalic ectoderm anlage in statu nascendi</it>. The first term was enriched in cluster C22 and U21, where MoG had some more genes related to the term <it>maternal </it>(34 genes in MoG compared to 31 genes in cMoG). For the latter ImaGO term, clusters U2 and C2 were both enriched, and there was only one annotated gene in U2 not in C2. As none of these annotated groups of genes had pairwise constraints, we could not detect any direct effect of the constrained clustering on these results.</p>
				<p>The refined clusters improve the generation of testable hypotheses for the role of uncharacterized genes. Overall, we observe improvement in annotation of genes related to development of the fly, in particular with respect to the ImaGO annotations, which increases our confidence in the delineation of syn-expressed functional modules.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Conclusion</p>
			</st>
			<p>The generation of functional hypotheses by integrating different information sources is a key problem posed by the massive amounts of high-throughput data that is generated in todays laboratories. Often, analyses are limited to few information sources and the integration only starts after many processing steps, frequently including manual annotation, and is more often than not performed manually.</p>
			<p>Here we have shown, for a limited setting, how to automatically fuse temporal and spatial gene expression patterns by semi-supervised clustering. Our results show that the clusters we find are biologically meaningful and that we can detect clusters of syn-expressed genes which are worthwhile targets for further investigation, either with classical biological analysis or as the input for methods inferring networks. Our implementation is reasonably simple and computationally efficient and the semi-supervised approach provides a flexible framework for adapting results to questions biologists are interested in. The main advantage of the semi-supervised approach over joint models and other approaches is that it can cope easily with the variations in data abundance from different sources. Gene expression measured with DNA microarrays is often available for ten-thousands of genes and many time-points, in situ hybridization will typically only cover a fraction of those, as will high quality protein interaction or protein structure data. Our results show a small but very clear improvement, despite the complexity of the problem, namely the restriction to embryonic stages to few data points, and the usual caveats concerning DNA microarrays and <it>in situ </it>images of gene expression. The open questions are manifold and concern both the biology and the computer science. How can we refine biological questions to yield more meaningful answers? How can one mine image data effectively, and does a representation in 2D suffice or is using the third dimension a necessity? The best combination of several, potentially conflicting, information sources to arrive at one set of constraints is an equally challenging problem. Can one incorporate the per-stage constraints into the learning for time-course data? We have demonstrated that our methodology is a promising candidate for the delineation of functional modules using different data types and our results show that further investigations are likely to bear fruit.</p>
		</sec>
		<sec>
			<st>
				<p>Methods</p>
			</st>
			<sec>
				<st>
					<p>Mixture models</p>
				</st>
				<p>A mixture model <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> is a stochastic model where observations are drawn from one of several component densities. More formally, it is a convex combination of density functions,</p>
				<p>
					<display-formula id="M1">
						<m:math name="1471-2105-8-S10-S3-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:mi>P</m:mi>
									<m:mo stretchy="false">[</m:mo>
									<m:msub>
										<m:mi>x</m:mi>
										<m:mi>i</m:mi>
									</m:msub>
									<m:mo>|</m:mo>
									<m:mi>&#920;</m:mi>
									<m:mo stretchy="false">]</m:mo>
									<m:mo>=</m:mo>
									<m:mstyle displaystyle="true">
										<m:munderover>
											<m:mo>&#8721;</m:mo>
											<m:mrow>
												<m:mi>k</m:mi>
												<m:mo>=</m:mo>
												<m:mn>1</m:mn>
											</m:mrow>
											<m:mi>K</m:mi>
										</m:munderover>
										<m:mrow>
											<m:msub>
												<m:mi>&#945;</m:mi>
												<m:mi>k</m:mi>
											</m:msub>
											<m:mi>P</m:mi>
											<m:mo stretchy="false">[</m:mo>
											<m:msub>
												<m:mi>x</m:mi>
												<m:mi>i</m:mi>
											</m:msub>
											<m:mo>|</m:mo>
											<m:msub>
												<m:mi>&#952;</m:mi>
												<m:mi>k</m:mi>
											</m:msub>
											<m:mo stretchy="false">]</m:mo>
										</m:mrow>
									</m:mstyle>
									<m:mo>.</m:mo>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacqWFqbaucqGGBbWwcqWG4baEdaWgaaWcbaGaemyAaKgabeaakiabcYha8jabfI5arjabc2faDjabg2da9maaqahabaacciGae4xSde2aaSbaaSqaaiabdUgaRbqabaGccqWFqbaucqGGBbWwcqWG4baEdaWgaaWcbaGaemyAaKgabeaakiabcYha8jab+H7aXnaaBaaaleaacqWGRbWAaeqaaOGaeiyxa0faleaacqWGRbWAcqGH9aqpcqaIXaqmaeaacqWGlbWsa0GaeyyeIuoakiabc6caUaaa@4DB4@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<p>Here, <inline-formula><m:math name="1471-2105-8-S10-S3-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>X</m:mi><m:mo>=</m:mo><m:msubsup><m:mrow><m:mo>{</m:mo><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>}</m:mo></m:mrow><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>N</m:mi></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGybawcqGH9aqpcqGG7bWEcqWG4baEdaWgaaWcbaGaemyAaKgabeaakiabc2ha9naaDaaaleaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGobGtaaaaaa@3998@</m:annotation></m:semantics></m:math></inline-formula> denotes the observed data (or the gene expression time-courses), &#920; = (<it>&#945;</it><sub>1</sub>,...,<it>&#945;</it><sub><it>K</it></sub>, <it>&#952;</it><sub>1</sub>,...,<it>&#952;</it><sub><it>K</it></sub>) the non-negative component weights or priors <it>&#945;</it><sub><it>k</it></sub>, <it>i </it>= 1,...,<it>K</it>, which add to unity and <b>P</b>[<it>x</it><sub><it>i</it></sub>|<it>&#952;</it><sub><it>k</it></sub>] are the <it>K </it>component densities parameterized by <it>&#952;</it><sub><it>k</it></sub>, <it>k </it>= 1,...,<it>K</it>, for example <it>&#952;</it><sub><it>k </it></sub>= (<it>&#956;</it><sub><it>k</it></sub>, &#931;<sub><it>k</it></sub>) for multivariate Gaussians. The Expectation-Maximization (EM) algorithm <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> can be used to find parameters &#920;* maximizing (1) at least locally. The EM is necessary as (1) is essentially the incomplete data likelihood function; missing are the values of the indicator variables <inline-formula><m:math name="1471-2105-8-S10-S3-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>Y</m:mi><m:mo>=</m:mo><m:msubsup><m:mrow><m:mo>{</m:mo><m:msub><m:mi>y</m:mi><m:mi>i</m:mi></m:msub><m:mo>}</m:mo></m:mrow><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>N</m:mi></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGzbqwcqGH9aqpcqGG7bWEcqWG5bqEdaWgaaWcbaGaemyAaKgabeaakiabc2ha9naaDaaaleaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGobGtaaaaaa@399C@</m:annotation></m:semantics></m:math></inline-formula>, <it>y</it><sub><it>i </it></sub>&#8712; {1,...,<it>K</it>}, which designate the component <it>y</it><sub><it>i </it></sub>which generated the observation <it>x</it><sub><it>i</it></sub>. If <it>Y </it>is known, the maximization is straight-forward, and the core idea of EM is to iteratively use expected values for <it>Y </it>based on current parameters &#920;<sub><it>t </it></sub>in the estimation of &#920;<sub><it>t</it>+1</sub>. A nice introduction to the EM-algorithm is given in <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Partially supervised learning</p>
				</st>
				<p>We assume additional soft constraints for observations in the form of pairwise positive (link) respectively negative (do not link) constraints <inline-formula><m:math name="1471-2105-8-S10-S3-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>w</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mo>+</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG3bWDdaqhaaWcbaGaemyAaKMaemOAaOgabaGaey4kaScaaaaa@31EA@</m:annotation></m:semantics></m:math></inline-formula> respectively <inline-formula><m:math name="1471-2105-8-S10-S3-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>w</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mo>&#8722;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG3bWDdaqhaaWcbaGaemyAaKMaemOAaOgabaGaeyOeI0caaaaa@31F5@</m:annotation></m:semantics></m:math></inline-formula> &#8712; [0, 1], which reflect the degree of linking for each pairs of observations <it>x</it><sub><it>i</it></sub>, <it>x</it><sub><it>j</it></sub>, 1 &#8804; <it>i </it>&lt;<it>j </it>&#8804; <it>N</it>. We use a formulation proposed by Lange et al. <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> which we summarize here; for further applications of the method we refer to <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B24">24</abbr></abbrgrp>.</p>
				<p>Let <it>W</it><sup>+ </sup>= {<inline-formula><m:math name="1471-2105-8-S10-S3-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>w</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mo>+</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG3bWDdaqhaaWcbaGaemyAaKMaemOAaOgabaGaey4kaScaaaaa@31EA@</m:annotation></m:semantics></m:math></inline-formula>} respectively <it>W</it><sup>- </sup>be symmetric <it>N </it>&#215; <it>N </it>matrices. The EM-algorithm can be easily modified to respect the constraints <it>W</it><sup>+</sup>, <it>W</it><sup>-</sup>. In the <it>t</it>-th E-step the posterior distribution <it>P</it>[<it>Y</it>|<it>X</it>, &#920;<sub><it>t</it></sub>] over hidden labels <it>y</it><sub><it>i </it></sub>is computed, where &#920;<sub><it>t </it></sub>is the last estimate of the parameters. By Bayes' rule we have</p>
				<p>
					<display-formula id="M2">
						<m:math name="1471-2105-8-S10-S3-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:mi>P</m:mi>
									<m:mo stretchy="false">[</m:mo>
									<m:mi>Y</m:mi>
									<m:mo>|</m:mo>
									<m:mi>X</m:mi>
									<m:mo>,</m:mo>
									<m:mi>&#920;</m:mi>
									<m:mo stretchy="false">]</m:mo>
									<m:mo>=</m:mo>
									<m:mfrac>
										<m:mn>1</m:mn>
										<m:mi>Z</m:mi>
									</m:mfrac>
									<m:mo>&#8901;</m:mo>
									<m:mi>P</m:mi>
									<m:mo stretchy="false">[</m:mo>
									<m:mi>X</m:mi>
									<m:mo>|</m:mo>
									<m:mi>Y</m:mi>
									<m:mo>,</m:mo>
									<m:mi>&#920;</m:mi>
									<m:mo stretchy="false">]</m:mo>
									<m:mo>&#8901;</m:mo>
									<m:mi>P</m:mi>
									<m:mo stretchy="false">[</m:mo>
									<m:mi>Y</m:mi>
									<m:mo>|</m:mo>
									<m:mi>&#920;</m:mi>
									<m:mo stretchy="false">]</m:mo>
									<m:mo>,</m:mo>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacqWFqbaucqGGBbWwcqWGzbqwcqGG8baFcqWGybawcqGGSaalcqqHyoqucqGGDbqxcqGH9aqpdaWcaaqaaiabigdaXaqaaiabdQfaAbaacqGHflY1cqWFqbaucqGGBbWwcqWGybawcqGG8baFcqWGzbqwcqGGSaalcqqHyoqucqGGDbqxcqGHflY1cqWFqbaucqGGBbWwcqWGzbqwcqGG8baFcqqHyoqucqGGDbqxcqGGSaalaaa@5124@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<p>where <it>Z </it>is a normalizing constant. Loosely speaking, the constraints are incorporated by choosing the prior distribution <b>P</b>[<it>Y</it>|&#920;<sub><it>t</it></sub>] such that neither constraints <it>W</it><sup>+</sup>, <it>W</it><sup>- </sup>nor prior probabilities <it>&#945;</it><sub><it>k </it></sub>in &#920;<sub><it>t </it></sub>are violated while maximizing its entropy. In other words, we choose the distribution, which obeys the <it>maximum entropy </it>principle and which is called the <it>Gibbs </it>distribution. See <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp> for full details. Hence</p>
				<p>
					<display-formula id="M3">
						<m:math name="1471-2105-8-S10-S3-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:mi>P</m:mi>
									<m:mo stretchy="false">[</m:mo>
									<m:mi>Y</m:mi>
									<m:mo>|</m:mo>
									<m:mi>&#920;</m:mi>
									<m:mo stretchy="false">]</m:mo>
									<m:mo>=</m:mo>
									<m:mfrac>
										<m:mn>1</m:mn>
										<m:mi>Z</m:mi>
									</m:mfrac>
									<m:mstyle displaystyle="true">
										<m:munder>
											<m:mo>&#8719;</m:mo>
											<m:mi>i</m:mi>
										</m:munder>
										<m:mrow>
											<m:msub>
												<m:mi>&#945;</m:mi>
												<m:mrow>
													<m:msub>
														<m:mi>y</m:mi>
														<m:mi>i</m:mi>
													</m:msub>
												</m:mrow>
											</m:msub>
										</m:mrow>
									</m:mstyle>
									<m:mstyle displaystyle="true">
										<m:munder>
											<m:mo>&#8719;</m:mo>
											<m:mrow>
												<m:mi>i</m:mi>
												<m:mo>,</m:mo>
												<m:mi>j</m:mi>
											</m:mrow>
										</m:munder>
										<m:mrow>
											<m:mi>exp</m:mi>
											<m:mo>&#8289;</m:mo>
											<m:mo stretchy="false">(</m:mo>
											<m:mo>&#8722;</m:mo>
											<m:msup>
												<m:mi>&#955;</m:mi>
												<m:mo>+</m:mo>
											</m:msup>
											<m:msubsup>
												<m:mi>w</m:mi>
												<m:mrow>
													<m:mi>i</m:mi>
													<m:mi>j</m:mi>
												</m:mrow>
												<m:mo>+</m:mo>
											</m:msubsup>
											<m:mo stretchy="false">(</m:mo>
											<m:mn>1</m:mn>
											<m:mo>&#8722;</m:mo>
											<m:msub>
												<m:mi>&#948;</m:mi>
												<m:mrow>
													<m:msub>
														<m:mi>y</m:mi>
														<m:mi>i</m:mi>
													</m:msub>
													<m:msub>
														<m:mi>y</m:mi>
														<m:mi>j</m:mi>
													</m:msub>
												</m:mrow>
											</m:msub>
											<m:mo stretchy="false">)</m:mo>
											<m:mo>&#8722;</m:mo>
											<m:msup>
												<m:mi>&#955;</m:mi>
												<m:mo>&#8722;</m:mo>
											</m:msup>
											<m:msubsup>
												<m:mi>w</m:mi>
												<m:mrow>
													<m:mi>i</m:mi>
													<m:mi>j</m:mi>
												</m:mrow>
												<m:mo>&#8722;</m:mo>
											</m:msubsup>
											<m:msub>
												<m:mi>&#948;</m:mi>
												<m:mrow>
													<m:msub>
														<m:mi>y</m:mi>
														<m:mi>i</m:mi>
													</m:msub>
													<m:msub>
														<m:mi>y</m:mi>
														<m:mi>j</m:mi>
													</m:msub>
												</m:mrow>
											</m:msub>
											<m:mo stretchy="false">)</m:mo>
										</m:mrow>
									</m:mstyle>
									<m:mo>,</m:mo>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacqWFqbaucqGGBbWwcqWGzbqwcqGG8baFcqqHyoqucqGGDbqxcqGH9aqpdaWcaaqaaiabigdaXaqaaiabdQfaAbaadaqeqbqaaGGaciab+f7aHnaaBaaaleaacqWG5bqEdaWgaaadbaGaemyAaKgabeaaaSqabaaabaGaemyAaKgabeqdcqGHpis1aOWaaebuaeaacyGGLbqzcqGG4baEcqGGWbaCcqGGOaakcqGHsislcqGF7oaBdaahaaWcbeqaaiabgUcaRaaakiabdEha3naaDaaaleaacqWGPbqAcqWGQbGAaeaacqGHRaWkaaGccqGGOaakcqaIXaqmcqGHsislcqGF0oazdaWgaaWcbaGaemyEaK3aaSbaaWqaaiabdMgaPbqabaWccqWG5bqEdaWgaaadbaGaemOAaOgabeaaaSqabaGccqGGPaqkcqGHsislcqGF7oaBdaahaaWcbeqaaiabgkHiTaaakiabdEha3naaDaaaleaacqWGPbqAcqWGQbGAaeaacqGHsislaaGccqGF0oazdaWgaaWcbaGaemyEaK3aaSbaaWqaaiabdMgaPbqabaWccqWG5bqEdaWgaaadbaGaemOAaOgabeaaaSqabaGccqGGPaqkaSqaaiabdMgaPjabcYcaSiabdQgaQbqab0Gaey4dIunakiabcYcaSaaa@7218@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<p>where <it>Z </it>is the normalizing constant. The Lagrange parameters <it>&#955;</it><sup>+ </sup>and <it>&#955;</it><sup>- </sup>weigh the penalty of positive and negative constraints violations and hence control the importance of the constraints. If <it>&#955;</it><sup>+ </sup>= <it>&#955;</it><sup>- </sup>= 0 then the estimation maximized the likelihood, whereas for increasing <it>&#955;</it><sup>+</sup>, <it>&#955;</it><sup>- </sup>the result is more strongly influenced by the constraints. As computing (2) is usually infeasible we again follow <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> and resort to a <it>mean field approximation</it>. Note, finally, that when there is no overlap in the constraints &#8211; more exactly, <inline-formula><m:math name="1471-2105-8-S10-S3-i8" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>w</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mo>+</m:mo></m:msubsup><m:msubsup><m:mi>w</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mo>&#8722;</m:mo></m:msubsup><m:mo>=</m:mo><m:mn>0</m:mn></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG3bWDdaqhaaWcbaGaemyAaKMaemOAaOgabaGaey4kaScaaOGaem4DaC3aa0baaSqaaiabdMgaPjabdQgaQbqaaiabgkHiTaaakiabg2da9iabicdaWaaa@393B@</m:annotation></m:semantics></m:math></inline-formula>, and <it>&#955;</it><sup>+ </sup>= <it>&#955;</it><sup>- </sup>~ &#8734; &#8211; we obtain hard constraints <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B38">38</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Data</p>
				</st>
				<p>We use the data-set described in <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> which is available from the BDGP database <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>.</p>
				<sec>
					<st>
						<p>Image collection</p>
					</st>
					<p>Embryos of Canton were collected and aged to produce embryos 0&#8211;3, 3&#8211;6, 6&#8211;9, 9&#8211;12, 12&#8211;15 and 15&#8211;18 hours old. The <it>in situ </it>reactions were based on a cDNA library of 2,721 clones; in the end images were collected for 1,388 genes. The difference is caused either by a failure of <it>in situ </it>reactions or by a lack of tissue-specific expression. Images were taken with a dissecting microscope in different focal planes and different orientations.</p>
				</sec>
				<sec>
					<st>
						<p>Time-courses</p>
					</st>
					<p>For twelve consecutive one-hour time windows of embryogenesis mRNA levels were measured using the Affymetrix GeneChip <it>Drosophila </it>Genome array targeting about 14,000 genes and processed with the standard Affymetrix tool suite. We used the median from three biological replicates. As the embryos were not synchronized, the manual inspection of the morphology was used to establish a common time-scale with the time-course data. Expression values were transformed to log-ratios by using time point 1 hour as reference. We removed genes not exhibiting at least a two-fold change, which leaves us with 2684 genes.</p>
				</sec>
			</sec>
			<sec>
				<st>
					<p><it>In situ </it>image processing</p>
				</st>
				<p>The majority of <it>in situ </it>hybridization images in the BDGP database <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> contain the projection of exactly one centered embryo. However, there is a noticeable portion of images with multiple touching embryos. To exploit as much data as possible, the goal of image preprocessing is to locate and extract exactly one complete embryo from each image, even for touching embryos.</p>
				<p>To distinguish between embryo and non-embryo pixels we estimate the local variance of grey level intensities for each pixel in a 3 &#215; 3 neighborhood, following <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. It suffices to apply a fixed predefined threshold for segmentation using variance estimates because of a homogeneous background in contrast to the embryo. To eliminate erroneous embryo regions a sequence of morphological closing and opening using a circular mask of radius 4 is applied <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Subsequently the largest connected component is extracted. The resulting region may be the projection of a single complete or partial embryo or the projection of a set of multiple touching embryos. To distinguish these different cases we apply a series of simple filters based on ellipticity, compactness and area of the extracted region. For regions of multiple touching embryos we introduce a procedure to separate the individuals and to extract a single complete high quality embryo. Further details are given in <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>.</p>
				<p>The final step of image processing is to register the embryos extracted to a standardized orientation and size to allow for comparison of different expression patterns. The embryo is rotated to align horizontally to the principal axis. Subsequently the bounding box is scaled to a standard size. Fig. <figr fid="F9">9</figr> shows the steps of the image processing pipeline for one example image.</p>
				<fig id="F9">
					<title>
						<p>Figure 9</p>
					</title>
					<caption>
						<p>Image processing pipeline</p>
					</caption>
					<text>
						<p><b>Image processing pipeline</b>. The image pipeline combines registration, morphological operations and further processing steps to automatically process raw images, even if they include multiple touching embryos. Shown here is the image insitu8784, gene <it>CG5353</it>.</p>
					</text>
					<graphic file="1471-2105-8-S10-S3-9"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Constraints from <it>in situ </it>images</p>
				</st>
				<p>To compare <it>in situ </it>hybridization patterns between a pair of registered embryo images, we compute the Pearson correlation as a co-location index, as proposed in <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. This index takes both the spatial distribution and the strength of hybridization into account. Despite its simplicity, it had similar performance, in a querying scenario, compared to more complex methods such as the one proposed in <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. More formally, let <it>X </it>and <it>Y </it>describe the pixel intensities of two equal sized and registered embryo images, the Pearson correlation is calculated as</p>
				<p>
					<display-formula id="M4">
						<m:math name="1471-2105-8-S10-S3-i9" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:mi>C</m:mi>
									<m:mi>C</m:mi>
									<m:mo stretchy="false">(</m:mo>
									<m:mi>X</m:mi>
									<m:mo>,</m:mo>
									<m:mi>Y</m:mi>
									<m:mo stretchy="false">)</m:mo>
									<m:mo>=</m:mo>
									<m:mfrac>
										<m:mrow>
											<m:mi>C</m:mi>
											<m:mi>o</m:mi>
											<m:mi>v</m:mi>
											<m:mo stretchy="false">(</m:mo>
											<m:mi>X</m:mi>
											<m:mo>,</m:mo>
											<m:mi>Y</m:mi>
											<m:mo stretchy="false">)</m:mo>
										</m:mrow>
										<m:mrow>
											<m:msqrt>
												<m:mrow>
													<m:mi>V</m:mi>
													<m:mi>a</m:mi>
													<m:mi>r</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>X</m:mi>
													<m:mo stretchy="false">)</m:mo>
												</m:mrow>
											</m:msqrt>
											<m:msqrt>
												<m:mrow>
													<m:mi>V</m:mi>
													<m:mi>a</m:mi>
													<m:mi>r</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>Y</m:mi>
													<m:mo stretchy="false">)</m:mo>
												</m:mrow>
											</m:msqrt>
										</m:mrow>
									</m:mfrac>
									<m:mo>.</m:mo>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGdbWqcqWGdbWqcqGGOaakcqWGybawcqGGSaalcqWGzbqwcqGGPaqkcqGH9aqpdaWcaaqaaiabdoeadjabd+gaVjabdAha2jabcIcaOiabdIfayjabcYcaSiabdMfazjabcMcaPaqaamaakaaabaGaemOvayLaemyyaeMaemOCaiNaeiikaGIaemiwaGLaeiykaKcaleqaaOWaaOaaaeaacqWGwbGvcqWGHbqycqWGYbGCcqGGOaakcqWGzbqwcqGGPaqkaSqabaaaaOGaeiOla4caaa@4CB7@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<p>The 18 developmental stages of the embryo are divided into six periods (0&#8211;3, 3&#8211;6, 6&#8211;9, 9&#8211;12, 12&#8211;15 and 15&#8211;18). Not all time points were sampled for each gene and for some periods and genes, <it>in situ </it>images were taken in a dorsal and/or lateral views. There is however no annotation of the orientation of the embryo; automatic registration being a difficult task for this problem. Hence, for each pair of images, we estimate the correlation between all possible orientations and take the maximum value. For a pair of genes and a developmental period, we repeat the above procedure for all pairs of images and again keep the maximum value. By an inspection of the distribution of the correlation coefficient, we select a value <it>k </it>of gene pairs to constrain. In other words, the gene pairs (<it>g</it><sub><it>i</it></sub>, <it>g</it><sub><it>j</it></sub>) displaying the <it>k</it>th highest correlations are positively constrained (<inline-formula><m:math name="1471-2105-8-S10-S3-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>w</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mo>+</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG3bWDdaqhaaWcbaGaemyAaKMaemOAaOgabaGaey4kaScaaaaa@31EA@</m:annotation></m:semantics></m:math></inline-formula> = 1). Similarly, the gene pairs (<it>g</it><sub><it>i</it></sub>, <it>g</it><sub><it>j</it></sub>) displaying the <it>k</it>th lowest correlations are negatively constrained (<inline-formula><m:math name="1471-2105-8-S10-S3-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>w</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mo>&#8722;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG3bWDdaqhaaWcbaGaemyAaKMaemOAaOgabaGaeyOeI0caaaaa@31F5@</m:annotation></m:semantics></m:math></inline-formula> = 1). As we are interested in high quality constraints, we use conservative thresholds, which select only a small percentage of gene pairs to be constrained (less then 2% of genes with <it>in situ </it>images). As a last step, we need to combine the image constraints from the distinct developmental periods. Again, we use a conservative strategy, requiring that a pair of genes is only constrained if we observe a correlation coefficient exceeding our threshold in at least three respectively four developmental periods; cf. Fig. <figr fid="F2">2</figr> for an example. With support of at least three periods, there are 1,756 positive constraints within 170 genes and 2,544 negative constraints within 360 genes. With support of at least four stages, there are 270 positive constraints within 66 genes and 640 negative constraints within 151 genes.</p>
			</sec>
			<sec>
				<st>
					<p>Evaluation</p>
				</st>
				<p>We use multivariate Gaussians with diagonal covariance matrices <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> as our components in all mixture estimations, as we are mainly interested in comparing our semi-supervised approach with the unsupervised scenario. For a given mixture parameterization, we initialize models randomly, repeat the estimation 15 times and choose the one with maximum likelihood. We estimate the optimal number of clusters with the Bayesian Information Criteria (BIC) in the unsupervised setting, which indicates 28 clusters. We use this number for all other runs described below. All data sets and a tool implementing the method are available in our Supplementary Material Web page at <url>http://algorithmics.molgen.mpg.de/Supplements/Insitu</url>, where we also display plots of the clusters, lists of genes, images constraining the clusters, GO term enrichment (as provided by GoStat <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>) and ImaGO term enrichment.</p>
				<sec>
					<st>
						<p>ImaGO term enrichment</p>
					</st>
					<p>A controlled vocabulary, which follows the Gene Ontology <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> standard, was used to annotate gene expression patterns <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. All images deposited in BDGP are annotated to at least one of these terms. We can, as usual with Gene Ontology <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, use a statistical tests to list ImaGO terms, which are overrepresented in a cluster. Given a set of <it>n </it>genes, we count the number <it>c </it>of genes in a given cluster, the number <it>t </it>of genes annotated with a given ImaGO term and the number <it>h </it>of genes that are both in the cluster and annotated with the ImaGO term. The resulting <it>p</it>-value, calculated with the Fisher Exact Test <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> is then used to assess the significance of the count <it>h</it>, given <it>n</it>, <it>c </it>and <it>t</it>. The Fisher Exact Test assumes that the data comes from a Hyper-geometric distribution, and it is equivalent to the Hyper-geometric test. Lower <it>p</it>-values indicate an enrichment in ImaGO terms and, consequently, better results.</p>
					<p>This strategy is useful for evaluating the biological quality of a single cluster, but gives no global assessment for comparing the results given by two clustering solutions. One heuristic way to perform such an analysis is to compare the <it>p</it>-values obtained in two solutions <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. A superior method has a larger number of ImaGO terms with lower <it>p</it>-values.</p>
				</sec>
				<sec>
					<st>
						<p>Selection of constraints and parameters</p>
					</st>
					<p>We evaluate the use of constraints shared by either three or four developmental periods, the use of positive constraints and both positive and negative constraints, and four choices of the parameter <it>&#955;</it><sup>+ </sup>= <it>&#955;</it><sup>- </sup>(0.5, 1.0, 1.5 and 2.0). There is no theory guiding choices of <it>&#955;</it><sup>+ </sup>and <it>&#955;</it><sup>-</sup>, neither is there a definitive "gold standard" data set to optimize them. Hence, motivated by <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, we made the simple choice to give positive and negative constraints equal weight, which should have some impact on the clustering result, but not dominate it.</p>
					<p>As shown in Table <tblr tid="T1">1</tblr>, all constraint combinations lead to an increase in ImaGO term enrichment, except the use of positive and negative constraints from three stages. Furthermore, values of <it>&#955; </it>around 1 lead to an improvement, while higher values tend to deteriorate results. Thus, we choose to use positive constraints derived from three developmental periods and a constraint weight of <it>&#955;</it><sub>+ </sub>= 1.0 in agreement with <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>.</p>
					<tbl id="T1">
						<title>
							<p>Table 1</p>
						</title>
						<caption>
							<p>Selection of Constraints.</p>
						</caption>
						<tblbdy cols="4">
							<r>
								<c>
									<p/>
								</c>
								<c>
									<p/>
								</c>
								<c cspan="2" ca="center">
									<p>Proportion of terms with lower <it>p</it>-values</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>
										<it>&#955;</it>
										<sub>+</sub>
									</p>
								</c>
								<c ca="center">
									<p>
										<it>&#955;</it>
										<sub>-</sub>
									</p>
								</c>
								<c ca="center">
									<p># stages &#8805; 3</p>
								</c>
								<c ca="center">
									<p>&#8805;4</p>
								</c>
							</r>
							<r>
								<c cspan="4">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>0.5</p>
								</c>
								<c ca="center">
									<p>-</p>
								</c>
								<c ca="center">
									<p>51%</p>
								</c>
								<c ca="center">
									<p>48%</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>1.0</p>
								</c>
								<c ca="center">
									<p>-</p>
								</c>
								<c ca="center">
									<p><b>60</b>%</p>
								</c>
								<c ca="center">
									<p><b>56</b>%</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>1.5</p>
								</c>
								<c ca="center">
									<p>-</p>
								</c>
								<c ca="center">
									<p>57%</p>
								</c>
								<c ca="center">
									<p>49%</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>2.0</p>
								</c>
								<c ca="center">
									<p>-</p>
								</c>
								<c ca="center">
									<p>43%</p>
								</c>
								<c ca="center">
									<p>46%</p>
								</c>
							</r>
							<r>
								<c cspan="4">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>0.5</p>
								</c>
								<c ca="center">
									<p>0.5</p>
								</c>
								<c ca="center">
									<p><b>49</b>%</p>
								</c>
								<c ca="center">
									<p>44%</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>1.0</p>
								</c>
								<c ca="center">
									<p>1.0</p>
								</c>
								<c ca="center">
									<p>49%</p>
								</c>
								<c ca="center">
									<p>52%</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>1.5</p>
								</c>
								<c ca="center">
									<p>1.5</p>
								</c>
								<c ca="center">
									<p>40%</p>
								</c>
								<c ca="center">
									<p><b>59</b>%</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>2.0</p>
								</c>
								<c ca="center">
									<p>2.0</p>
								</c>
								<c ca="center">
									<p>43%</p>
								</c>
								<c ca="center">
									<p>47%</p>
								</c>
							</r>
						</tblbdy>
						<tblfn>
							<p>We show the proportion of ImaGO terms with lower <it>p</it>-values in cMoG compared to MoG for constraints from at least 3 or 4 stages, distinct constraint weights <it>&#955;</it><sub>+ </sub>and <it>&#955;</it><sub>- </sub>using positive constraints only respectively positive and negative constraints. Values exceeding 50% indicate an advantage of cMoG.</p>
						</tblfn>
					</tbl>
				</sec>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Competing interests</p>
			</st>
			<p>The authors declare that they have no competing interests.</p>
		</sec>
		<sec>
			<st>
				<p>Authors' contributions</p>
			</st>
			<p>IC implemented the approach and performed the experiments. IC and RK evaluated the clustering results. LO processed the images and wrote the parts of the manuscript describing the image processing. AS designed this study. IC, RK and AS wrote the manuscript. All authors read and approved the final manuscript.</p>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>Thanks to Ruben Schilling for producing the average image displays and Tilman Lange for discussions on the semi-supervised clustering method. The first author is supported by a scholarship from CNPq(Brazil)/DAAD.</p>
				<p>This article has been published as part of <it>BMC Bioinformatics </it>Volume 8 Supplement 10, 2007: Neural Information Processing Systems (NIPS) workshop on New Problems and Methods in Computational Biology. The full contents of the supplement are available online at <url>http://www.biomedcentral.com/1471-2105/8?issue=S10</url>.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Gene expression during the life cycle of Drosophila melanogaster</p>
				</title>
				<aug>
					<au>
						<snm>Arbeitman</snm>
						<fnm>MN</fnm>
					</au>
					<au>
						<snm>Furlong</snm>
						<fnm>EE</fnm>
					</au>
					<au>
						<snm>Imam</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Johnson</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Null</snm>
						<fnm>BH</fnm>
					</au>
					<au>
						<snm>Baker</snm>
						<fnm>BS</fnm>
					</au>
					<au>
						<snm>Krasnow</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Scott</snm>
						<fnm>MP</fnm>
					</au>
					<au>
						<snm>Davis</snm>
						<fnm>RW</fnm>
					</au>
					<au>
						<snm>White</snm>
						<fnm>KP</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2002</pubdate>
				<volume>297</volume>
				<issue>5590</issue>
				<fpage>2270</fpage>
				<lpage>2275</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12351791</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Identification of tightly regulated groups of genes during Drosophila melanogaster embryogenesis</p>
				</title>
				<aug>
					<au>
						<snm>Hooper</snm>
						<fnm>SD</fnm>
					</au>
					<au>
						<snm>Bou&#233;</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Krause</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Jensen</snm>
						<fnm>LJ</fnm>
					</au>
					<au>
						<snm>Mason</snm>
						<fnm>CE</fnm>
					</au>
					<au>
						<snm>Ghanim</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>White</snm>
						<fnm>KP</fnm>
					</au>
					<au>
						<snm>Furlong</snm>
						<fnm>EE</fnm>
					</au>
					<au>
						<snm>Bork</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Mol Syst Biol</source>
				<pubdate>2007</pubdate>
				<volume>3</volume>
				<fpage>72</fpage>
				<lpage>72</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1800352</pubid>
						<pubid idtype="pmpid" link="fulltext">17224916</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>A gene expression map for the euchromatic genome of Drosophila melanogaster</p>
				</title>
				<aug>
					<au>
						<snm>Stolc</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Gauhar</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Mason</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Halasz</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>van Batenburg</snm>
						<fnm>MF</fnm>
					</au>
					<au>
						<snm>Rifkin</snm>
						<fnm>SA</fnm>
					</au>
					<au>
						<snm>Hua</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Herreman</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Tongprasit</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Barbano</snm>
						<fnm>PE</fnm>
					</au>
					<au>
						<snm>Bussemaker</snm>
						<fnm>HJ</fnm>
					</au>
					<au>
						<snm>White</snm>
						<fnm>KP</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2004</pubdate>
				<volume>306</volume>
				<issue>5696</issue>
				<fpage>655</fpage>
				<lpage>660</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">15499012</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Systematic determination of patterns of gene expression during Drosophila embryogenesis</p>
				</title>
				<aug>
					<au>
						<snm>Tomancak</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Beaton</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Weiszmann</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Kwan</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Shu</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Lewis</snm>
						<fnm>SE</fnm>
					</au>
					<au>
						<snm>Richards</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Ashburner</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Hartenstein</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Celniker</snm>
						<fnm>SE</fnm>
					</au>
					<au>
						<snm>Rubin</snm>
						<fnm>GM</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2002</pubdate>
				<volume>3</volume>
				<issue>12</issue>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">151190</pubid>
						<pubid idtype="pmpid" link="fulltext">12537577</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Cluster analysis and display of genome-wide expression patterns</p>
				</title>
				<aug>
					<au>
						<snm>Eisen</snm>
						<fnm>MB</fnm>
					</au>
					<au>
						<snm>Spellman</snm>
						<fnm>PT</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>PO</fnm>
					</au>
					<au>
						<snm>Botstein</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>1998</pubdate>
				<volume>95</volume>
				<issue>25</issue>
				<fpage>14863</fpage>
				<lpage>14868</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">24541</pubid>
						<pubid idtype="pmpid" link="fulltext">9843981</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Correlation between transcriptome and interactome mapping data from Saccharomyces cerevisiae</p>
				</title>
				<aug>
					<au>
						<snm>Ge</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Church</snm>
						<fnm>GM</fnm>
					</au>
					<au>
						<snm>Vidal</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2001</pubdate>
				<volume>29</volume>
				<issue>4</issue>
				<fpage>482</fpage>
				<lpage>486</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11694880</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>A probabilistic functional network of yeast genes</p>
				</title>
				<aug>
					<au>
						<snm>Lee</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Date</snm>
						<fnm>SV</fnm>
					</au>
					<au>
						<snm>Adai</snm>
						<fnm>AT</fnm>
					</au>
					<au>
						<snm>Marcotte</snm>
						<fnm>EM</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2004</pubdate>
				<volume>306</volume>
				<issue>5701</issue>
				<fpage>1555</fpage>
				<lpage>1558</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">15567862</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Analyzing time series gene expression data</p>
				</title>
				<aug>
					<au>
						<snm>Bar-Joseph</snm>
						<fnm>Z</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<issue>16</issue>
				<fpage>2493</fpage>
				<lpage>2503</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">15130923</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Continuous representations of time-series gene expression data</p>
				</title>
				<aug>
					<au>
						<snm>Bar-Joseph</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Gerber</snm>
						<fnm>GK</fnm>
					</au>
					<au>
						<snm>Gifford</snm>
						<fnm>DK</fnm>
					</au>
					<au>
						<snm>Jaakkola</snm>
						<fnm>TS</fnm>
					</au>
					<au>
						<snm>Simon</snm>
						<fnm>I</fnm>
					</au>
				</aug>
				<source>J Comput Biol</source>
				<pubdate>2003</pubdate>
				<volume>10</volume>
				<issue>3&#8211;4</issue>
				<fpage>341</fpage>
				<lpage>356</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12935332</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Clustering of time-course gene expression data using a mixed-effects model with B-splines</p>
				</title>
				<aug>
					<au>
						<snm>Luan</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2003</pubdate>
				<volume>19</volume>
				<issue>4</issue>
				<fpage>474</fpage>
				<lpage>482</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12611802</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>A mixture model-based approach to the clustering of microarray expression data</p>
				</title>
				<aug>
					<au>
						<snm>McLachlan</snm>
						<fnm>GJ</fnm>
					</au>
					<au>
						<snm>Bean</snm>
						<fnm>RW</fnm>
					</au>
					<au>
						<snm>Peel</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2002</pubdate>
				<volume>18</volume>
				<issue>3</issue>
				<fpage>413</fpage>
				<lpage>422</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11934740</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Bayesian mixture model based clustering of replicated microarray data</p>
				</title>
				<aug>
					<au>
						<snm>Medvedovic</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Yeung</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Bumgarner</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<issue>8</issue>
				<fpage>1222</fpage>
				<lpage>1232</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">14871871</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>A Mixture model with random-effects components for clustering correlated gene-expression profiles</p>
				</title>
				<aug>
					<au>
						<snm>Ng</snm>
						<fnm>SK</fnm>
					</au>
					<au>
						<snm>McLachlan</snm>
						<fnm>GJ</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Ben-Tovim Jones</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Ng</snm>
						<fnm>SW</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2006</pubdate>
				<volume>22</volume>
				<issue>14</issue>
				<fpage>1745</fpage>
				<lpage>1752</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">16675467</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Using hidden Markov models to analyze gene expression time course data</p>
				</title>
				<aug>
					<au>
						<snm>Schliep</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Sch&#246;nhuth</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Steinhoff</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2003</pubdate>
				<volume>19</volume>
				<issue>Suppl 1</issue>
				<fpage>i255</fpage>
				<lpage>i263</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12855468</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Robust inference of groups in gene expression time-courses using mixtures of HMMs</p>
				</title>
				<aug>
					<au>
						<snm>Schliep</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Steinhoff</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Schonhuth</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<issue>Suppl 1</issue>
				<fpage>I283</fpage>
				<lpage>I289</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">15262810</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Analyzing Gene Expression Time-Courses</p>
				</title>
				<aug>
					<au>
						<snm>Schliep</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Costa</snm>
						<fnm>IG</fnm>
					</au>
					<au>
						<snm>Steinhoff</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Schonhuth</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>IEEE/ACM Trans Comput Biol Bioinform</source>
				<pubdate>2005</pubdate>
				<volume>2</volume>
				<issue>3</issue>
				<fpage>179</fpage>
				<lpage>193</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">17044182</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Model-based clustering and data transformations for gene expression data</p>
				</title>
				<aug>
					<au>
						<snm>Yeung</snm>
						<fnm>KY</fnm>
					</au>
					<au>
						<snm>Fraley</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Murua</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Raftery</snm>
						<fnm>AE</fnm>
					</au>
					<au>
						<snm>Ruzzo</snm>
						<fnm>WL</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2001</pubdate>
				<volume>17</volume>
				<issue>10</issue>
				<fpage>977</fpage>
				<lpage>987</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11673243</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Clustering gene-expression data with repeated measurements</p>
				</title>
				<aug>
					<au>
						<snm>Yeung</snm>
						<fnm>KY</fnm>
					</au>
					<au>
						<snm>Medvedovic</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Bumgarner</snm>
						<fnm>RE</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2003</pubdate>
				<volume>4</volume>
				<issue>5</issue>
				<fpage>R34</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">156590</pubid>
						<pubid idtype="pmpid" link="fulltext">12734014</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<aug>
					<au>
						<snm>Chapelle</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Schoelkopf</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Zien</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<cnm>(Eds)</cnm>
					</au>
				</aug>
				<source>Semi-Supervised Learning</source>
				<publisher>MIT Press</publisher>
				<pubdate>2006</pubdate>
			</bibl>
			<bibl id="B20">
				<title>
					<p>Learning with Constrained and Unlabelled Data</p>
				</title>
				<aug>
					<au>
						<snm>Lange</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Law</snm>
						<fnm>MHC</fnm>
					</au>
					<au>
						<snm>Jain</snm>
						<fnm>AK</fnm>
					</au>
					<au>
						<snm>Buhmann</snm>
						<fnm>JM</fnm>
					</au>
				</aug>
				<source>CVPR (1)</source>
				<publisher>IEEE Computer Society</publisher>
				<pubdate>2005</pubdate>
				<fpage>731</fpage>
				<lpage>738</lpage>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Semi-supervised Learning with Penalized Probabilistic Clustering</p>
				</title>
				<aug>
					<au>
						<snm>Lu</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Leen</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Advances in Neural Information Processing Systems</source>
				<publisher>MIT Press</publisher>
				<editor>Saul LK, Weiss Y, Bottou L</editor>
				<pubdate>2005</pubdate>
				<volume>17</volume>
				<fpage>849</fpage>
				<lpage>856</lpage>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Distance Metric Learning with Application to Clustering with Side-Information</p>
				</title>
				<aug>
					<au>
						<snm>Xing</snm>
						<fnm>EP</fnm>
					</au>
					<au>
						<snm>Ng</snm>
						<fnm>AY</fnm>
					</au>
					<au>
						<snm>Jordan</snm>
						<fnm>MI</fnm>
					</au>
					<au>
						<snm>Russell</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Advances in Neural Information Processing Systems</source>
				<publisher>Cambridge, MA: MIT Press</publisher>
				<editor>S Becker ST, Obermayer K</editor>
				<pubdate>2003</pubdate>
				<volume>15</volume>
				<fpage>505</fpage>
				<lpage>512</lpage>
			</bibl>
			<bibl id="B23">
				<title>
					<p>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium</p>
				</title>
				<aug>
					<au>
						<snm>Ashburner</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Ball</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Blake</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Botstein</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Butler</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Cherry</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Davis</snm>
						<fnm>AP</fnm>
					</au>
					<au>
						<snm>Dolinski</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Dwight</snm>
						<fnm>SS</fnm>
					</au>
					<au>
						<snm>Eppig</snm>
						<fnm>JT</fnm>
					</au>
					<au>
						<snm>Harris</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Hill</snm>
						<fnm>DP</fnm>
					</au>
					<au>
						<snm>Issel-Tarver</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Kasarskis</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Lewis</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Matese</snm>
						<fnm>JC</fnm>
					</au>
					<au>
						<snm>Richardson</snm>
						<fnm>JE</fnm>
					</au>
					<au>
						<snm>Ringwald</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Rubin</snm>
						<fnm>GM</fnm>
					</au>
					<au>
						<snm>Sherlock</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2000</pubdate>
				<volume>25</volume>
				<fpage>25</fpage>
				<lpage>29</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">10802651</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>On the feasibility of Heterogeneous Analysis of Large Scale Biological Data</p>
				</title>
				<aug>
					<au>
						<snm>Costa</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Schliep</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Proceedings of ECML/PKDD 2006 Workshop on Data and Text Mining for Integrative Biology</source>
				<pubdate>2006</pubdate>
				<fpage>55</fpage>
				<lpage>60</lpage>
			</bibl>
			<bibl id="B25">
				<title>
					<p>Synexpression groups in eukaryotes</p>
				</title>
				<aug>
					<au>
						<snm>Niehrs</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Pollet</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>1999</pubdate>
				<volume>402</volume>
				<issue>6761</issue>
				<fpage>483</fpage>
				<lpage>487</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">10591207</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B26">
				<title>
					<p>A non-radioactive in situ hybridization method for the localization of specific RNAs in Drosophila embryos reveals translational control of the segmentation gene hunchback</p>
				</title>
				<aug>
					<au>
						<snm>Tautz</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Pfeifle</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Chromosoma</source>
				<pubdate>1989</pubdate>
				<volume>98</volume>
				<issue>2</issue>
				<fpage>81</fpage>
				<lpage>85</lpage>
				<xrefbib>
					<pubid idtype="pmpid">2476281</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>Comparing in situ mRNA expression patterns of Drosophila embryos</p>
				</title>
				<aug>
					<au>
						<snm>Peng</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Myers</snm>
						<fnm>EW</fnm>
					</au>
				</aug>
				<source>RECOMB</source>
				<publisher>ACM</publisher>
				<editor>Bourne PE, Gusfield D</editor>
				<pubdate>2004</pubdate>
				<fpage>157</fpage>
				<lpage>166</lpage>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Automatic mining of fruit fly embryo images</p>
				</title>
				<aug>
					<au>
						<snm>Pan</snm>
						<fnm>JY</fnm>
					</au>
					<au>
						<snm>Guilherme</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Balan</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Xing</snm>
						<fnm>EP</fnm>
					</au>
					<au>
						<snm>Traina</snm>
						<fnm>AJM</fnm>
					</au>
					<au>
						<snm>Faloutsos</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>KDD'06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining</source>
				<publisher>New York, NY, USA: ACM Press</publisher>
				<pubdate>2006</pubdate>
				<fpage>693</fpage>
				<lpage>698</lpage>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Clustering gene expression patterns of fly embryos</p>
				</title>
				<aug>
					<au>
						<snm>Peng</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Long</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Eisen</snm>
						<fnm>MB</fnm>
					</au>
					<au>
						<snm>Myers</snm>
						<fnm>EW</fnm>
					</au>
				</aug>
				<source>ISBI</source>
				<publisher>IEEE</publisher>
				<pubdate>2006</pubdate>
				<fpage>1144</fpage>
				<lpage>1147</lpage>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Three-dimensional morphology and gene expression in the Drosophila blastoderm at cellular resolution I: data acquisition pipeline</p>
				</title>
				<aug>
					<au>
						<snm>Hendriks</snm>
						<fnm>CLL</fnm>
					</au>
					<au>
						<snm>Ker&#228;nen</snm>
						<fnm>SVE</fnm>
					</au>
					<au>
						<snm>Fowlkes</snm>
						<fnm>CC</fnm>
					</au>
					<au>
						<snm>Simirenko</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Weber</snm>
						<fnm>GH</fnm>
					</au>
					<au>
						<snm>DePace</snm>
						<fnm>AH</fnm>
					</au>
					<au>
						<snm>Henriquez</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Kaszuba</snm>
						<fnm>DW</fnm>
					</au>
					<au>
						<snm>Hamann</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Eisen</snm>
						<fnm>MB</fnm>
					</au>
					<au>
						<snm>Malik</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Sudar</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Biggin</snm>
						<fnm>MD</fnm>
					</au>
					<au>
						<snm>Knowles</snm>
						<fnm>DW</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2006</pubdate>
				<volume>7</volume>
				<issue>12</issue>
				<fpage>R123</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1794436</pubid>
						<pubid idtype="pmpid" link="fulltext">17184546</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>Three-dimensional morphology and gene expressionin the Drosophila blastoderm at cellular resolution II:dynamics</p>
				</title>
				<aug>
					<au>
						<snm>Ker&#228;nen</snm>
						<fnm>SVE</fnm>
					</au>
					<au>
						<snm>Fowlkes</snm>
						<fnm>CC</fnm>
					</au>
					<au>
						<snm>Hendriks</snm>
						<fnm>CLL</fnm>
					</au>
					<au>
						<snm>Sudar</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Knowles</snm>
						<fnm>DW</fnm>
					</au>
					<au>
						<snm>Malik</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Biggin</snm>
						<fnm>MD</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2006</pubdate>
				<volume>7</volume>
				<issue>12</issue>
				<fpage>R124</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1794437</pubid>
						<pubid idtype="pmpid" link="fulltext">17184547</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<aug>
					<au>
						<snm>McLachlan</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Peel</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Finite Mixture Models. Wiley Series in Probability and Statistics</source>
				<publisher>Wiley, New York</publisher>
				<pubdate>2000</pubdate>
			</bibl>
			<bibl id="B33">
				<title>
					<p>Diversification of cell cycle controls in developing embryos</p>
				</title>
				<aug>
					<au>
						<snm>Edgar</snm>
						<fnm>B</fnm>
					</au>
				</aug>
				<source>Curr Opin Cell Biol</source>
				<pubdate>1995</pubdate>
				<volume>7</volume>
				<issue>6</issue>
				<fpage>815</fpage>
				<lpage>824</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">8608012</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>Gastrulation in Drosophila: the logic and the cellular mechanisms</p>
				</title>
				<aug>
					<au>
						<snm>Leptin</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>The EMBO Journal</source>
				<pubdate>1999</pubdate>
				<volume>18</volume>
				<fpage>3187</fpage>
				<lpage>3192</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1171399</pubid>
						<pubid idtype="pmpid" link="fulltext">10369659</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B35">
				<title>
					<p>Supplementary Material</p>
				</title>
				<url>http://algorithmics.molgen.mpg.de/Supplements/Insitu/</url>
			</bibl>
			<bibl id="B36">
				<title>
					<p>Analysis of fused in-situ hybridization and gene expression data</p>
				</title>
				<aug>
					<au>
						<snm>Opitz</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Schliep</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Posch</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Advances in Data Analysis</source>
				<publisher>Heidelberg, Germany: Springer</publisher>
				<pubdate>2006</pubdate>
				<fpage>157</fpage>
				<lpage>166</lpage>
			</bibl>
			<bibl id="B37">
				<title>
					<p>A Gentle Tutorial on the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models</p>
				</title>
				<aug>
					<au>
						<snm>Bilmes</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Tech rep</source>
				<publisher>International Computer Science Institute</publisher>
				<pubdate>1997</pubdate>
			</bibl>
			<bibl id="B38">
				<title>
					<p>Incorporating gene functions as priors in model-based clustering of microarray gene expression data</p>
				</title>
				<aug>
					<au>
						<snm>Pan</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2006</pubdate>
				<volume>22</volume>
				<issue>7</issue>
				<fpage>795</fpage>
				<lpage>801</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">16434443</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B39">
				<title>
					<p>Systematic determination of patterns of gene expression during Drosophila embryogenesis</p>
				</title>
				<aug>
					<au>
						<snm>Tomancak</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Beaton</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Weiszmann</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Kwan</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Shu</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Lewis</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Richards</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Ashburner</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Hartenstein</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Celniker</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Rubin</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Genome Biology</source>
				<pubdate>2002</pubdate>
				<volume>3</volume>
				<issue>12</issue>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">151190</pubid>
						<pubid idtype="pmpid" link="fulltext">12537577</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B40">
				<title>
					<p>BDGP: Berkeley Drosophila Genome Project</p>
				</title>
				<url>http://www.fruitfly.org</url>
			</bibl>
			<bibl id="B41">
				<aug>
					<au>
						<snm>Gonzalez</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Wintz</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Digital image processing</source>
				<publisher>Addison-Wesley</publisher>
				<pubdate>1991</pubdate>
			</bibl>
			<bibl id="B42">
				<title>
					<p>GOstat: find statistically overrepresented Gene Ontologies within a group of genes</p>
				</title>
				<aug>
					<au>
						<snm>Beissbarth</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Speed</snm>
						<fnm>TP</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<issue>9</issue>
				<fpage>1464</fpage>
				<lpage>1465</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">14962934</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B43">
				<aug>
					<au>
						<snm>Sokal</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Rohlf</snm>
						<fnm>F</fnm>
					</au>
				</aug>
				<source>Biometry</source>
				<publisher>New York: W.H. Freeman and Company</publisher>
				<pubdate>1995</pubdate>
			</bibl>
			<bibl id="B44">
				<title>
					<p>Clustering short time series gene expression data</p>
				</title>
				<aug>
					<au>
						<snm>Ernst</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Nau</snm>
						<fnm>GJ</fnm>
					</au>
					<au>
						<snm>Bar-Joseph</snm>
						<fnm>Z</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>suppl 1</issue>
				<fpage>i159</fpage>
				<lpage>168</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">15961453</pubid>
				</xrefbib>
			</bibl>
		</refgrp>
	</bm>
</art>
