<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-7-514</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>A hierarchical Na&#239;ve Bayes Model for handling sample heterogeneity in classification problems: an application to tissue microarrays</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Demichelis</snm>
               <fnm>Francesca</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>fdemichelis@partners.org</email>
            </au>
            <au id="A2">
               <snm>Magni</snm>
               <fnm>Paolo</fnm>
               <insr iid="I4"/>
               <email>paolo.magni@unipv.it</email>
            </au>
            <au id="A3">
               <snm>Piergiorgi</snm>
               <fnm>Paolo</fnm>
               <insr iid="I4"/>
               <email>Ppiergiorgi@gmail.com</email>
            </au>
            <au id="A4" ca="yes">
               <snm>Rubin</snm>
               <mi>A</mi>
               <fnm>Mark</fnm>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <insr iid="I5"/>
               <email>marubin@partners.org</email>
            </au>
            <au id="A5">
               <snm>Bellazzi</snm>
               <fnm>Riccardo</fnm>
               <insr iid="I4"/>
               <email>riccardo.bellazzi@unipv.it</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Bioinformatics, SRA, ITC-irst &amp; Dept. of Information and Communication Technology, University of Trento, Trento, Italy</p>
            </ins>
            <ins id="I2">
               <p>Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA</p>
            </ins>
            <ins id="I3">
               <p>Harvard Medical School, Boston, MA, USA</p>
            </ins>
            <ins id="I4">
               <p>Dipartimento di Informatica e Sistemistica, Universit&#224; di Pavia, Pavia, Italy</p>
            </ins>
            <ins id="I5">
               <p>Dana Farber Harvard Cancer Center, Boston, MA, USA</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>514</fpage>
         <url>http://www.biomedcentral.com/1471-2105/7/514</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17125514</pubid>
               <pubid idtype="doi">10.1186/1471-2105-7-514</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>20</day>
               <month>7</month>
               <year>2006</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>24</day>
               <month>11</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>24</day>
               <month>11</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Demichelis et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Uncertainty often affects molecular biology experiments and data for different reasons. Heterogeneity of gene or protein expression within the same tumor tissue is an example of biological uncertainty which should be taken into account when molecular markers are used in decision making. Tissue Microarray (TMA) experiments allow for large scale profiling of tissue biopsies, investigating protein patterns characterizing specific disease states. TMA studies deal with multiple sampling of the same patient, and therefore with multiple measurements of the same protein target, to account for possible biological heterogeneity. The aim of this paper is to provide and validate a classification model that takes into consideration the uncertainty associated with measuring replicate samples.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We propose an extension of the well-known Na&#239;ve Bayes classifier, which accounts for biological heterogeneity in a probabilistic framework, relying on Bayesian hierarchical models. The model, which can be efficiently learned from the training dataset, exploits a closed-form classification equation, and therefore incurs no additional computational cost with respect to the standard Na&#239;ve Bayes classifier. We validated the approach on several simulated datasets, comparing its performance with that of the Na&#239;ve Bayes classifier. Moreover, we demonstrated that explicitly dealing with heterogeneity can improve classification accuracy on a TMA prostate cancer dataset.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The proposed Hierarchical Na&#239;ve Bayes classifier can be conveniently applied in problems where within-sample heterogeneity must be taken into account, such as TMA experiments and biological contexts where several measurements (replicates) are available for the same biological sample. The performance of the new approach is better than that of the standard Na&#239;ve Bayes model, in particular when the within-sample heterogeneity differs between classes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The biomedical sciences are fraught with uncertainty. The sources of this uncertainty are manifold. Devices used to monitor biological processes vary in resolution. Gaps in the understanding of basic biology compound this problem. Biological diversity or heterogeneity may make predictions difficult. Finally, uncertainty may be due to unpredictable sources of noise, which can lie inside or outside the biological system itself.</p>
         <p>In molecular biology uncertainty is ubiquitous; for example, tissue heterogeneity makes it difficult to compare a tissue sample composed of pure tumor cell populations with one composed of tumor and other non-tumoral elements such as supporting structural tissues (i.e. stroma) and vessels. However, in molecular biology, one can rarely examine an entire tumor, and biopsies are taken with the assumption that they represent a portion of the whole tumor.</p>
         <p>This paper addresses the uncertainty associated with measuring replicate samples. Understanding this kind of uncertainty would help guide decision making and allow alternative strategies to be explored. Usually, the measurements of replicate samples are averaged to derive a single measurement. This value is then used, for example, when building a classification system, which may play a critical role in the decision making process. Unfortunately, an average measurement (or median value) hides the uncertainty or heterogeneity present in the replicates, and may thus lead to decision making rules which are too reliant on the pooled data. This process may lead to a model that is not sufficiently robust to work on an independent dataset.</p>
         <p>TMA studies represent a context where the issue of biological heterogeneity is particularly relevant. Where gene expression microarray experiments provide researchers with quantitative evaluation of transcripts, TMAs evaluate DNA, RNA or protein targets through <it>in situ </it>investigations (analyses performed on tissues). <it>In situ </it>evaluations are characterized by the fact that the morphology of the analyzed samples is intact and therefore the potential biological heterogeneity within tumor tissue can be analyzed. TMAs allow for large scale profiling of tissue samples. For example, TMAs can be used to investigate panels of proteins that may play a role in tumor progression. They have the potential to be easily translatable to a clinical application such as the development of diagnostic biomarkers, e.g., AMACR <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> or to assess a therapeutic target, e.g., Her-2-neu <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>.</p>
         <p>TMA datasets usually include replicate core biopsies of the same tissue from the same individual to ensure that enough representative tissue is available in each experiment and to better represent the biological variability of the tissue itself and of the protein activity (i.e. accurate sampling). Most TMA datasets are evaluated using straightforward pooling of the data from replicates, thus ignoring variations among biopsies from the same patient (the so-called within-sample variability). The mean, the maximum or the minimum is usually adopted, and the strategy may be based on biological knowledge or on known protein associations. However, it has been found that different choices can lead to covariates with different significance levels in Cox regression <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Interestingly, when multiple biomarkers are evaluated, one approach is chosen and applied to all of them regardless of the biologic implications.</p>
         <p>However, the degree of heterogeneity of the tumor tissue may be an important biological parameter. In a probabilistic framework, for example, accounting for the within-sample variability caused by the tumor tissue heterogeneity could alter the probability of a case belonging to a certain class (even changing the predicted class), providing insight into the particular case study. When measurement occurs at different levels, i.e. different biopsies of the same tumor or different tumors, standard statistical techniques are not appropriate because they either assume that groups belong to entirely different populations or ignore the aggregate information entirely.</p>
         <p>Hierarchical models (multilevel models) provide a way of pooling the information for the different groups without assuming that they belong to precisely the same population <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. They are typically used when information is available but the observation units differ (e.g., meta-analysis of separate randomized trials).</p>
         <p>Herein we propose a classification model which accounts for the within-sample variability of the tumor in a probabilistic framework, relying on Bayesian hierarchical models. Hierarchical Bayes models have been used for modeling individual effects in several experimental contexts, ranging from toxicology to tumor risk <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. For this reason, their use in the classification context seems particularly suitable to handle TMA data and tumor heterogeneity.</p>
         <p>The paper is structured as follows: we first provide relevant background on Bayesian classifiers (specifically on the Na&#239;ve Bayes classifier) and on Bayesian hierarchical models. Then we describe the proposed method and compare its performance to that of a Na&#239;ve Bayes classifier to which we applied standard pooling strategies. The results are shown on simulated datasets characterized by different ratios of within-sample to between-sample variability, and on a real classification problem based on TMA data we generated in our laboratory from a prostate cancer progression array (TMA Core of Dana Farber Harvard Cancer Center, Boston, MA) developed to identify proteins that can distinguish aggressive from indolent forms of this common tumor type.</p>
         <sec>
            <st>
               <p>Bayesian classifiers and the Na&#239;ve Bayes</p>
            </st>
            <p>In this paper we focus on classification problems, where, given the data coming from a target case, we must decide to which class the case belongs. For example, given the set of tumor marker values measured on biopsies of a tissue of a given patient, we must decide if the patient is affected by a particular kind of tumor.</p>
            <p>From a Bayesian viewpoint, a classification problem can be written as the problem of finding the class with maximum probability given a set of observed attribute values. This probability is the posterior probability of the class given the data, and is usually computed using Bayes' theorem, as <it>P</it>(<it>C</it>|<b><it>X</it></b>) = <it>P</it>(<b><it>X</it></b>|<it>C</it>) <it>P</it>(<it>C</it>)/<it>P</it>(<b><it>X</it></b>), where <it>C </it>is any of the possible class values, <b><it>X </it></b>is a vector of <it>N</it><sub><it>feature </it></sub>attribute values, while <it>P</it>(<it>C</it>) and <it>P</it>(<b><it>X</it></b>|<it>C</it>) are the prior probability of the class and the conditional probability of the attribute values given the class, respectively. Usually Bayesian classifiers maximize <it>P</it>(<b><it>X</it></b>|<it>C</it>)<it>P</it>(<it>C</it>), which is proportional to <it>P</it>(<it>C</it>|<b><it>X</it></b>), since <it>P</it>(<b><it>X</it></b>) is constant given a dataset.</p>
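            <p>As a minimal illustration of this rule, the following Python sketch normalizes the products of class-conditional likelihood and prior to obtain the posterior of each class. The function name, the class labels and the numerical values are hypothetical and were chosen only for the example; they are not taken from the study.</p>

```python
def posterior(likelihoods, priors):
    """Return P(C|X) for every class from P(X|C) and P(C)."""
    # Numerator of Bayes' theorem for each class: P(X|C) * P(C).
    joint = {c: likelihoods[c] * priors[c] for c in priors}
    # P(X) is the sum of the numerators and is the same for every class.
    evidence = sum(joint.values())
    return {c: joint[c] / evidence for c in joint}


# Hypothetical values, for illustration only.
priors = {"tumor": 0.3, "benign": 0.7}
likelihoods = {"tumor": 0.20, "benign": 0.05}

post = posterior(likelihoods, priors)
# Dividing by P(X) does not change the ranking of the classes,
# so the same class maximizes the numerator and the posterior.
predicted = max(post, key=post.get)
```

            <p>In this example the class with the smaller prior is still selected, because its likelihood is four times larger than that of the alternative.</p>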
            <p>Bayesian classifiers are known to be the optimal classifiers, since they minimize the risk of misclassification. However, they require defining <it>P</it>(<b><it>X</it></b>|<it>C</it>), i.e. the <it>joint </it>probability of the attributes given the class. Estimating this probability distribution from a training dataset is a difficult problem, because even for a moderate number of attributes it may require a very large dataset to explore all the possible combinations adequately.</p>
            <p>Conversely, in the framework of Na&#239;ve Bayes classifiers, the attributes are assumed to be independent of each other given the class. This allows us to write, following Bayes' theorem, the posterior probability of the class <it>C </it>as: <it>p</it>(<it>C</it>|<b><it>X</it></b>) = &#8719;<sub><it>l </it>= 1</sub><sup><it>Nfeature </it></sup><it>p</it>(<b>X</b><sup><it>l</it></sup>|<it>C</it>) <it>p</it>(<it>C</it>)/<it>P</it>(<b><it>X</it></b>). The Na&#239;ve Bayes classifier is therefore fully defined simply by the conditional probabilities of each attribute given the class. The conditional independence property largely simplifies the learning process of the model from data. In the presence of discrete and Gaussian data this process turns out to be straightforward. Despite its simplicity, the Na&#239;ve Bayes classifier is known to be a robust method, which shows on average good performance in terms of classification accuracy, even when the independence assumption does not hold <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. Due to its fast induction, the Na&#239;ve Bayes classifier is often considered as a reference method in classification studies. Several approaches have been proposed to generalize this classifier <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> and there has been recent interest in applying hierarchical models to Bayesian classification, such as in the field of expression array analysis <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. In this paper we present a step forward, describing a hierarchical Na&#239;ve Bayes model which can be conveniently used for classification purposes in the presence of replicated measurements, as for TMA data.</p>
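            <p>The factorization above can be sketched in a few lines of Python for Gaussian attributes. This is only an illustrative sketch: it assumes one value per attribute (no replicates), works with logarithms so that the product becomes a sum, and uses hypothetical function names, class labels and parameter values.</p>

```python
import math


def gaussian_logpdf(x, mean, var):
    """Log density of a univariate normal distribution."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def naive_bayes_log_scores(x, class_params, priors):
    """Return log p(C) + sum over attributes of log p(x_l | C) per class."""
    scores = {}
    for c, params in class_params.items():
        score = math.log(priors[c])
        # Conditional independence: one univariate term per attribute.
        for value, (mean, var) in zip(x, params):
            score += gaussian_logpdf(value, mean, var)
        scores[c] = score
    return scores


# Hypothetical (mean, variance) pairs for two attributes in two classes.
class_params = {
    "A": [(0.0, 1.0), (2.0, 1.0)],
    "B": [(1.0, 1.0), (0.0, 1.0)],
}
priors = {"A": 0.5, "B": 0.5}

scores = naive_bayes_log_scores([0.2, 1.8], class_params, priors)
predicted = max(scores, key=scores.get)
```

            <p>The scores are unnormalized log posteriors; the common term log <it>P</it>(<b><it>X</it></b>) is omitted because it does not affect which class attains the maximum.</p>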
         </sec>
         <sec>
            <st>
               <p>Bayesian Hierarchical Models</p>
            </st>
            <p>Bayesian hierarchical models (BHM) <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> are powerful instruments able to describe complex interactions between the parameters of a stochastic model. In particular, BHMs are often used to describe population models, in which the parameters characterizing the model of an individual are considered to be related to the parameters of the other individuals belonging to the same population. In this paper we will cope with the problem of classifying tumors of different patients for which repeated measurements of tumor markers are available. The probability distribution of such measurements will depend on the patient; however, all the patients suffering from the same disease will be assumed to be related to each other in terms of tumor marker probability distributions.</p>
            <p>Bayesian hierarchical models provide a natural way to represent this relationship by specifying a suitable conditional independence structure and a suitable set of conditional probability distributions.</p>
            <p>The simplest structure of a BHM can be summarized as follows: let us suppose that a certain variable <it>x </it>(e.g. a tumor marker) has been measured <it>n</it><sub><it>rep </it></sub>times in <it>m </it>patients belonging to the same population, i.e. they have the same tumor type. Let us also suppose that <it>x </it>is a stochastic variable depending on a set of parameters <it>&#952;</it>, so that for the <it>i</it>-th subject, such dependency is expressed by the probability <it>p</it>(<it>x</it><sub><it>i</it></sub>| <it>&#952;</it><sub><it>i</it></sub>). The assumption that the individuals are "related" to each other can then be represented by introducing the conditional probability <it>p</it>(<it>&#952;</it><sub><it>i</it></sub>|<it>&#981;</it>), where <it>&#981; </it>is a set of hyper-parameters typical of a population. In this way, each subject is characterized by a probability distribution that depends on population parameters which are common to all individuals of the same population. If we assign a prior distribution to <it>&#981;</it>, say <it>p</it>(<it>&#981;</it>), the joint prior distribution will be <it>p</it>(<it>&#981;</it>,<it>&#952;</it>) = <it>p</it>(<it>&#952; </it>|<it>&#981;</it>)<it>p</it>(<it>&#981;</it>), where <it>&#952; </it>= {<it>&#952;</it><sub>1</sub>, <it>&#952;</it><sub>2</sub>, ..., <it>&#952;</it><sub><it>m</it></sub>}. Once a data set <b><it>X </it></b>= {<b><it>X</it></b><sub>1</sub>,..., <b><it>X</it></b><sub><it>m</it></sub>} is observed on all <it>m </it>patients, where <b>X</b><sub><it>i </it></sub>= (<it>x</it><sub><it>i</it>1</sub>, <it>x</it><sub><it>i</it>2</sub>,..., <m:math name="1471-2105-7-514-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>x</m:mi><m:mrow><m:mi>i</m:mi><m:msub><m:mi>n</m:mi><m:mrow><m:mi>r</m:mi><m:mi>e</m:mi><m:mi>p</m:mi><m:mi>i</m:mi></m:mrow></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG4baEdaWgaaWcbaGaemyAaKMaemOBa42aaSbaaWqaaiabdkhaYjabdwgaLjabdchaWjabdMgaPbqabaaaleqaaaaa@36CD@</m:annotation></m:semantics></m:math>) is the measurement vector for the <it>i</it>-th patient, the joint posterior distribution <it>p</it>(<it>&#981;</it>, <it>&#952;</it>|<b><it>X</it></b>) can be computed by applying the Bayes theorem. It is easy to show that such distribution is proportional to <it>p</it>(<b><it>X</it></b>|<it>&#952;</it>) <it>p</it>(<it>&#952;</it>|<it>&#981;</it>) <it>p</it>(<it>&#981;</it>). Since the population parameters are usually unknown, the integral of such equation over <it>&#981; </it>allows to calculate the posterior distributions for the model parameters <it>&#952; </it>given the data coming from all patients.</p>
            <p>BHMs have been applied in a variety of contexts, ranging from signal processing <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> to medicine and pharmacology <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp> and to bioinformatics <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. Moreover, several computational techniques for specifying and fitting BHMs have been introduced to deal also with discrete responses, multivariate models, survival models and time series models. Useful reviews can be found in papers and books <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. Recently, hierarchical models have been applied to extend the Na&#239;ve Bayes model in order to relax the assumption of conditional independence between attributes <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. In our paper we will use hierarchical models to handle repeated measurements and their heterogeneity.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>The Hierarchical Na&#239;ve Bayes approach</p>
            </st>
            <p>From a probabilistic perspective, a classification problem can be viewed as the selection of the class which has the highest (posterior) probability given the available data. Here we explicitly handle data with multiple replicate values.</p>
            <p>In the case of TMA, we can think of one case as being the tumor tissue of one patient and replicates being the multiple biopsies from that patient. Therefore, the evaluation of a target protein (feature) on a TMA section will provide the pathologist with multiple evaluations of protein expression for each patient (case).</p>
            <p>Let <m:math name="1471-2105-7-514-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>x</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mrow><m:mi>l</m:mi><m:mi>k</m:mi></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG4baEdaqhaaWcbaGaemyAaKMaemOAaOgabaGaemiBaWMaem4AaSgaaaaa@33CA@</m:annotation></m:semantics></m:math> be the <it>j</it>-th replicate measurement of the <it>l</it>-th feature of the <it>i</it>-th case corresponding to class <it>k</it>. For the sake of simplicity, given a class <it>k </it>and a feature <it>l</it>, we will write it as <it>x</it><sub><it>ij</it></sub>, <it>j </it>= 1,...,<it>n</it><sub><it>rep</it></sub>, <it>i </it>= 1,...,<it>N</it><sub><it>Ck</it></sub>, where <it>N</it><sub><it>Ck </it></sub>is the number of cases in the class <it>C</it><sub><it>k</it></sub>.</p>
            <p>Let us assume that the values of replicates of the generic case <it>i </it>are normally distributed around a mean value <it>&#956;</it><sub><it>i </it></sub>with a variance <it>&#963;</it><sup>2 </sup>(independent of <it>i</it>, but dependent on the class <it>k</it>), i.e. <it>x</it><sub><it>ij</it></sub>~<it>N</it>(<it>&#956;</it><sub><it>i</it></sub>, <it>&#963;</it><sup>2</sup>). The mean values <it>&#956;</it><sub><it>i </it></sub>are, in turn, normally distributed around a "population" mean value <it>M </it>with variance <it>&#964;</it><sup>2</sup>, i.e. <it>&#956;</it><sub><it>i</it></sub>~<it>N</it>(<it>M</it>, <it>&#964;</it><sup>2</sup>). The assumption that the variance is the same for all patients belonging to the same class reflects the intuitive notion that the variability over replicates, due for example to the tissue heterogeneity, is a property of the disease. Such an assumption, which is realistic in TMA data, turns out to be convenient when estimating the variance from the data: the reliability of the estimate is increased by the higher number of measurements exploited. The resulting hierarchical model is presented in Fig. <figr fid="F1">1</figr>.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Structure of a hierarchical model</p>
               </caption>
               <text>
                  <p><b>Structure of a hierarchical model</b>. The replicates <it>j </it>of the generic subject <it>i </it>are normally distributed around a mean value <it>&#956;</it><sub><it>i </it></sub>with a within sample variance <it>&#963;</it><sup>2</sup>, i.e. <it>x</it><sub><it>ij</it></sub>~<it>N</it>(<it>&#956;</it><sub><it>i</it></sub>, <it>&#963;</it><sup>2</sup>). The mean values <it>&#956;</it><sub><it>i </it></sub>are normally distributed around a "population" mean value <it>M </it>with between sample variance <it>&#964;</it><sup>2</sup>, i.e. <it>&#956;</it><sub><it>i</it></sub>~<it>N</it>(<it>M</it>, <it>&#964;</it><sup>2</sup>).</p>
               </text>
               <graphic file="1471-2105-7-514-1"/>
            </fig>
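            <p>The two-level structure of Figure <figr fid="F1">1</figr> can be made concrete by sampling from it. The Python sketch below draws a case mean <it>&#956;</it><sub><it>i</it></sub> around <it>M </it>and then draws the replicates around <it>&#956;</it><sub><it>i</it></sub>. The function name, the seed and all parameter values are hypothetical; the sketch is not the simulation procedure used in this study.</p>

```python
import random
import statistics


def simulate_class(n_cases, n_rep, M, tau2, sigma2, seed=0):
    """Sample replicate measurements for the cases of a single class."""
    rng = random.Random(seed)
    data = []
    for _ in range(n_cases):
        # Between-sample level: case mean drawn around the population mean.
        mu_i = rng.gauss(M, tau2 ** 0.5)
        # Within-sample level: replicates drawn around the case mean.
        data.append([rng.gauss(mu_i, sigma2 ** 0.5) for _ in range(n_rep)])
    return data


# Hypothetical settings, for illustration only.
cases = simulate_class(n_cases=2000, n_rep=4, M=1.0, tau2=0.5, sigma2=2.0)

# Averaging the replicates of each case mixes the two variance components:
# the variance of the case averages is expected to be tau2 + sigma2 / n_rep.
case_means = [statistics.mean(case) for case in cases]
observed_var = statistics.variance(case_means)
expected_var = 0.5 + 2.0 / 4
```

            <p>The comparison of the observed and expected variances shows why a pooled average alone cannot separate within-sample from between-sample variability.</p>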
            <p>Given this probabilistic model, we here describe how to classify a new case, supposing that the class model parameters <it>M</it>, <it>&#964;</it><sup>2</sup>, <it>&#963;</it><sup>2 </sup>are known for each class. In the Methods section we detail how to learn the model parameters from a training dataset. Moreover, the same section reports the classification and learning phases of the Standard Na&#239;ve Bayes (StNB) classifier, in order to highlight the differences.</p>
            <p>To classify a new case in the Bayesian framework it is necessary to evaluate the posterior probability of each class given the case data. Let us define the vector <b>X</b><sub><it>i </it></sub>= (<it>x</it><sub><it>i</it>1</sub>, <it>x</it><sub><it>i</it>2</sub>,..., <it>x</it><sub><it>inrepi</it></sub>), which represents the replicate measurements of the <it>i</it>-th case for a given feature (univariate case). For the sake of simplicity, we omit the sub-index <it>i </it>hereafter.</p>
            <p>By applying Bayes' theorem, the posterior probability of the class <it>C</it><sub><it>k </it></sub>given the set of data is</p>
            <p>
               <m:math name="1471-2105-7-514-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>P</m:mi>
                        <m:mo stretchy="false">(</m:mo>
                        <m:msub>
                           <m:mi>C</m:mi>
                           <m:mi>k</m:mi>
                        </m:msub>
                        <m:mo stretchy="false">|</m:mo>
                        <m:mi>X</m:mi>
                        <m:mo>,</m:mo>
                        <m:msup>
                           <m:mi>&#963;</m:mi>
                           <m:mn>2</m:mn>
                        </m:msup>
                        <m:mo>,</m:mo>
                        <m:mi>M</m:mi>
                        <m:mo>,</m:mo>
                        <m:msup>
                           <m:mi>&#964;</m:mi>
                           <m:mn>2</m:mn>
                        </m:msup>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>=</m:mo>
                        <m:mfrac>
                           <m:mrow>
                              <m:mi>P</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mi>X</m:mi>
                              <m:mo>|</m:mo>
                              <m:msub>
                                 <m:mi>C</m:mi>
                                 <m:mi>k</m:mi>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msup>
                                 <m:mi>&#963;</m:mi>
                                 <m:mn>2</m:mn>
                              </m:msup>
                              <m:mo>,</m:mo>
                              <m:mi>M</m:mi>
                              <m:mo>,</m:mo>
                              <m:msup>
                                 <m:mi>&#964;</m:mi>
                                 <m:mn>2</m:mn>
                              </m:msup>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mi>P</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:msub>
                                 <m:mi>C</m:mi>
                                 <m:mi>k</m:mi>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                           </m:mrow>
                           <m:mrow>
                              <m:mi>p</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mi>X</m:mi>
                              <m:mo stretchy="false">)</m:mo>
                           </m:mrow>
                        </m:mfrac>
                        <m:mo>&#8733;</m:mo>
                        <m:mi>P</m:mi>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mi>X</m:mi>
                        <m:mo stretchy="false">|</m:mo>
                        <m:msub>
                           <m:mi>C</m:mi>
                           <m:mi>k</m:mi>
                        </m:msub>
                        <m:mo>,</m:mo>
                        <m:msup>
                           <m:mi>&#963;</m:mi>
                           <m:mn>2</m:mn>
                        </m:msup>
                        <m:mo>,</m:mo>
                        <m:mi>M</m:mi>
                        <m:mo>,</m:mo>
                        <m:msup>
                           <m:mi>&#964;</m:mi>
                           <m:mn>2</m:mn>
                        </m:msup>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mi>P</m:mi>
                        <m:mo stretchy="false">(</m:mo>
                        <m:msub>
                           <m:mi>C</m:mi>
                           <m:mi>k</m:mi>
                        </m:msub>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>.</m:mo>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGqbaucqGGOaakcqWGdbWqdaWgaaWcbaGaem4AaSgabeaakiabcYha8Hqabiab=HfayjabcYcaSGGaciab+n8aZnaaCaaaleqabaGaeGOmaidaaOGaeiilaWIaemyta0KaeiilaWIae4hXdq3aaWbaaSqabeaacqaIYaGmaaGccqGGPaqkcqGH9aqpdaWcaaqaaiabdcfaqjabcIcaOiab=HfayjabcYha8jabdoeadnaaBaaaleaacqWGRbWAaeqaaOGaeiilaWIae43Wdm3aaWbaaSqabeaacqaIYaGmaaGccqGGSaalcqWGnbqtcqGGSaalcqGFepaDdaahaaWcbeqaaiabikdaYaaakiabcMcaPiabdcfaqjabcIcaOiabdoeadnaaBaaaleaacqWGRbWAaeqaaOGaeiykaKcabaGaemiCaaNaeiikaGIae8hwaGLaeiykaKcaaiabg2Hi1kabdcfaqjabcIcaOiab=HfayjabcYha8jabdoeadnaaBaaaleaacqWGRbWAaeqaaOGaeiilaWIae43Wdm3aaWbaaSqabeaacqaIYaGmaaGccqGGSaalcqWGnbqtcqGGSaalcqGFepaDdaahaaWcbeqaaiabikdaYaaakiabcMcaPiabdcfaqjabcIcaOiabdoeadnaaBaaaleaacqWGRbWAaeqaaOGaeiykaKIaeiOla4caaa@74D9@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>To evaluate the posterior probability, the marginal likelihood <it>P</it>(<b><it>X</it></b>| <it>C</it><sub><it>k</it></sub>, <it>&#963;</it><sup>2</sup>, <it>M</it>, <it>&#964;</it><sup>2</sup>) can be computed by exploiting the conditional independence assumptions encoded in the hierarchical model of Figure <figr fid="F1">1</figr>, that is, by integrating over the mean <it>&#956;</it>:</p>
            <p>
               <m:math name="1471-2105-7-514-i4" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>P</m:mi>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mi>X</m:mi>
                        <m:mo>|</m:mo>
                        <m:msub>
                           <m:mi>C</m:mi>
                           <m:mi>k</m:mi>
                        </m:msub>
                        <m:mo>,</m:mo>
                        <m:msup>
                           <m:mi>&#963;</m:mi>
                           <m:mn>2</m:mn>
                        </m:msup>
                        <m:mo>,</m:mo>
                        <m:mi>M</m:mi>
                        <m:mo>,</m:mo>
                        <m:msup>
                           <m:mi>&#964;</m:mi>
                           <m:mn>2</m:mn>
                        </m:msup>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>=</m:mo>
                        <m:mstyle displaystyle="true">
                           <m:mrow>
                              <m:munder>
                                 <m:mo>&#8747;</m:mo>
                                 <m:mi>&#956;</m:mi>
                              </m:munder>
                              <m:mrow>
                                 <m:mi>P</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>X</m:mi>
                                 <m:mo>|</m:mo>
                                 <m:mi>&#956;</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:msup>
                                    <m:mi>&#963;</m:mi>
                                    <m:mn>2</m:mn>
                                 </m:msup>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mi>P</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>&#956;</m:mi>
                                 <m:mo>|</m:mo>
                                 <m:mi>M</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:msup>
                                    <m:mi>&#964;</m:mi>
                                    <m:mn>2</m:mn>
                                 </m:msup>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mi>d</m:mi>
                                 <m:mi>&#956;</m:mi>
                                 <m:mo>.</m:mo>
                              </m:mrow>
                           </m:mrow>
                        </m:mstyle>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGqbaucqGGOaakieqacqWFybawcqGG8baFcqWGdbWqdaWgaaWcbaGaem4AaSgabeaakiabcYcaSGGaciab+n8aZnaaCaaaleqabaGaeGOmaidaaOGaeiilaWIaemyta0KaeiilaWIae4hXdq3aaWbaaSqabeaacqaIYaGmaaGccqGGPaqkcqGH9aqpdaWdrbqaaiabdcfaqjabcIcaOiab=HfayjabcYha8jab+X7aTjabcYcaSiab+n8aZnaaCaaaleqabaGaeGOmaidaaOGaeiykaKIaemiuaaLaeiikaGIae4hVd0MaeiiFaWNaemyta0KaeiilaWIae4hXdq3aaWbaaSqabeaacqaIYaGmaaGccqGGPaqkcqWGKbazcqGF8oqBcqGGUaGlaSqaaiab+X7aTbqab0Gaey4kIipaaaa@5D68@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>The marginal likelihood can then be written explicitly as follows (for the sake of readability, the subscript <it>k</it> and the model parameters <it>M</it>, <it>&#964;</it><sup>2</sup> and <it>&#963;</it><sup>2</sup> have been omitted from the left-hand side of the equation):</p>
            <p>
               <m:math name="1471-2105-7-514-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>P</m:mi>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mi>X</m:mi>
                        <m:mo stretchy="false">|</m:mo>
                        <m:mi>C</m:mi>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>=</m:mo>
                        <m:mfrac>
                           <m:mn>1</m:mn>
                           <m:mrow>
                              <m:msup>
                                 <m:mrow>
                                    <m:mrow>
                                       <m:mo>(</m:mo>
                                       <m:mrow>
                                          <m:mi>&#963;</m:mi>
                                          <m:msqrt>
                                             <m:mrow>
                                                <m:mn>2</m:mn>
                                                <m:mi>&#960;</m:mi>
                                             </m:mrow>
                                          </m:msqrt>
                                       </m:mrow>
                                       <m:mo>)</m:mo>
                                    </m:mrow>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>n</m:mi>
                                       <m:mrow>
                                          <m:mi>r</m:mi>
                                          <m:mi>e</m:mi>
                                          <m:mi>p</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                 </m:mrow>
                              </m:msup>
                              <m:mrow>
                                 <m:mo>(</m:mo>
                                 <m:mrow>
                                    <m:mi>&#964;</m:mi>
                                    <m:msqrt>
                                       <m:mrow>
                                          <m:mn>2</m:mn>
                                          <m:mi>&#960;</m:mi>
                                       </m:mrow>
                                    </m:msqrt>
                                 </m:mrow>
                                 <m:mo>)</m:mo>
                              </m:mrow>
                           </m:mrow>
                        </m:mfrac>
                        <m:mo>*</m:mo>
                        <m:mstyle displaystyle="true">
                           <m:mrow>
                              <m:munder>
                                 <m:mo>&#8747;</m:mo>
                                 <m:mi>&#956;</m:mi>
                              </m:munder>
                              <m:mrow>
                                 <m:mi>exp</m:mi>
                                 <m:mo>&#8289;</m:mo>
                              </m:mrow>
                           </m:mrow>
                        </m:mstyle>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mrow>
                              <m:mo>&#8722;</m:mo>
                              <m:mfrac>
                                 <m:mn>1</m:mn>
                                 <m:mrow>
                                    <m:mn>2</m:mn>
                                    <m:msup>
                                       <m:mi>&#963;</m:mi>
                                       <m:mn>2</m:mn>
                                    </m:msup>
                                 </m:mrow>
                              </m:mfrac>
                              <m:mstyle displaystyle="true">
                                 <m:munder>
                                    <m:mo>&#8721;</m:mo>
                                    <m:mi>j</m:mi>
                                 </m:munder>
                                 <m:mrow>
                                    <m:msup>
                                       <m:mrow>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:msub>
                                             <m:mi>x</m:mi>
                                             <m:mi>j</m:mi>
                                          </m:msub>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mi>&#956;</m:mi>
                                          <m:mo stretchy="false">)</m:mo>
                                       </m:mrow>
                                       <m:mn>2</m:mn>
                                    </m:msup>
                                 </m:mrow>
                              </m:mstyle>
                              <m:mo>&#8722;</m:mo>
                              <m:mfrac>
                                 <m:mn>1</m:mn>
                                 <m:mrow>
                                    <m:mn>2</m:mn>
                                    <m:msup>
                                       <m:mi>&#964;</m:mi>
                                       <m:mn>2</m:mn>
                                    </m:msup>
                                 </m:mrow>
                              </m:mfrac>
                              <m:msup>
                                 <m:mrow>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mi>&#956;</m:mi>
                                    <m:mo>&#8722;</m:mo>
                                    <m:mi>M</m:mi>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                                 <m:mn>2</m:mn>
                              </m:msup>
                           </m:mrow>
                           <m:mo>)</m:mo>
                        </m:mrow>
                        <m:mi>d</m:mi>
                        <m:mi>&#956;</m:mi>
                        <m:mo>.</m:mo>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGqbaucqGGOaakieqacqWFybawcqGG8baFcqWGdbWqcqGGPaqkcqGH9aqpdaWcaaqaaiabigdaXaqaamaabmaabaacciGae43Wdm3aaOaaaeaacqaIYaGmcqGFapaCaSqabaaakiaawIcacaGLPaaadaahaaWcbeqaaiabd6gaUnaaBaaameaacqWGYbGCcqWGLbqzcqWGWbaCaeqaaaaakmaabmaabaGae4hXdq3aaOaaaeaacqaIYaGmcqGFapaCaSqabaaakiaawIcacaGLPaaaaaGaeiOkaOYaa8quaeaacyGGLbqzcqGG4baEcqGGWbaCaSqaaiab+X7aTbqab0Gaey4kIipakmaabmaabaGaeyOeI0YaaSaaaeaacqaIXaqmaeaacqaIYaGmcqGFdpWCdaahaaWcbeqaaiabikdaYaaaaaGcdaaeqbqaaiabcIcaOiabdIha4naaBaaaleaacqWGQbGAaeqaaOGaeyOeI0Iae4hVd0MaeiykaKYaaWbaaSqabeaacqaIYaGmaaaabaGaemOAaOgabeqdcqGHris5aOGaeyOeI0YaaSaaaeaacqaIXaqmaeaacqaIYaGmcqGFepaDdaahaaWcbeqaaiabikdaYaaaaaGccqGGOaakcqGF8oqBcqGHsislcqWGnbqtcqGGPaqkdaahaaWcbeqaaiabikdaYaaaaOGaayjkaiaawMcaaiabdsgaKjab+X7aTjabc6caUaaa@7425@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
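            <p>As a brief sketch of the algebra involved, the exponent of the integrand can be rearranged as a quadratic in <it>&#956;</it>, after which the standard Gaussian integral applies. The symbols <it>a</it> and <it>b</it> are shorthand introduced here for illustration only, and <it>x</it><sub><it>mean</it></sub> denotes the mean of the <it>n</it><sub><it>rep</it></sub> replicates; the complete derivation is the one given in the additional file.</p>

```latex
\[
-\frac{1}{2\sigma^2}\sum_j (x_j-\mu)^2 - \frac{1}{2\tau^2}(\mu-M)^2
= -\frac{a}{2}\mu^2 + b\,\mu - \frac{\sum_j x_j^2}{2\sigma^2} - \frac{M^2}{2\tau^2},
\]
\[
a = \frac{n_{rep}\tau^2+\sigma^2}{\sigma^2\tau^2},
\qquad
b = \frac{n_{rep}\,x_{mean}}{\sigma^2} + \frac{M}{\tau^2},
\]
\[
\int_{-\infty}^{\infty} \exp\!\left(-\frac{a}{2}\mu^2 + b\,\mu\right) d\mu
= \sqrt{\frac{2\pi}{a}}\,\exp\!\left(\frac{b^2}{2a}\right).
\]
```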
            <p>Applying simple algebra (details are reported in <supplr sid="S1">Additional file 1</supplr>), we obtain:</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>The marginal likelihood for the Hierarchical Na&#239;ve Bayes Model.</p>
               </text>
               <file name="1471-2105-7-514-S1.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>
               <m:math name="1471-2105-7-514-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>P</m:mi>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mi>X</m:mi>
                        <m:mo stretchy="false">|</m:mo>
                        <m:mi>C</m:mi>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>=</m:mo>
                        <m:mfrac>
                           <m:mi>&#963;</m:mi>
                           <m:mrow>
                              <m:msup>
                                 <m:mrow>
                                    <m:mrow>
                                       <m:mo>(</m:mo>
                                       <m:mrow>
                                          <m:msqrt>
                                             <m:mrow>
                                                <m:mn>2</m:mn>
                                                <m:mi>&#960;</m:mi>
                                             </m:mrow>
                                          </m:msqrt>
                                          <m:mi>&#963;</m:mi>
                                       </m:mrow>
                                       <m:mo>)</m:mo>
                                    </m:mrow>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>n</m:mi>
                                       <m:mrow>
                                          <m:mi>r</m:mi>
                                          <m:mi>e</m:mi>
                                          <m:mi>p</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                 </m:mrow>
                              </m:msup>
                              <m:msqrt>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>n</m:mi>
                                       <m:mrow>
                                          <m:mi>r</m:mi>
                                          <m:mi>e</m:mi>
                                          <m:mi>p</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:msup>
                                       <m:mi>&#964;</m:mi>
                                       <m:mn>2</m:mn>
                                    </m:msup>
                                    <m:mo>+</m:mo>
                                    <m:msup>
                                       <m:mi>&#963;</m:mi>
                                       <m:mn>2</m:mn>
                                    </m:msup>
                                 </m:mrow>
                              </m:msqrt>
                           </m:mrow>
                        </m:mfrac>
                        <m:mi>exp</m:mi>
                        <m:mo>&#8289;</m:mo>
                        <m:mo>&#8722;</m:mo>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mrow>
                              <m:mfrac>
                                 <m:mrow>
                                    <m:mstyle displaystyle="true">
                                       <m:munder>
                                          <m:mo>&#8721;</m:mo>
                                          <m:mi>j</m:mi>
                                       </m:munder>
                                       <m:mrow>
                                          <m:msubsup>
                                             <m:mi>x</m:mi>
                                             <m:mi>j</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msubsup>
                                       </m:mrow>
                                    </m:mstyle>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:mn>2</m:mn>
                                    <m:msup>
                                       <m:mi>&#963;</m:mi>
                                       <m:mn>2</m:mn>
                                    </m:msup>
                                 </m:mrow>
                              </m:mfrac>
                              <m:mo>+</m:mo>
                              <m:mfrac>
                                 <m:mrow>
                                    <m:msup>
                                       <m:mi>M</m:mi>
                                       <m:mn>2</m:mn>
                                    </m:msup>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:mn>2</m:mn>
                                    <m:msup>
                                       <m:mi>&#964;</m:mi>
                                       <m:mn>2</m:mn>
                                    </m:msup>
                                 </m:mrow>
                              </m:mfrac>
                           </m:mrow>
                           <m:mo>)</m:mo>
                        </m:mrow>
                        <m:mo>*</m:mo>
                        <m:mi>exp</m:mi>
                        <m:mo>&#8289;</m:mo>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mrow>
                              <m:mfrac>
                                 <m:mrow>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mi>&#964;</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                          <m:msubsup>
                                             <m:mi>n</m:mi>
                                             <m:mrow>
                                                <m:mi>r</m:mi>
                                                <m:mi>e</m:mi>
                                                <m:mi>p</m:mi>
                                             </m:mrow>
                                             <m:mn>2</m:mn>
                                          </m:msubsup>
                                          <m:msubsup>
                                             <m:mi>x</m:mi>
                                             <m:mrow>
                                                <m:mi>m</m:mi>
                                                <m:mi>e</m:mi>
                                                <m:mi>a</m:mi>
                                                <m:mi>n</m:mi>
                                             </m:mrow>
                                             <m:mn>2</m:mn>
                                          </m:msubsup>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mi>&#963;</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                       </m:mrow>
                                    </m:mfrac>
                                    <m:mo>+</m:mo>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mi>&#963;</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                          <m:msup>
                                             <m:mi>M</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mi>&#964;</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                       </m:mrow>
                                    </m:mfrac>
                                    <m:mo>+</m:mo>
                                    <m:mn>2</m:mn>
                                    <m:msub>
                                       <m:mi>n</m:mi>
                                       <m:mrow>
                                          <m:mi>r</m:mi>
                                          <m:mi>e</m:mi>
                                          <m:mi>p</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:msub>
                                       <m:mi>x</m:mi>
                                       <m:mrow>
                                          <m:mi>m</m:mi>
                                          <m:mi>e</m:mi>
                                          <m:mi>a</m:mi>
                                          <m:mi>n</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mi>M</m:mi>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:mn>2</m:mn>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:msub>
                                       <m:mi>n</m:mi>
                                       <m:mrow>
                                          <m:mi>r</m:mi>
                                          <m:mi>e</m:mi>
                                          <m:mi>p</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:msup>
                                       <m:mi>&#964;</m:mi>
                                       <m:mn>2</m:mn>
                                    </m:msup>
                                    <m:mo>+</m:mo>
                                    <m:msup>
                                       <m:mi>&#963;</m:mi>
                                       <m:mn>2</m:mn>
                                    </m:msup>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                              </m:mfrac>
                           </m:mrow>
                           <m:mo>)</m:mo>
                        </m:mrow>
                        <m:mo>.</m:mo>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGqbaucqGGOaakieqacqWFybawcqGG8baFcqWGdbWqcqGGPaqkcqGH9aqpdaWcaaqaaGGaciab+n8aZbqaamaabmaabaWaaOaaaeaacqaIYaGmcqGFapaCaSqabaGccqGFdpWCaiaawIcacaGLPaaadaahaaWcbeqaaiabd6gaUnaaBaaameaacqWGYbGCcqWGLbqzcqWGWbaCaeqaaaaakmaakaaabaGaemOBa42aaSbaaSqaaiabdkhaYjabdwgaLjabdchaWbqabaGccqGFepaDdaahaaWcbeqaaiabikdaYaaakiabgUcaRiab+n8aZnaaCaaaleqabaGaeGOmaidaaaqabaaaaOGagiyzauMaeiiEaGNaeiiCaaNaeyOeI0YaaeWaaeaadaWcaaqaamaaqafabaGaemiEaG3aa0baaSqaaiabdQgaQbqaaiabikdaYaaaaeaacqWGQbGAaeqaniabggHiLdaakeaacqaIYaGmcqGFdpWCdaahaaWcbeqaaiabikdaYaaaaaGccqGHRaWkdaWcaaqaaiabd2eannaaCaaaleqabaGaeGOmaidaaaGcbaGaeGOmaiJae4hXdq3aaWbaaSqabeaacqaIYaGmaaaaaaGccaGLOaGaayzkaaGaeiOkaOIagiyzauMaeiiEaGNaeiiCaa3aaeWaaeaadaWcaaqaaiabcIcaOmaalaaabaGae4hXdq3aaWbaaSqabeaacqaIYaGmaaGccqWGUbGBdaqhaaWcbaGaemOCaiNaemyzauMaemiCaahabaGaeGOmaidaaOGaemiEaG3aa0baaSqaaiabd2gaTjabdwgaLjabdggaHjabd6gaUbqaaiabikdaYaaaaOqaaiab+n8aZnaaCaaaleqabaGaeGOmaidaaaaakiabgUcaRmaalaaabaGae43Wdm3aaWbaaSqabeaacqaIYaGmaaGccqWGnbqtdaahaaWcbeqaaiabikdaYaaaaOqaaiab+r8a0naaCaaaleqabaGaeGOmaidaaaaakiabgUcaRiabikdaYiabd6gaUnaaBaaaleaacqWGYbGCcqWGLbqzcqWGWbaCaeqaaOGaemiEaG3aaSbaaSqaaiabd2gaTjabdwgaLjabdggaHjabd6gaUbqabaGccqWGnbqtcqGGPaqkaeaacqaIYaGmcqGGOaakcqWGUbGBdaWgaaWcbaGaemOCaiNaemyzauMaemiCaahabeaakiab+r8a0naaCaaaleqabaGaeGOmaidaaOGaey4kaSIae43Wdm3aaWbaaSqabeaacqaIYaGmaaGccqGGPaqkaaaacaGLOaGaayzkaaGaeiOla4caaa@ADAC@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
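            <p>The closed form above can be checked numerically. The following Python sketch is illustrative only and is not the implementation used in this work: the function names, the example values and the integration grid are assumptions made for this example. It evaluates the closed form on a logarithmic scale and compares it with a trapezoidal approximation of the integral over <it>&#956;</it>.</p>

```python
import math


def log_marginal_likelihood(x, sigma2, M, tau2):
    """Closed-form log P(X | C) for the replicates x of one sample."""
    n_rep = len(x)
    x_mean = sum(x) / n_rep
    sum_sq = sum(v * v for v in x)
    denom = n_rep * tau2 + sigma2
    # log of sigma / ((sqrt(2*pi)*sigma)^n_rep * sqrt(n_rep*tau2 + sigma2))
    log_pref = (0.5 * math.log(sigma2)
                - 0.5 * n_rep * math.log(2.0 * math.pi * sigma2)
                - 0.5 * math.log(denom))
    # argument of the first exponential
    e1 = -(sum_sq / (2.0 * sigma2) + M * M / (2.0 * tau2))
    # argument of the second exponential
    num = (tau2 * n_rep ** 2 * x_mean ** 2 / sigma2
           + sigma2 * M * M / tau2
           + 2.0 * n_rep * x_mean * M)
    e2 = num / (2.0 * denom)
    return log_pref + e1 + e2


def marginal_likelihood_numeric(x, sigma2, M, tau2, steps=20000, width=12.0):
    """Trapezoidal approximation of the integral over mu (for checking only)."""
    tau = math.sqrt(tau2)
    lower = M - width * tau
    h = 2.0 * width * tau / steps
    total = 0.0
    for i in range(steps + 1):
        mu = lower + i * h
        # log of P(mu | M, tau2)
        log_f = -0.5 * math.log(2.0 * math.pi * tau2) - (mu - M) ** 2 / (2.0 * tau2)
        # add log of P(x_j | mu, sigma2) for every replicate
        for v in x:
            log_f += (-0.5 * math.log(2.0 * math.pi * sigma2)
                      - (v - mu) ** 2 / (2.0 * sigma2))
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * math.exp(log_f)
    return total * h


# Hypothetical replicate values and parameters, chosen only for illustration.
replicates = [1.2, 0.8, 1.5]
closed = math.exp(log_marginal_likelihood(replicates, sigma2=0.25, M=1.0, tau2=1.0))
numeric = marginal_likelihood_numeric(replicates, sigma2=0.25, M=1.0, tau2=1.0)
print(closed, numeric)
```

            <p>Working with the logarithm avoids numerical underflow when the number of replicates is large; the two printed values should agree to within the accuracy of the integration grid.</p>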
            <p>Finally, given the model parameters, the new case <b><it>X</it></b> can be classified into the class that maximizes the posterior probability, which is proportional to the marginal likelihood if the classes are <it>a priori</it> equally likely.</p>
            <p>The main novelty of the method is that the classification rule, through the marginal likelihood, incorporates information on the within-sample heterogeneity. This information is expressed by the parameter <it>&#963;</it><sup>2</sup>, which can therefore guide decisions when the within-sample heterogeneity clearly differs between the classes. Standard approaches, such as the StNB classifier or quadratic discriminant analysis, can take into account only the between-sample variability, expressed in our model by the parameter <it>&#964;</it><sup>2</sup>. Moreover, since the classification rule can be calculated in closed form, it can be used in real-time applications, as can the StNB classifier.</p>
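            <p>The role of <it>&#963;</it><sup>2</sup> in the decision can be illustrated with a small Python sketch. It is an illustration under stated assumptions rather than the implementation used in this work: the function names, the class labels and all parameter values are hypothetical. The two classes share the same <it>M</it> and <it>&#964;</it><sup>2</sup> and differ only in <it>&#963;</it><sup>2</sup>, so a rule based on the replicate mean alone could not separate them.</p>

```python
import math


def log_marginal_likelihood(x, sigma2, M, tau2):
    """Closed-form log P(X | C) for the replicates x of one sample."""
    n_rep = len(x)
    x_mean = sum(x) / n_rep
    sum_sq = sum(v * v for v in x)
    denom = n_rep * tau2 + sigma2
    log_pref = (0.5 * math.log(sigma2)
                - 0.5 * n_rep * math.log(2.0 * math.pi * sigma2)
                - 0.5 * math.log(denom))
    e1 = -(sum_sq / (2.0 * sigma2) + M * M / (2.0 * tau2))
    num = (tau2 * n_rep ** 2 * x_mean ** 2 / sigma2
           + sigma2 * M * M / tau2
           + 2.0 * n_rep * x_mean * M)
    return log_pref + e1 + num / (2.0 * denom)


def classify(x, class_params, priors=None):
    """Return the maximum-posterior class and the normalized posteriors."""
    names = list(class_params)
    if priors is None:
        # equally likely classes a priori
        priors = {k: 1.0 / len(names) for k in names}
    log_post = {k: log_marginal_likelihood(x, **class_params[k]) + math.log(priors[k])
                for k in names}
    top = max(log_post.values())
    # subtract the maximum before exponentiating, for numerical stability
    unnorm = {k: math.exp(v - top) for k, v in log_post.items()}
    z = sum(unnorm.values())
    post = {k: u / z for k, u in unnorm.items()}
    return max(post, key=post.get), post


# Hypothetical classes: same M and tau2, different within-sample variance.
params = {
    "homogeneous": {"sigma2": 0.05, "M": 1.0, "tau2": 0.5},
    "heterogeneous": {"sigma2": 1.0, "M": 1.0, "tau2": 0.5},
}
tight = [0.9, 1.0, 1.1]      # replicates close to each other
spread = [-0.5, 1.0, 2.5]    # same mean, widely scattered replicates
print(classify(tight, params)[0])
print(classify(spread, params)[0])
```

            <p>Both example cases have a replicate mean equal to <it>M</it>, so the assignment is driven entirely by the spread of the replicates relative to each class's <it>&#963;</it><sup>2</sup>.</p>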
            <p>The generalization to the multivariate case, i.e. <m:math name="1471-2105-7-514-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>X</m:mi><m:mo>&#175;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWFybawgaqeaaaa@2E03@</m:annotation></m:semantics></m:math> = (<b>X</b><sup>1</sup>, <b>X</b><sup>2</sup><b>,...,X</b><sup><it>Nfeature</it></sup>), can be obtained by assuming, as in the StNB classifier, the conditional independence of the features given the class, i.e. <m:math name="1471-2105-7-514-i8" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:mover accent="true"><m:mi>X</m:mi><m:mo>&#175;</m:mo></m:mover><m:mo stretchy="false">|</m:mo><m:mi>C</m:mi><m:mo stretchy="false">)</m:mo><m:mo>=</m:mo><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8719;</m:mo><m:mrow><m:mi>l</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mrow><m:mi>N</m:mi><m:mi>f</m:mi><m:mi>e</m:mi><m:mi>a</m:mi><m:mi>t</m:mi><m:mi>u</m:mi><m:mi>r</m:mi><m:mi>e</m:mi></m:mrow></m:munderover><m:mrow><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:msup><m:mi>X</m:mi><m:mi>l</m:mi></m:msup><m:mo>|</m:mo><m:mi>C</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGqbaucqGGOaakieqacuWFybawgaqeaiabcYha8jabdoeadjabcMcaPiabg2da9maarahabaGaemiuaaLaeiikaGIae8hwaG1aaWbaaSqabeaacqWGSbaBaaGccqGG8baFcqWGdbWqcqGGPaqkaSqaaiabdYgaSjabg2da9iabigdaXaqaaiabd6eaojabdAgaMjabdwgaLjabdggaHjabdsha0jabdwha1jabdkhaYjabdwgaLbqdcqGHpis1aaaa@4CEE@</m:annotation></m:semantics></m:math></p>
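            <p>Under this conditional independence assumption, the log-likelihood of a multivariate case is the sum of the per-feature log marginal likelihoods. The Python sketch below illustrates this; it is not the implementation used in this work, and the function names, class labels and parameter values are hypothetical. Each feature may have its own number of replicates, and the class prior is counted once.</p>

```python
import math


def log_marginal_likelihood(x, sigma2, M, tau2):
    """Closed-form log P(X | C) for the replicates x of one feature."""
    n_rep = len(x)
    x_mean = sum(x) / n_rep
    sum_sq = sum(v * v for v in x)
    denom = n_rep * tau2 + sigma2
    log_pref = (0.5 * math.log(sigma2)
                - 0.5 * n_rep * math.log(2.0 * math.pi * sigma2)
                - 0.5 * math.log(denom))
    e1 = -(sum_sq / (2.0 * sigma2) + M * M / (2.0 * tau2))
    num = (tau2 * n_rep ** 2 * x_mean ** 2 / sigma2
           + sigma2 * M * M / tau2
           + 2.0 * n_rep * x_mean * M)
    return log_pref + e1 + num / (2.0 * denom)


def log_joint(features, feature_params, prior):
    """log P(C_k) plus the sum over features l of log P(X^l | C_k)."""
    total = math.log(prior)
    for x_l, p_l in zip(features, feature_params):
        total += log_marginal_likelihood(x_l, **p_l)
    return total


# Hypothetical two-feature, two-class example.
class_a = [{"sigma2": 0.1, "M": 0.0, "tau2": 0.5},
           {"sigma2": 0.1, "M": 2.0, "tau2": 0.5}]
class_b = [{"sigma2": 0.1, "M": 1.5, "tau2": 0.5},
           {"sigma2": 0.1, "M": 0.5, "tau2": 0.5}]
case = [[0.1, -0.2, 0.2], [1.9, 2.2]]   # replicates of feature 1 and feature 2

scores = {"A": log_joint(case, class_a, prior=0.5),
          "B": log_joint(case, class_b, prior=0.5)}
print(max(scores, key=scores.get))
```

            <p>Summing logarithms rather than multiplying likelihoods keeps the computation stable when the number of features is large.</p>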
            <p>In this case, the posterior probability of class <it>k </it>is</p>
            <p>
               <m:math name="1471-2105-7-514-i9" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>P</m:mi>
                        <m:mo stretchy="false">(</m:mo>
                        <m:msub>
                           <m:mi>C</m:mi>
                           <m:mi>k</m:mi>
                        </m:msub>
                        <m:mo stretchy="false">|</m:mo>
                        <m:mover accent="true">
                           <m:mi>X</m:mi>
                           <m:mo>&#175;</m:mo>
                        </m:mover>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>=</m:mo>
                        <m:mfrac>
                           <m:mrow>
                              <m:mi>P</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mover accent="true">
                                 <m:mi>X</m:mi>
                                 <m:mo>&#175;</m:mo>
                              </m:mover>
                              <m:mo>|</m:mo>
                              <m:msub>
                                 <m:mi>C</m:mi>
                                 <m:mi>k</m:mi>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mi>P</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:msub>
                                 <m:mi>C</m:mi>
                                 <m:mi>k</m:mi>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                           </m:mrow>
                           <m:mrow>
                              <m:mi>P</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mover accent="true">
                                 <m:mi>X</m:mi>
                                 <m:mo>&#175;</m:mo>
                              </m:mover>
                              <m:mo stretchy="false">)</m:mo>
                           </m:mrow>
                        </m:mfrac>
                        <m:mo>&#8733;</m:mo>
                        <m:mstyle displaystyle="true">
                           <m:munderover>
                              <m:mo>&#8719;</m:mo>
                              <m:mrow>
                                 <m:mi>l</m:mi>
                                 <m:mo>=</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>N</m:mi>
                                 <m:mi>f</m:mi>
                                 <m:mi>e</m:mi>
                                 <m:mi>a</m:mi>
                                 <m:mi>t</m:mi>
                                 <m:mi>u</m:mi>
                                 <m:mi>r</m:mi>
                                 <m:mi>e</m:mi>
                              </m:mrow>
                           </m:munderover>
                           <m:mrow>
                              <m:mi>P</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:msup>
                                 <m:mi>X</m:mi>
                                 <m:mi>l</m:mi>
                              </m:msup>
                              <m:mo>|</m:mo>
                              <m:msub>
                                 <m:mi>C</m:mi>
                                 <m:mi>k</m:mi>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mi>P</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:msub>
                                 <m:mi>C</m:mi>
                                 <m:mi>k</m:mi>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                           </m:mrow>
                        </m:mstyle>
                        <m:mo>.</m:mo>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGqbaucqGGOaakcqWGdbWqdaWgaaWcbaGaem4AaSgabeaakiabcYha8Hqabiqb=HfayzaaraGaeiykaKIaeyypa0ZaaSaaaeaacqWGqbaucqGGOaakcuWFybawgaqeaiabcYha8jabdoeadnaaBaaaleaacqWGRbWAaeqaaOGaeiykaKIaemiuaaLaeiikaGIaem4qam0aaSbaaSqaaiabdUgaRbqabaGccqGGPaqkaeaacqWGqbaucqGGOaakcuWFybawgaqeaiabcMcaPaaacqGHDisTdaqeWbqaaiabdcfaqjabcIcaOiab=HfaynaaCaaaleqabaGaemiBaWgaaOGaeiiFaWNaem4qam0aaSbaaSqaaiabdUgaRbqabaGccqGGPaqkcqWGqbaucqGGOaakcqWGdbWqdaWgaaWcbaGaem4AaSgabeaakiabcMcaPaWcbaGaemiBaWMaeyypa0JaeGymaedabaGaemOta4KaemOzayMaemyzauMaemyyaeMaemiDaqNaemyDauNaemOCaiNaemyzauganiabg+GivdGccqGGUaGlaaa@6A08@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
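            <p>As an illustration of the factorization above, the following minimal Python sketch computes class posteriors for a plain Gaussian na&#239;ve Bayes model with one value per feature. The function names and the toy parameter values are assumptions made for illustration; it does not include the hierarchical treatment of replicates and is not the implementation used in this paper. The per-feature likelihoods are accumulated in log space and then normalized, which avoids numerical underflow when many features are multiplied.</p>
            <p><![CDATA[
```python
import math


def gaussian_logpdf(x, mean, var):
    """Log density of a normal distribution with the given mean and variance."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def naive_bayes_posterior(x, priors, means, variances):
    """Posterior P(C_k | x) assuming independent Gaussian features per class.

    x         : list of feature values, one per feature
    priors    : list of class priors P(C_k)
    means     : means[k][l] is the mean of feature l in class k
    variances : variances[k][l] is the variance of feature l in class k
    """
    log_joint = []
    for k in range(len(priors)):
        value = math.log(priors[k])
        for l, x_l in enumerate(x):
            value += gaussian_logpdf(x_l, means[k][l], variances[k][l])
        log_joint.append(value)
    # Normalize in a numerically stable way (subtract the maximum first).
    top = max(log_joint)
    weights = [math.exp(v - top) for v in log_joint]
    total = sum(weights)
    return [w / total for w in weights]


# Toy example (hypothetical values): two classes, two features.
posterior = naive_bayes_posterior(
    x=[95.0, 12.0],
    priors=[0.5, 0.5],
    means=[[90.0, 10.0], [140.0, 20.0]],
    variances=[[300.0, 25.0], [600.0, 25.0]],
)
```
]]></p>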
         </sec>
         <sec>
            <st>
               <p>Results on simulated and real data</p>
            </st>
            <p>We present results obtained on both computationally generated datasets and a real TMA protein expression dataset.</p>
            <p>A first set of simulated data was generated to represent the most favorable scenario, by incrementally varying the within-sample variance <it>&#963;</it><sup>2 </sup>for one class only. A second set of normally distributed data was generated using a variety of parameter values (see <supplr sid="S2">Additional file 2</supplr>). The training and classification properties of the proposed algorithm were evaluated in both cases. Finally, we analyzed a set of real TMA protein expression data to evaluate the potential of the method on real data.</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>Parameter values used to generate simulated data.</p>
               </text>
               <file name="1471-2105-7-514-S2.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>In both studies, the Hierarchical Na&#239;ve Bayes (HierNB) classifier is compared with the StNB classifier. Because HierNB is an extension of StNB designed to cope with replicate measurements, this comparison highlights the advantage of the new approach without introducing additional bias due to different classification techniques. The real data were also analyzed with several other classification methods; those results are reported in <supplr sid="S3">Additional file 3</supplr>.</p>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p>Comparison of the proposed approach with other classification strategies on the TMA protein expression dataset.</p>
               </text>
               <file name="1471-2105-7-514-S3.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The classifiers are compared on the basis of two indexes: the accuracy (defined as the ratio of correctly classified cases to all classified cases) and the Brier score <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, which measures the difference between the predicted probability of an event and its occurrence, coded as 1 if the event occurred and 0 otherwise. Confidence intervals for the accuracy are obtained by repeating learning and classification several times after a suitable data randomization <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Moreover, for the real TMA dataset, we use sensitivity (defined as the ratio of correctly classified positive cases to all positive cases), specificity (defined as the ratio of correctly classified negative cases to all negative cases) and the area under the ROC curve <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
            <p>The data analysis reported in this paper has been implemented in the R statistical package <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>.</p>
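            <p>The indexes defined above can be sketched in a few lines of Python. This is an illustrative sketch, not the R code used for the analysis, and the function names are hypothetical. The Brier score is written here in the form that sums the squared differences over both classes; this form is an assumption, chosen because it gives 0.5 for a classifier that always predicts a probability of 0.5, which is consistent with the values close to 0.49 reported for a near-chance classifier.</p>
            <p><![CDATA[
```python
def accuracy(y_true, y_pred):
    """Ratio of correctly classified cases to all classified cases."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def brier_score(y_true, prob_positive, positive=2):
    """Two-class Brier score, summed over both classes (assumed form).

    prob_positive[i] is the predicted probability that case i is positive.
    """
    total = 0.0
    for t, p in zip(y_true, prob_positive):
        outcome = 1.0 if t == positive else 0.0
        # Squared error for the positive class plus the negative class.
        total += (p - outcome) ** 2 + ((1.0 - p) - (1.0 - outcome)) ** 2
    return total / len(y_true)


def sensitivity_specificity(y_true, y_pred, positive=2):
    """Sensitivity and specificity; assumes both classes occur in y_true."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p != positive)
    n_pos = sum(1 for t in y_true if t == positive)
    n_neg = len(y_true) - n_pos
    return tp / n_pos, tn / n_neg
```
]]></p>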
         </sec>
         <sec>
            <st>
               <p>Simulated data</p>
            </st>
            <sec>
               <st>
                  <p>Data description</p>
               </st>
               <p>We generated simulated datasets with 4000 patients (2000 for class 1 and 2000 for class 2), 5 replicates each and a number of independent features ranging from 1 to 10. For each feature, the values of the five replicates were randomly drawn from a normal distribution with fixed variance <it>&#963;</it><sup>2 </sup>(dependent on the class and on the feature) and with a mean that was itself drawn from a normal distribution with fixed mean M and variance <it>&#964;</it><sup>2</sup>, again dependent on the class and on the feature.</p>
               <p>A first set of experiments was run for the univariate case (one feature), with the two classes sharing essentially the same parameter values except for the within-sample variance <it>&#963;</it><sup>2</sup>, which was set larger in the second class. In particular, the classes had similar population means M and exactly the same class variance <it>&#964;</it><sup>2 </sup>(M<sub>1 </sub>= 100, M<sub>2 </sub>= 105, <it>&#964;</it><sub>1</sub><sup>2 </sup>= <it>&#964;</it><sub>2</sub><sup>2 </sup>= 300, <it>&#963;</it><sub>1</sub><sup>2 </sup>= 10). The parameter <it>&#963;</it><sub>2</sub><sup>2 </sup>varied from 15 to 75.</p>
               <p>A second set of experiments was run simulating both univariate case studies and multivariate case studies (here we report results for two, three and ten features). The values of the first feature were generated for all these experiments using 90, 300 and 100 as M, <it>&#964;</it><sup>2 </sup>and <it>&#963;</it><sup>2 </sup>for class 1 and 140, 600 and 400 as M, <it>&#964;</it><sup>2 </sup>and <it>&#963;</it><sup>2 </sup>for class 2.</p>
               <p>The complete set of parameter values used in the multivariate experiments is reported in <supplr sid="S2">Additional file 2</supplr>.</p>
               <p>We assessed the performance of the two classifiers by dividing each dataset equally into a training set and a test set.</p>
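               <p>The two-level sampling scheme described in this section can be sketched as follows for one feature, using the parameter values of the first set of experiments with <it>&#963;</it><sub>2</sub><sup>2 </sup>= 45. This is a minimal Python sketch rather than the original R code; the function names, the random seed and the per-class shuffling before the split are assumptions made for illustration.</p>
               <p><![CDATA[
```python
import random


def simulate_class(n_patients, n_replicates, M, tau2, sigma2, rng):
    """Draw replicate measurements of one feature for one class."""
    cases = []
    for _ in range(n_patients):
        # Patient-level mean: normal with mean M and variance tau2.
        mu = rng.gauss(M, tau2 ** 0.5)
        # Replicates: normal around the patient mean with variance sigma2.
        cases.append([rng.gauss(mu, sigma2 ** 0.5) for _ in range(n_replicates)])
    return cases


def mean_within_variance(cases):
    """Average of the unbiased within-case sample variances."""
    total = 0.0
    for reps in cases:
        m = sum(reps) / len(reps)
        total += sum((r - m) ** 2 for r in reps) / (len(reps) - 1)
    return total / len(cases)


def split_half(cases, rng):
    """Shuffle the cases and split them equally into training and test sets."""
    shuffled = list(cases)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:]


rng = random.Random(0)  # arbitrary fixed seed, for reproducibility only
class1 = simulate_class(2000, 5, M=100.0, tau2=300.0, sigma2=10.0, rng=rng)
class2 = simulate_class(2000, 5, M=105.0, tau2=300.0, sigma2=45.0, rng=rng)
train1, test1 = split_half(class1, rng)
train2, test2 = split_half(class2, rng)
```
]]></p>
               <p>In this sketch the two classes differ mainly in their within-sample variance, which <code>mean_within_variance</code> recovers from the replicates; a classifier that only sees one summary value per patient has no access to that information.</p>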
            </sec>
            <sec>
               <st>
                  <p>Data results</p>
               </st>
               <p>The results of the first experiment are shown in Table <tblr tid="T1">1</tblr>. The HierNB model has a large advantage, in terms of both accuracy and Brier score, over the standard approach, which can make essentially no distinction between the two classes because they are not well separated: the accuracy of HierNB rises from 0.620 to 0.899, whereas that of StNB stays at about 0.56. The advantage of HierNB grows as the difference between the replicate variances of the two classes increases. This first experiment shows that the proposed method extends current classification approaches by exploiting the information carried by within-sample heterogeneity.</p>
               <tbl id="T1">
                  <title>
                     <p>Table 1</p>
                  </title>
                  <caption>
                     <p>Results on simulated data for one feature with different levels of within-sample heterogeneity in the two classes.</p>
                  </caption>
                  <tblbdy cols="6">
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c cspan="2" ca="center">
                           <p>HierNB Classifier</p>
                        </c>
                        <c cspan="2" ca="center">
                           <p>StNB Classifier</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="6">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Exp</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>&#963;</it>
                              <sub>2</sub>
                              <sup>2</sup>
                           </p>
                        </c>
                        <c ca="center">
                           <p>Acc</p>
                        </c>
                        <c ca="center">
                           <p>Brier</p>
                        </c>
                        <c ca="center">
                           <p>Acc</p>
                        </c>
                        <c ca="center">
                           <p>Brier</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="6">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>15</p>
                        </c>
                        <c ca="center">
                           <p>0.620 [0.604 0.636]</p>
                        </c>
                        <c ca="center">
                           <p>0.449</p>
                        </c>
                        <c ca="center">
                           <p>0.559 [0.539 0.580]</p>
                        </c>
                        <c ca="center">
                           <p>0.490</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>30</p>
                        </c>
                        <c ca="center">
                           <p>0.762 [0.740 0.790]</p>
                        </c>
                        <c ca="center">
                           <p>0.307</p>
                        </c>
                        <c ca="center">
                           <p>0.560 [0.540 0.580]</p>
                        </c>
                        <c ca="center">
                           <p>0.490</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>3</p>
                        </c>
                        <c ca="center">
                           <p>45</p>
                        </c>
                        <c ca="center">
                           <p>0.830 [0.813 0.849]</p>
                        </c>
                        <c ca="center">
                           <p>0.228</p>
                        </c>
                        <c ca="center">
                           <p>0.554 [0.537 0.570]</p>
                        </c>
                        <c ca="center">
                           <p>0.490</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>4</p>
                        </c>
                        <c ca="center">
                           <p>60</p>
                        </c>
                        <c ca="center">
                           <p>0.878 [0.866 0.888]</p>
                        </c>
                        <c ca="center">
                           <p>0.176</p>
                        </c>
                        <c ca="center">
                           <p>0.556 [0.531 0.582]</p>
                        </c>
                        <c ca="center">
                           <p>0.490</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>5</p>
                        </c>
                        <c ca="center">
                           <p>75</p>
                        </c>
                        <c ca="center">
                           <p>0.899 [0.883 0.914]</p>
                        </c>
                        <c ca="center">
                           <p>0.147</p>
                        </c>
                        <c ca="center">
                           <p>0.560 [0.534 0.586]</p>
                        </c>
                        <c ca="center">
                           <p>0.490</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>Exp = experiment number, Acc = accuracy, Brier = Brier score. The 95% confidence intervals for the estimated accuracy are given in brackets.</p>
                  </tblfn>
               </tbl>
               <p>Results for the second set of four experiments are presented in Table <tblr tid="T2">2</tblr>. For each experiment we report the number of features, the accuracy and Brier Score for both the HierNB and the StNB classifier.</p>
               <tbl id="T2">
                  <title>
                     <p>Table 2</p>
                  </title>
                  <caption>
                     <p>Results on simulated data: experiments with an increasing number of features.</p>
                  </caption>
                  <tblbdy cols="6">
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c cspan="2" ca="center">
                           <p>HierNB Classifier</p>
                        </c>
                        <c cspan="2" ca="center">
                           <p>StNB Classifier</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="6">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Exp</p>
                        </c>
                        <c ca="center">
                           <p>N Feat.</p>
                        </c>
                        <c ca="center">
                           <p>Acc</p>
                        </c>
                        <c ca="center">
                           <p>Brier</p>
                        </c>
                        <c ca="center">
                           <p>Acc</p>
                        </c>
                        <c ca="center">
                           <p>Brier</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="6">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>0.925 [0.917, 0.933]</p>
                        </c>
                        <c ca="center">
                           <p>0.112</p>
                        </c>
                        <c ca="center">
                           <p>0.874 [0.864, 0.884]</p>
                        </c>
                        <c ca="center">
                           <p>0.184</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>0.966 [0.960, 0.971]</p>
                        </c>
                        <c ca="center">
                           <p>0.052</p>
                        </c>
                        <c ca="center">
                           <p>0.921 [0.912, 0.929]</p>
                        </c>
                        <c ca="center">
                           <p>0.118</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>3</p>
                        </c>
                        <c ca="center">
                           <p>3</p>
                        </c>
                        <c ca="center">
                           <p>0.987 [0.983, 0.990]</p>
                        </c>
                        <c ca="center">
                           <p>0.020</p>
                        </c>
                        <c ca="center">
                           <p>0.946 [0.938, 0.952]</p>
                        </c>
                        <c ca="center">
                           <p>0.082</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>4</p>
                        </c>
                        <c ca="center">
                           <p>10</p>
                        </c>
                        <c ca="center">
                           <p>0.998 [0.997, 0.999]</p>
                        </c>
                        <c ca="center">
                           <p>0.002</p>
                        </c>
                        <c ca="center">
                           <p>0.985 [0.981, 0.989]</p>
                        </c>
                        <c ca="center">
                           <p>0.023</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>Exp = experiment number, N Feat. = number of features, Acc = accuracy, Brier = Brier score. The 95% confidence intervals for the estimated accuracy are given in brackets.</p>
                  </tblfn>
               </tbl>
               <p>The HierNB classifier performs better in all the experiments, showing higher accuracy and lower Brier score.</p>
               <p>Figure <figr fid="F2">2</figr> shows the posterior probabilities of both classifiers evaluated on the test dataset of one experiment (Table <tblr tid="T2">2</tblr>, experiment 3).</p>
               <fig id="F2">
                  <title>
                     <p>Figure 2</p>
                  </title>
                  <caption>
                     <p>Posterior probabilities of the three feature simulated experiment</p>
                  </caption>
                  <text>
                     <p><b>Posterior probabilities of the three feature simulated experiment</b>. Histograms of posterior probabilities of the three feature experiment (Exp.3, Table 2) on a simulated dataset. Panels A and B show the results obtained with the HierNB classifier for class 1 and 2 respectively; panels C and D show results obtained with the StNB classifier. In the upper right corner of each panel the frequency of the bin corresponding to the highest posterior probability range is reported.</p>
                  </text>
                  <graphic file="1471-2105-7-514-2"/>
               </fig>
               <p>The HierNB classifier shows a better separation between the two classes, not only in terms of accuracy but also in terms of the credibility of the classification (as highlighted by the Brier score). The confidence intervals of the estimated accuracy confirm that the proposed method outperforms the standard classifier in all the experiments. This second set of experiments shows that, even in a complex setting where the classes overlap because of the within- and between-sample variability, the method performs at least as well as the StNB.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Real data</p>
            </st>
            <sec>
               <st>
                  <p>Data description</p>
               </st>
               <p>We used a research dataset obtained from a recently constructed prostate progression TMA, as previously described <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
               <p>The TMA was constructed to test molecular differences between localized and metastatic prostate cancer samples, on a total of 288 core biopsies. In this paper, we explore the expression of two proteins, EZH2 and AMACR, which are known to be differentially expressed in non-aggressive or localized tumors (class 1 or negative class) versus aggressive or metastatic prostate cancers (class 2 or positive class). The Polycomb Group protein EZH2 is over-expressed in hormone-refractory and metastatic prostate cancer <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> and may play a role in the progression of prostate cancer, as well as serve as a marker distinguishing indolent prostate cancers from those at risk of lethal progression. <it>&#945;</it>-Methylacyl CoA racemase (AMACR) is a biomarker that was identified by both differential display and expression array analysis as a gene abundantly expressed in prostate cancer relative to benign prostate epithelium <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. AMACR is used as a clinical marker to diagnose prostate cancer <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Prostate cancers that produce lower levels of AMACR have a worse clinical outcome even after controlling for other clinical parameters <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>.</p>
               <p>The TMA dataset includes 72 patients (samples), 36 for each class, each case having four replicates. After the processing of the TMA slides, 35 and 34 cases were suitable for analysis for class 1 and class 2 respectively (69 cases), and each case was characterized by one to four measurements of each of the two proteins. The assumption that the data have a Gaussian distribution given the class was verified by applying the Kolmogorov-Smirnov test. Table <tblr tid="T3">3</tblr> describes the dataset through the model parameters <it>M </it>and <it>&#964;</it><sup>2 </sup>estimated according to the StNB and <it>&#963;</it><sup>2 </sup>estimated as the average within-case variance for the two classes. Since the estimate of <it>&#963;</it><sup>2 </sup>differs between the two classes, this classification problem may benefit from the use of the HierNB approach.</p>
               <tbl id="T3">
                  <title>
                     <p>Table 3</p>
                  </title>
                  <caption>
                     <p>TMA data description: model parameters of localized (class 1) and metastatic prostate cancer tumors (class 2) for two proteins.</p>
                  </caption>
                  <tblbdy cols="7">
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c cspan="2" ca="center">
                           <p>M</p>
                        </c>
                        <c cspan="2" ca="center">
                           <p>
                              <it>&#964;</it>
                              <sup>2</sup>
                           </p>
                        </c>
                        <c cspan="2" ca="center">
                           <p>
                              <it>&#963;</it>
                              <sup>2</sup>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="7">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Class</p>
                        </c>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="7">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>AMACR</p>
                        </c>
                        <c ca="center">
                           <p>155.2</p>
                        </c>
                        <c ca="center">
                           <p>148.8</p>
                        </c>
                        <c ca="center">
                           <p>201.2</p>
                        </c>
                        <c ca="center">
                           <p>208.8</p>
                        </c>
                        <c ca="center">
                           <p>49.1</p>
                        </c>
                        <c ca="center">
                           <p>85.7</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>EZH2Int</p>
                        </c>
                        <c ca="center">
                           <p>146.2</p>
                        </c>
                        <c ca="center">
                           <p>141.6</p>
                        </c>
                        <c ca="center">
                           <p>86.7</p>
                        </c>
                        <c ca="center">
                           <p>107.9</p>
                        </c>
                        <c ca="center">
                           <p>135.7</p>
                        </c>
                        <c ca="center">
                           <p>53.9</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>M = class mean; <it>&#964;</it><sup>2 </sup>= class variance; <it>&#963;</it><sup>2 </sup>= averaged within sample variance.</p>
                  </tblfn>
               </tbl>
               <p>We assessed the performance of the two classifiers by repeating a 10-fold cross-validation procedure one hundred times, each time with a different fold randomization, and then averaging the results.</p>
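               <p>The repeated cross-validation procedure can be sketched as below. This Python sketch is an illustration under stated assumptions, not the original R code: the function names are hypothetical, the folds are not stratified by class, and <code>evaluate</code> stands for any user-supplied function that trains a classifier on the training indices and returns a score on the held-out indices.</p>
               <p><![CDATA[
```python
import random


def kfold_indices(n_cases, k, rng):
    """Randomly partition the case indices 0..n_cases-1 into k folds."""
    idx = list(range(n_cases))
    rng.shuffle(idx)
    # Taking every k-th index gives folds whose sizes differ by at most one.
    return [idx[i::k] for i in range(k)]


def repeated_cv(n_cases, k, n_repeats, evaluate, seed=0):
    """Run k-fold cross-validation n_repeats times with new random folds.

    evaluate(train_idx, test_idx) must return a numeric score.
    Returns the average over repeats and the list of per-repeat averages.
    """
    rng = random.Random(seed)
    per_repeat = []
    for _ in range(n_repeats):
        fold_scores = []
        for test_idx in kfold_indices(n_cases, k, rng):
            held_out = set(test_idx)
            train_idx = [i for i in range(n_cases) if i not in held_out]
            fold_scores.append(evaluate(train_idx, test_idx))
        per_repeat.append(sum(fold_scores) / k)
    return sum(per_repeat) / n_repeats, per_repeat


# Dummy evaluation (placeholder): fraction of all 69 cases held out in a fold.
average, per_repeat = repeated_cv(
    n_cases=69, k=10, n_repeats=100,
    evaluate=lambda train_idx, test_idx: len(test_idx) / 69,
)
```
]]></p>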
            </sec>
            <sec>
               <st>
                  <p>Data results</p>
               </st>
               <p>Results obtained for the prostate cancer dataset are presented in Table <tblr tid="T4">4</tblr>. We report accuracy, specificity, sensitivity, area under the ROC curve (ROC curves are shown in the <supplr sid="S4">Additional file 4</supplr>) and Brier Score for the HierNB and for the StNB classifier. In Figure <figr fid="F3">3</figr>, we report histograms of posterior probabilities obtained by the two models.</p>
               <tbl id="T4">
                  <title>
                     <p>Table 4</p>
                  </title>
                  <caption>
                     <p>Results on TMA dataset for the two proteins.</p>
                  </caption>
                  <tblbdy cols="6">
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Acc</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Spec</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Sens</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>AUC</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Brier</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="6">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>
                              <b>HierNB Model</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.65 [0.62&#8211;0.68]</p>
                        </c>
                        <c ca="center">
                           <p>0.71 [0.66&#8211;0.74]</p>
                        </c>
                        <c ca="center">
                           <p>0.60 [0.56&#8211;0.62]</p>
                        </c>
                        <c ca="center">
                           <p>0.69 [0.689&#8211;0.693]</p>
                        </c>
                        <c ca="center">
                           <p>0.41 [0.39&#8211;0.42]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>
                              <b>StNB Model</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.58 [0.54&#8211;0.61]</p>
                        </c>
                        <c ca="center">
                           <p>0.58 [0.54&#8211;0.62]</p>
                        </c>
                        <c ca="center">
                           <p>0.57 [0.53&#8211;0.61]</p>
                        </c>
                        <c ca="center">
                           <p>0.62 [0.617&#8211;0.622]</p>
                        </c>
                        <c ca="center">
                           <p>0.47 [0.46&#8211;0.48]</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>Acc = accuracy, Spec = specificity, Sens = sensitivity, AUC = area under the ROC curve, Brier = Brier score. The 95% confidence intervals of the estimates are given in brackets.</p>
                  </tblfn>
               </tbl>
               <suppl id="S4">
                  <title>
                     <p>Additional file 4</p>
                  </title>
                  <text>
                     <p>ROC curves for the TMA protein expression dataset, computed from 100 runs of 10-fold cross-validation (Table <tblr tid="T4">4</tblr> in the paper).</p>
                  </text>
                  <file name="1471-2105-7-514-S4.doc">
                     <p>Click here for file</p>
                  </file>
               </suppl>
               <fig id="F3">
                  <title>
                     <p>Figure 3</p>
                  </title>
                  <caption>
                     <p>Posterior probabilities of prostate cancer cases</p>
                  </caption>
                  <text>
                     <p><b>Posterior probabilities of prostate cancer cases</b>. Histograms of the posterior probabilities of prostate cancer cases. Panels A and B show the results evaluated with the HierNB classifier for class 1 (localized tumors) and 2 (aggressive tumors) respectively; panels C and D show results evaluated with the StNB classifier.</p>
                  </text>
                  <graphic file="1471-2105-7-514-3"/>
               </fig>
               <p>The HierNB model outperforms the StNB model on all the evaluation measures considered. In particular, both accuracy and the Brier score are significantly better for HierNB than for StNB: their 95% confidence intervals do not overlap. That the classification accuracy in distinguishing localized prostate cancer from metastases is only about 60% is not surprising; the complexity of this classification problem has recently been discussed by Bismar, Demichelis et al. <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
               <p>The HierNB classifier also shows a significantly higher specificity and a similar sensitivity. The histograms of posterior probabilities (Figure <figr fid="F3">3</figr>) show that the HierNB method performs better on metastatic cancer (panel B) than the StNB approach, which classifies many patients with uncertainty (panel D).</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Few studies have dealt with the problem of uncertain data in classification <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. In bioinformatics, several recent studies have addressed uncertain data, in particular DNA microarray data. However, their main emphasis is on managing uncertainty when applying feature selection strategies <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B31">31</abbr></abbrgrp>. Another example of handling uncertainty in classification is provided by Bhattacharyya et al., who characterize each data point with an uncertainty model based on ellipsoids <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>.</p>
         <p>In this paper we have proposed a classifier based on Bayesian hierarchical models and have applied it to TMA datasets. The approach embeds the tumor variability (heterogeneity of protein levels across tumor tissue) in the classification model, using the tuple of protein level measurements of each case instead of a single representative value, as done by conventional approaches.</p>
         <p>Bayesian hierarchical models have two main advantages with respect to other methods: i) they coherently manage uncertainty in the framework of probability theory; ii) they make explicit the assumptions on which the model relies. The implementation of the Bayesian classifier presented in this paper is an extension of the well-known Na&#239;ve Bayes classifier. It assumes that all the attributes are mutually independent given the class. Moreover, we have also assumed that the probability distributions are conditionally Gaussian.</p>
         <p>Preliminary performance tests on simulated data give us some clues about the applicability of the proposed model. With respect to classification, we observed that when the classes have similar within-sample variances the two approaches do not differ in classification accuracy, as expected. However, the posterior distributions differ increasingly as the difference between the within-sample variabilities of the classes grows, e.g. <it>&#963;</it><sub><it>1</it></sub><sup><it>2</it></sup>&lt;&lt;<it>&#963;</it><sub><it>2</it></sub><sup><it>2</it></sup>. In this case the HierNB model outperforms the standard approach.</p>
         <p>On real TMA data, we saw that the hierarchical model may improve specificity, which is part of the clinical question. By accounting for the spread of the replicate measurements, the model uses the information available at every level and may thus provide insights into the biology of the tumor samples being analyzed. Interestingly, in this case the classification model also improves the understanding of the data, by highlighting whether or not the heterogeneity of the tumor tissue sample is critical in the decision-making process. Moreover, hierarchical models are also able to exploit the information on the lack of heterogeneity.</p>
         <p>Heterogeneous and homogeneous protein expression may reflect different biological processes occurring in tumors. Exploiting these data may be critical in understanding the underlying biology.</p>
         <p>Finally, the HierNB shows interesting robustness properties when the results obtained in the data-rich case of the simulation study (4000 samples) are compared with those of the relatively data-poor real one (69 samples). The real case is much more difficult than the simulated one, owing to the lower number of samples and the smaller difference between the mean marker values of the two classes. This difficulty results in more widely spread posterior distributions, lower accuracy and higher Brier scores for all the tested classification models. However, in both the simulated and the real cases the HierNB shows nearly the same gain in accuracy over the StNB, taking advantage of the within-sample variability information to better separate the classes.</p>
         <p>From a practical point of view, in TMA experiments in which hundreds of cases are evaluated and only a fraction do not fit well into one class or another, one can imagine that by using the hierarchical Na&#239;ve Bayes model, cases with a posterior probability within a certain window around 0.5 would be classified as ambiguous and would require re-review.</p>
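         <p>The re-review rule described above can be sketched as follows. This is only an illustration: the function name <it>flag_case</it> and the half-width of 0.1 are assumptions made for the example, since the text does not specify the size of the window around 0.5.</p>

```python
def flag_case(posterior_class2, half_width=0.1):
    """Triage one case from its posterior probability of class 2.

    half_width is a hypothetical setting: the window around 0.5
    is not specified in the paper.
    """
    if abs(posterior_class2 - 0.5) > half_width:
        # Clearly outside the window: assign the more probable class.
        return "class 2" if posterior_class2 > 0.5 else "class 1"
    # Inside the window: mark the case for re-review.
    return "ambiguous"


# Example: three cases with different posterior probabilities of class 2.
labels = [flag_case(p) for p in (0.05, 0.55, 0.90)]
```

         <p>In practice the width of the window would have to be chosen with the pathologists, trading the number of cases sent to re-review against the risk of misclassification.</p>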
         <p>From a methodological point of view, in order to generalize the proposed approach, we are now working on the following aspects:</p>
         <p>1) Learning: while the classification step fully follows the Bayesian approach, the learning phase of the proposed method is not fully Bayesian. This choice was motivated by the need to learn the required probability distributions quickly from potentially large datasets. However, it is also possible to resort to a more rigorous learning procedure, at the price of implementing iterative procedures such as Expectation Maximization (EM) or Markov chain Monte Carlo (MCMC) approaches <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Future versions of our tool will also include estimation algorithms of this kind <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>.</p>
         <p>2) Non-Gaussian distributions: we have also implemented a version of the hierarchical Na&#239;ve Bayes approach for discrete variables, relying on multinomial and Dirichlet probability distributions <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>. This extension allows arbitrary data distributions to be managed after proper discretization.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We have proposed a novel approach for dealing with uncertain data in classification, with an application to tissue microarrays. The distinctive property of the proposed model is its capability of handling data heterogeneity in a sound probabilistic way, without additional computational burden with respect to the standard Na&#239;ve Bayes approach. Based on the results obtained on simulated and real data, we can conclude that the proposed approach is particularly useful when the within-sample heterogeneity differs between classes. Its application to TMA data has been shown to provide more insight into the information available in the database and to improve the decision-making process even in the presence of a very limited number of features. The proposed model can be conveniently applied and extended to other application domains in bioinformatics.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Tissue microarray technology</p>
            </st>
            <p>TMAs were recently developed to facilitate tissue-based research <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. TMAs can be used for any type of study where standard tissue slides had previously been used. However, they present numerous advantages <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. TMAs allow for screening of a large number of tissue samples under similar experimental conditions (large scale process) while conserving tissue and resources. Typically a TMA block contains up to 600 tissue biopsies, depending on the needle diameter used to transfer the samples (from 0.6 to 2 mm). TMA sections are serially obtained at 4&#8211;5 micrometers thickness with a microtome. TMA sections are then processed as conventional histological tissue sections.</p>
            <p>Cylindrical tissue biopsies are transferred with a biopsy needle from carefully selected, morphologically representative areas of the original paraffin blocks (donor blocks), each containing tumor tissue from a patient. Core tissue biopsies are then arrayed into a new "recipient" paraffin block with a tissue arrayer, following a precise spacing pattern along the x and y axes that generates a regular matrix of cores. Typically, more than one biopsy from each patient is included in a TMA block; replicates allow for good representation of the patient's tumor and potentially allow heterogeneous expression of markers (e.g. proteins) of interest within the tumor to be detected. How well TMA samples represent entire tumors has been the focus of several recent studies <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. The results of those studies depend on tumor types and study purposes. A biomarker with homogeneous expression throughout the entire tumor will not require as many replicates as a biomarker that is only focally expressed by the target tissue.</p>
         </sec>
         <sec>
            <st>
               <p>Learning the Hierarchical Na&#239;ve Bayes Model</p>
            </st>
            <p>The classification algorithm described in the Results section assumes that the model parameters (<it>M</it>, <it>&#964;</it><sup><it>2</it></sup>, <it>&#963;</it><sup><it>2</it></sup>) have been estimated from a training dataset. In the implementation of the method presented in this paper, we have adopted an approximation of the maximum likelihood estimation approach, called empirical learning <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>.</p>
            <p>Following this approach, the within-sample means and variances are estimated as:</p>
            <p><m:math name="1471-2105-7-514-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>&#956;</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub><m:mo>=</m:mo><m:mfrac><m:mrow><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mi>j</m:mi><m:mrow><m:msub><m:mi>n</m:mi><m:mrow><m:mi>r</m:mi><m:mi>e</m:mi><m:mi>p</m:mi></m:mrow></m:msub></m:mrow></m:munderover><m:mrow><m:msub><m:mi>x</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow></m:mstyle></m:mrow><m:mrow><m:msub><m:mi>n</m:mi><m:mrow><m:mi>r</m:mi><m:mi>e</m:mi><m:msub><m:mi>p</m:mi><m:mi>i</m:mi></m:msub></m:mrow></m:msub></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWF8oqBgaqcamaaBaaaleaacqWGPbqAaeqaaOGaeyypa0ZaaSaaaeaadaaeWbqaaiabdIha4naaBaaaleaacqWGPbqAcqWGQbGAaeqaaaqaaiabdQgaQbqaaiabd6gaUnaaBaaameaacqWGYbGCcqWGLbqzcqWGWbaCaeqaaaqdcqGHris5aaGcbaGaemOBa42aaSbaaSqaaiabdkhaYjabdwgaLjabdchaWnaaBaaameaacqWGPbqAaeqaaaWcbeaaaaaaaa@4623@</m:annotation></m:semantics></m:math> and <m:math name="1471-2105-7-514-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msup><m:mover accent="true"><m:mi>&#963;</m:mi><m:mo>^</m:mo></m:mover><m:mn>2</m:mn></m:msup><m:mo>=</m:mo><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mrow><m:msub><m:mi>N</m:mi><m:mrow><m:msub><m:mi>C</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub></m:mrow></m:munderover><m:mrow><m:mfrac><m:mrow><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mi>j</m:mi><m:mrow><m:msub><m:mi>n</m:mi><m:mrow><m:mi>r</m:mi><m:mi>e</m:mi><m:mi>p</m:mi></m:mrow></m:msub></m:mrow></m:munderover><m:mrow><m:msup><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mi>x</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub><m:mo>&#8722;</m:mo><m:msub><m:mover accent="true"><m:mi>&#956;</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub><m:mo stretchy="false">)</m:mo></m:mrow><m:mn>2</m:mn></m:msup></m:mrow></m:mstyle></m:mrow><m:mrow><m:msub><m:mi>N</m:mi><m:mrow><m:msub><m:mi>C</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub><m:msub><m:mi>n</m:mi><m:mrow><m:mi>r</m:mi><m:mi>e</m:mi><m:msub><m:mi>p</m:mi><m:mi>i</m:mi></m:msub></m:mrow></m:msub></m:mrow></m:mfrac></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaqcamaaCaaaleqabaGaeGOmaidaaOGaeyypa0ZaaabCaeaadaWcaaqaamaaqahabaGaeiikaGIaemiEaG3aaSbaaSqaaiabdMgaPjabdQgaQbqabaGccqGHsislcuWF8oqBgaqcamaaBaaaleaacqWGPbqAaeqaaOGaeiykaKYaaWbaaSqabeaacqaIYaGmaaaabaGaemOAaOgabaGaemOBa42aaSbaaWqaaiabdkhaYjabdwgaLjabdchaWbqabaaaniabggHiLdaakeaacqWGobGtdaWgaaWcbaGaem4qam0aaSbaaWqaaiabdUgaRbqabaaaleqaaOGaemOBa42aaSbaaSqaaiabdkhaYjabdwgaLjabdchaWnaaBaaameaacqWGPbqAaeqaaaWcbeaaaaaabaGaemyAaKMaeyypa0JaeGymaedabaGaemOta40aaSbaaWqaaiabdoeadnaaBaaabaGaem4AaSgabeaaaeqaaaqdcqGHris5aaaa@5A4C@</m:annotation></m:semantics></m:math>, while the population mean and variance as:</p>
            <p><m:math name="1471-2105-7-514-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>M</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>p</m:mi><m:mi>o</m:mi><m:mi>o</m:mi><m:mi>l</m:mi></m:mrow></m:msub><m:mo>=</m:mo><m:mfrac><m:mrow><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mi>i</m:mi><m:mrow><m:msub><m:mi>N</m:mi><m:mrow><m:msub><m:mi>C</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub></m:mrow></m:munderover><m:mrow><m:mfrac><m:mrow><m:msub><m:mover accent="true"><m:mi>&#956;</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub></m:mrow><m:mrow><m:msubsup><m:mover accent="true"><m:mi>&#963;</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow></m:mfrac></m:mrow></m:mstyle></m:mrow><m:mrow><m:mstyle displaystyle="true"><m:munder><m:mo>&#8721;</m:mo><m:mi>i</m:mi></m:munder><m:mrow><m:mfrac><m:mn>1</m:mn><m:mrow><m:msubsup><m:mover accent="true"><m:mi>&#963;</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow></m:mfrac></m:mrow></m:mstyle></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGnbqtgaqcamaaBaaaleaacqWGWbaCcqWGVbWBcqWGVbWBcqWGSbaBaeqaaOGaeyypa0ZaaSaaaeaadaaeWbqaamaalaaabaacciGaf8hVd0MbaKaadaWgaaWcbaGaemyAaKgabeaaaOqaaiqb=n8aZzaajaWaa0baaSqaaiabdMgaPbqaaiabikdaYaaaaaaabaGaemyAaKgabaGaemOta40aaSbaaWqaaiabdoeadnaaBaaabaGaem4AaSgabeaaaeqaaaqdcqGHris5aaGcbaWaaabuaeaadaWcaaqaaiabigdaXaqaaiqb=n8aZzaajaWaa0baaSqaaiabdMgaPbqaaiabikdaYaaaaaaabaGaemyAaKgabeqdcqGHris5aaaaaaa@4CB1@</m:annotation></m:semantics></m:math> and <m:math name="1471-2105-7-514-i13" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mover accent="true"><m:mi>&#964;</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>c</m:mi><m:mi>o</m:mi><m:mi>r</m:mi><m:mi>r</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup><m:mo>=</m:mo><m:mfrac><m:mrow><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mi>i</m:mi><m:mrow><m:msub><m:mi>N</m:mi><m:mrow><m:msub><m:mi>C</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub></m:mrow></m:munderover><m:mrow><m:msup><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mover accent="true"><m:mi>&#956;</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub><m:mo>&#8722;</m:mo><m:msub><m:mover accent="true"><m:mi>M</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mi>p</m:mi><m:mi>o</m:mi><m:mi>o</m:mi><m:mi>l</m:mi></m:mrow></m:msub><m:mo stretchy="false">)</m:mo></m:mrow><m:mn>2</m:mn></m:msup></m:mrow></m:mstyle></m:mrow><m:mrow><m:msub><m:mi>N</m:mi><m:mrow><m:msub><m:mi>C</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub></m:mrow></m:mfrac><m:mo>&#8722;</m:mo><m:mfrac><m:mrow><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mi>i</m:mi><m:mrow><m:msub><m:mi>N</m:mi><m:mrow><m:msub><m:mi>C</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub></m:mrow></m:munderover><m:mrow><m:mstyle 
displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mi>j</m:mi><m:mrow><m:msub><m:mi>n</m:mi><m:mrow><m:mi>r</m:mi><m:mi>e</m:mi><m:mi>p</m:mi></m:mrow></m:msub></m:mrow></m:munderover><m:mrow><m:msup><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mi>x</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub><m:mo>&#8722;</m:mo><m:msub><m:mover accent="true"><m:mi>&#956;</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub><m:mo stretchy="false">)</m:mo></m:mrow><m:mn>2</m:mn></m:msup></m:mrow></m:mstyle></m:mrow></m:mstyle></m:mrow><m:mrow><m:msub><m:mi>N</m:mi><m:mrow><m:msub><m:mi>C</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub><m:msubsup><m:mi>n</m:mi><m:mrow><m:mi>r</m:mi><m:mi>e</m:mi><m:msub><m:mi>p</m:mi><m:mi>i</m:mi></m:msub></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFepaDgaqcamaaDaaaleaacqWGJbWycqWGVbWBcqWGYbGCcqWGYbGCaeaacqaIYaGmaaGccqGH9aqpdaWcaaqaamaaqahabaGaeiikaGIaf8hVd0MbaKaadaWgaaWcbaGaemyAaKgabeaakiabgkHiTiqbd2eanzaajaWaaSbaaSqaaiabdchaWjabd+gaVjabd+gaVjabdYgaSbqabaGccqGGPaqkdaahaaWcbeqaaiabikdaYaaaaeaacqWGPbqAaeaacqWGobGtdaWgaaadbaGaem4qam0aaSbaaeaacqWGRbWAaeqaaaqabaaaniabggHiLdaakeaacqWGobGtdaWgaaWcbaGaem4qam0aaSbaaWqaaiabdUgaRbqabaaaleqaaaaakiabgkHiTmaalaaabaWaaabCaeaadaaeWbqaaiabcIcaOiabdIha4naaBaaaleaacqWGPbqAcqWGQbGAaeqaaOGaeyOeI0Iaf8hVd0MbaKaadaWgaaWcbaGaemyAaKgabeaakiabcMcaPmaaCaaaleqabaGaeGOmaidaaaqaaiabdQgaQbqaaiabd6gaUnaaBaaameaacqWGYbGCcqWGLbqzcqWGWbaCaeqaaaqdcqGHris5aaWcbaGaemyAaKgabaGaemOta40aaSbaaWqaaiabdoeadnaaBaaabaGaem4AaSgabeaaaeqaaaqdcqGHris5aaGcbaGaemOta40aaSbaaSqaaiabdoeadnaaBaaameaacqWGRbWAaeqaaaWcbeaakiabd6gaUnaaDaaaleaacqWGYbGCcqWGLbqzcqWGWbaCdaWgaaadbaGaemyAaKgabeaaaSqaaiabikdaYaaaaaaaaa@7972@</m:annotation></m:semantics></m:math> where <m:math name="1471-2105-7-514-i14" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mover accent="true"><m:mi>&#963;</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi><m:mn>2</m:mn></m:msubsup><m:mo>=</m:mo><m:mfrac><m:mrow><m:msup><m:mover accent="true"><m:mi>&#963;</m:mi><m:mo>^</m:mo></m:mover><m:mn>2</m:mn></m:msup></m:mrow><m:mrow><m:msub><m:mi>n</m:mi><m:mrow><m:mi>r</m:mi><m:mi>e</m:mi><m:msub><m:mi>p</m:mi><m:mi>i</m:mi></m:msub></m:mrow></m:msub></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaqcamaaDaaaleaacqWGPbqAaeaacqaIYaGmaaGccqGH9aqpdaWcaaqaaiqb=n8aZzaajaWaaWbaaSqabeaacqaIYaGmaaaakeaacqWGUbGBdaWgaaWcbaGaemOCaiNaemyzauMaemiCaa3aaSbaaWqaaiabdMgaPbqabaaaleqaaaaaaaa@3C64@</m:annotation></m:semantics></m:math>.</p>
            <p>The estimate of the population variance includes two terms, representing the <it>between</it>-sample and <it>within</it>-sample variances (expressing the inter-subject and the intra-subject variability, respectively). The estimate of <it>&#964;</it><sup>2</sup> is particularly critical: it is valid as long as the within-sample variability is smaller than the between-sample one, i.e. under the assumption that the heterogeneity of the measurements is not too large <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>; otherwise the subtraction of the second term can make the estimate non-positive. If this assumption does not hold, other learning strategies, such as the EM estimate, can be applied. We note that empirical learning allows the HierNB approach to be used without additional computational burden with respect to the standard (non-hierarchical) approach.</p>
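            <p>As an illustration, the empirical learning estimates given above can be sketched in a few lines of Python for one feature and one class. The function name <it>learn_hier_nb</it> and the layout of the input (one list of replicate measurements per training case) are assumptions made for this example and do not describe the authors' tool.</p>

```python
def learn_hier_nb(replicates):
    """Empirical learning for one feature of one class.

    replicates: list of lists, one list of replicate measurements
    per training case (hypothetical input layout).
    Returns (M_pool, tau2_corr, sigma2).
    """
    n_cases = len(replicates)
    n_rep = [len(r) for r in replicates]

    # Within-sample means and sums of squared deviations.
    mu = [sum(r) / n for r, n in zip(replicates, n_rep)]
    ss = [sum((x - m) ** 2 for x in r) for r, m in zip(replicates, mu)]

    # Common within-sample variance and per-case variance of the mean.
    sigma2 = sum(s / (n_cases * n) for s, n in zip(ss, n_rep))
    sigma2_i = [sigma2 / n for n in n_rep]

    # Precision-weighted population mean.
    m_pool = (sum(m / v for m, v in zip(mu, sigma2_i))
              / sum(1.0 / v for v in sigma2_i))

    # Between-sample term minus within-sample correction term.
    between = sum((m - m_pool) ** 2 for m in mu) / n_cases
    within = sum(s / (n_cases * n ** 2) for s, n in zip(ss, n_rep))
    tau2_corr = between - within
    return m_pool, tau2_corr, sigma2


# Example: three cases with two replicate measurements each.
m_pool, tau2_corr, sigma2 = learn_hier_nb([[1.0, 3.0], [5.0, 7.0], [9.0, 11.0]])
```

            <p>The sketch does not guard against a zero within-sample variance (all replicates identical), which would make the weights undefined, nor against a non-positive corrected variance; both situations would need explicit handling in a real implementation.</p>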
         </sec>
         <sec>
            <st>
               <p>Classification by standard Na&#239;ve Bayes Model</p>
            </st>
            <p>Also in this case the classification of a new case requires the evaluation of the posterior probability of each class given the case data. However, the standard Na&#239;ve Bayes classifier does not consider the replicate measurements, but only their aggregate value (standard pooling strategy, e.g. mean value). Let X be the mean value of a given feature (univariate case) of the case to classify. The likelihood of X, X &#8712; R, is <m:math name="1471-2105-7-514-i15" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>P</m:mi><m:mo stretchy="false">(</m:mo><m:mi>X</m:mi><m:mo>|</m:mo><m:mi>C</m:mi><m:mo stretchy="false">)</m:mo><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mrow><m:msqrt><m:mrow><m:mn>2</m:mn><m:mi>&#960;</m:mi></m:mrow></m:msqrt><m:mi>&#964;</m:mi></m:mrow></m:mfrac><m:mi>exp</m:mi><m:mo>&#8289;</m:mo><m:mo stretchy="false">(</m:mo><m:mo>&#8722;</m:mo><m:mfrac><m:mn>1</m:mn><m:mrow><m:mn>2</m:mn><m:msup><m:mi>&#964;</m:mi><m:mn>2</m:mn></m:msup></m:mrow></m:mfrac><m:msup><m:mrow><m:mo stretchy="false">(</m:mo><m:mi>X</m:mi><m:mo>&#8722;</m:mo><m:mi>M</m:mi><m:mo stretchy="false">)</m:mo></m:mrow><m:mn>2</m:mn></m:msup><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGqbaucqGGOaakcqWGybawcqGG8baFcqWGdbWqcqGGPaqkcqGH9aqpdaWcaaqaaiabigdaXaqaamaakaaabaGaeGOmaidcciGae8hWdahaleqaaOGae8hXdqhaaiGbcwgaLjabcIha4jabcchaWjabcIcaOiabgkHiTmaalaaabaGaeGymaedabaGaeGOmaiJae8hXdq3aaWbaaSqabeaacqaIYaGmaaaaaOGaeiikaGIaemiwaGLaeyOeI0Iaemyta0KaeiykaKYaaWbaaSqabeaacqaIYaGmaaGccqGGPaqkaaa@4BC2@</m:annotation></m:semantics></m:math>, where <it>&#964;</it><sup>2</sup> and M are the variance and the mean of the distribution of the feature values in class <it>C</it>. By Bayes' theorem, the posterior probability of each class can be easily evaluated, and the new instance is classified into the class with maximal posterior probability. The generalization to the multivariate case is obtained by exploiting the assumption of conditional independence of the features given the class, as discussed in the Background section.</p>
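            <p>A minimal sketch of this classification step for the univariate case is given below. The function names, the dictionary layout of the class parameters and the class priors are assumptions introduced for the example; the paper does not prescribe them.</p>

```python
import math


def gaussian_likelihood(x, mean, tau2):
    """Gaussian density P(X | C) with mean M and variance tau^2."""
    return (math.exp(-((x - mean) ** 2) / (2.0 * tau2))
            / math.sqrt(2.0 * math.pi * tau2))


def st_nb_posterior(x, params, priors):
    """Posterior probability of each class given the pooled value x.

    params: {class_label: (M, tau2)}  (hypothetical layout)
    priors: {class_label: prior probability}
    """
    joint = {c: priors[c] * gaussian_likelihood(x, m, t2)
             for c, (m, t2) in params.items()}
    total = sum(joint.values())
    return {c: v / total for c, v in joint.items()}


# Example: two classes with equal priors and illustrative parameters.
params = {1: (0.0, 1.0), 2: (2.0, 1.0)}
priors = {1: 0.5, 2: 0.5}
posterior = st_nb_posterior(0.0, params, priors)
predicted = max(posterior, key=posterior.get)
```

            <p>For several features, the per-feature likelihoods would be multiplied within each class before normalization, in line with the conditional independence assumption; summing log-likelihoods instead is a common way to avoid numerical underflow.</p>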
         </sec>
         <sec>
            <st>
               <p>Learning of standard Na&#239;ve Bayes Model</p>
            </st>
            <p>Also in this case, the model parameters (M, <it>&#964;</it><sup>2</sup>) have to be estimated from a training dataset. They can be computed as:</p>
            <p><m:math name="1471-2105-7-514-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mi>M</m:mi><m:mo>^</m:mo></m:mover><m:mo>=</m:mo><m:mfrac><m:mrow><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mi>i</m:mi><m:mrow><m:msub><m:mi>N</m:mi><m:mrow><m:msub><m:mi>C</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub></m:mrow></m:munderover><m:mrow><m:msub><m:mover accent="true"><m:mi>&#956;</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub></m:mrow></m:mstyle></m:mrow><m:mrow><m:msub><m:mi>N</m:mi><m:mrow><m:msub><m:mi>C</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGnbqtgaqcaiabg2da9maalaaabaWaaabCaeaaiiGacuWF8oqBgaqcamaaBaaaleaacqWGPbqAaeqaaaqaaiabdMgaPbqaaiabd6eaonaaBaaameaacqWGdbWqdaWgaaqaaiabdUgaRbqabaaabeaaa0GaeyyeIuoaaOqaaiabd6eaonaaBaaaleaacqWGdbWqdaWgaaadbaGaem4AaSgabeaaaSqabaaaaaaa@3DBD@</m:annotation></m:semantics></m:math> and <m:math name="1471-2105-7-514-i17" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msup><m:mover accent="true"><m:mi>&#964;</m:mi><m:mo>^</m:mo></m:mover><m:mn>2</m:mn></m:msup><m:mo>=</m:mo><m:mfrac><m:mrow><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mi>i</m:mi><m:mrow><m:msub><m:mi>N</m:mi><m:mrow><m:msub><m:mi>C</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub></m:mrow></m:munderover><m:mrow><m:msup><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mover accent="true"><m:mi>&#956;</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub><m:mo>&#8722;</m:mo><m:mover accent="true"><m:mi>M</m:mi><m:mo>^</m:mo></m:mover><m:mo stretchy="false">)</m:mo></m:mrow><m:mn>2</m:mn></m:msup></m:mrow></m:mstyle></m:mrow><m:mrow><m:msub><m:mi>N</m:mi><m:mrow><m:msub><m:mi>C</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFepaDgaqcamaaCaaaleqabaGaeGOmaidaaOGaeyypa0ZaaSaaaeaadaaeWbqaaiabcIcaOiqb=X7aTzaajaWaaSbaaSqaaiabdMgaPbqabaGccqGHsislcuWGnbqtgaqcaiabcMcaPmaaCaaaleqabaGaeGOmaidaaaqaaiabdMgaPbqaaiabd6eaonaaBaaameaacqWGdbWqdaWgaaqaaiabdUgaRbqabaaabeaaa0GaeyyeIuoaaOqaaiabd6eaonaaBaaaleaacqWGdbWqdaWgaaadbaGaem4AaSgabeaaaSqabaaaaaaa@447E@</m:annotation></m:semantics></m:math>.</p>
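            <p>These two estimates can be sketched as follows, again for one feature and one class. The function name <it>learn_st_nb</it> and the input layout (one list of replicates per case, pooled by their mean) are assumptions made for the example.</p>

```python
def learn_st_nb(replicates):
    """Standard Naive Bayes learning for one feature of one class.

    replicates: list of lists, one list of replicate measurements
    per training case (hypothetical input layout).
    Returns (M_hat, tau2_hat).
    """
    # Pool the replicates of each case into their mean value.
    mu = [sum(r) / len(r) for r in replicates]
    n_cases = len(mu)

    # Mean and variance of the pooled values across cases.
    m_hat = sum(mu) / n_cases
    tau2_hat = sum((m - m_hat) ** 2 for m in mu) / n_cases
    return m_hat, tau2_hat


# Example: three cases with two replicate measurements each.
m_hat, tau2_hat = learn_st_nb([[1.0, 3.0], [5.0, 7.0], [9.0, 11.0]])
```

            <p>Unlike the hierarchical estimate, this variance is not corrected for the within-sample spread of the replicates, which is discarded by the pooling step.</p>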
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>FD, PM and RB developed and implemented the Hierarchical Na&#239;ve Bayes Model presented in the paper. MAR was involved in the discussions about the suitability of the model for addressing protein expression heterogeneity in human tumors and generated the TMA protein expression datasets. PP helped in the evaluation of the model performance. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors would like to thank Robert Kim for his technical expertise in performing the TMA experiments, Juan-Miguel Mosquera and Kirsten D. Mertz for their pathology evaluation of the tissue microarray experiments, and Andrea Sboner and Rossana Dell'Anna for critical discussion of the paper. RB and PM acknowledge the FIRB project "Learning theory and engineering applications", funded by the Italian Ministry of University and Scientific Research. FD was supported by a Prostate Cancer Foundation award.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Prospective evaluation of AMACR (P504S) and basal cell markers in the assessment of routine prostate needle biopsy specimens</p>
            </title>
            <aug>
               <au>
                  <snm>Browne</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Hirsch</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Brodsky</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Welch</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Loda</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Hum Pathol</source>
            <pubdate>2004</pubdate>
            <volume>35</volume>
            <issue>12</issue>
            <fpage>1462</fpage>
            <lpage>1468</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.humpath.2004.09.009</pubid>
                  <pubid idtype="pmpid" link="fulltext">15619204</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Evaluation of HER-2/neu oncogene status in breast tumors on tissue microarrays</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Salto-Tellez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Do</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Putti</snm>
                  <fnm>TC</fnm>
               </au>
               <au>
                  <snm>Koay</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Hum Pathol</source>
            <pubdate>2003</pubdate>
            <volume>34</volume>
            <issue>4</issue>
            <fpage>362</fpage>
            <lpage>368</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1053/hupa.2003.60</pubid>
                  <pubid idtype="pmpid" link="fulltext">12733117</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Statistical methods for analyzing tissue microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Minin</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Seligson</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Horvath</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Biopharm Stat</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>3</issue>
            <fpage>671</fpage>
            <lpage>685</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1081/BIP-200025657</pubid>
                  <pubid idtype="pmpid">15468758</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Bayesian Data Analysis, 2nd ed</p>
            </title>
            <aug>
               <au>
                  <snm>Gelman</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <publisher>Chapman &amp; Hall</publisher>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Intelligent data analysis for medical diagnosis: using machine learning and temporal abstraction</p>
            </title>
            <aug>
               <au>
                  <snm>Lavrac</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>AI Communications</source>
            <pubdate>1998</pubdate>
            <volume>11</volume>
            <fpage>191</fpage>
            <lpage>218</lpage>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Learning Patterns in Noisy Data: The AQ Approach</p>
            </title>
            <aug>
               <au>
                  <snm>Michalski</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Kaufman</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Machine Learning and its Applications</source>
            <publisher>Berlin, Springer-Verlag</publisher>
            <editor>Paliouras G, Karkaletsis V, Spyropoulos C</editor>
            <pubdate>2001</pubdate>
            <fpage>22</fpage>
            <lpage>38</lpage>
         </bibl>
         <bibl id="B7">
            <title>
               <p>A Bayesian approach to joint feature selection and classifier design</p>
            </title>
            <aug>
               <au>
                  <snm>Krishnapuram</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hartemink</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Carin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Figueiredo</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>IEEE Trans Pattern Anal Mach Intell</source>
            <pubdate>2004</pubdate>
            <volume>26</volume>
            <issue>9</issue>
            <fpage>1105</fpage>
            <lpage>1111</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1109/TPAMI.2004.55</pubid>
                  <pubid idtype="pmpid">15742887</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A Bayesian network classification methodology for gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Helman</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Veroff</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Atlas</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Willman</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2004</pubdate>
            <volume>11</volume>
            <issue>4</issue>
            <fpage>581</fpage>
            <lpage>615</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/cmb.2004.11.581</pubid>
                  <pubid idtype="pmpid" link="fulltext">15579233</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Prognostic meta-signature of breast cancer developed by two-stage mixture modeling of microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Shen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chinnaiyan</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>1</issue>
            <fpage>94</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">544889</pubid>
                  <pubid idtype="pmpid" link="fulltext">15598354</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-5-94</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>A hierarchical Bayesian model for learning nonlinear statistical regularities in nonstationary natural signals</p>
            </title>
            <aug>
               <au>
                  <snm>Karklin</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Lewicki</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>Neural Comput</source>
            <pubdate>2005</pubdate>
            <volume>17</volume>
            <issue>2</issue>
            <fpage>397</fpage>
            <lpage>423</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1162/0899766053011474</pubid>
                  <pubid idtype="pmpid" link="fulltext">15720773</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Nonparametric AUC estimation in population studies with incomplete sampling: a Bayesian approach</p>
            </title>
            <aug>
               <au>
                  <snm>Magni</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bellazzi</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>De Nicolao</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Poggesi</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Rocchetti</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Pharmacokinet Pharmacodyn</source>
            <pubdate>2002</pubdate>
            <volume>29</volume>
            <issue>5-6</issue>
            <fpage>445</fpage>
            <lpage>471</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1022920403166</pubid>
                  <pubid idtype="pmpid" link="fulltext">12795241</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>A hierarchical Binomial-Poisson model for the analysis of a crossover design for correlated binary data when the number of trials is dose-dependent</p>
            </title>
            <aug>
               <au>
                  <snm>Shkedy</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Molenberghs</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Van Craenendonck</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Steckler</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bijnens</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>J Biopharm Stat</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <issue>2</issue>
            <fpage>225</fpage>
            <lpage>239</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15796291</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Gene selection using a two-level hierarchical Bayesian model</p>
            </title>
            <aug>
               <au>
                  <snm>Bae</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Mallick</snm>
                  <fnm>BK</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>18</issue>
            <fpage>3423</fpage>
            <lpage>3430</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth419</pubid>
                  <pubid idtype="pmpid" link="fulltext">15256404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Bayesian hierarchical error model for analysis of gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Cho</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>JK</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>13</issue>
            <fpage>2016</fpage>
            <lpage>2025</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth192</pubid>
                  <pubid idtype="pmpid" link="fulltext">15044230</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Random walk models for Bayesian clustering of gene expression profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Ferrazzi</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Magni</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bellazzi</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Appl Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>4</volume>
            <issue>4</issue>
            <fpage>263</fpage>
            <lpage>276</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2165/00822942-200504040-00006</pubid>
                  <pubid idtype="pmpid">16309344</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>BGX: a fully Bayesian integrated approach to the analysis of Affymetrix GeneChip data</p>
            </title>
            <aug>
               <au>
                  <snm>Hein</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Causton</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Ambler</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Biostatistics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>3</issue>
            <fpage>349</fpage>
            <lpage>373</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/biostatistics/kxi016</pubid>
                  <pubid idtype="pmpid" link="fulltext">15831583</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Multilevel modelling of medical data</p>
            </title>
            <aug>
               <au>
                  <snm>Goldstein</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Browne</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Rasbash</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Stat Med</source>
            <pubdate>2002</pubdate>
            <volume>21</volume>
            <issue>21</issue>
            <fpage>3291</fpage>
            <lpage>3315</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/sim.1264</pubid>
                  <pubid idtype="pmpid" link="fulltext">12375305</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Multilevel Modelling of Health Statistics</p>
            </title>
            <aug>
               <au>
                  <snm>Leyland</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Goldstein</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <publisher>Wiley: Chichester</publisher>
            <pubdate>2001</pubdate>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Classification using Hierarchical Naive Bayes models</p>
            </title>
            <aug>
               <au>
                  <snm>Langseth</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>TD</fnm>
               </au>
            </aug>
            <source>Machine Learning</source>
            <pubdate>2006</pubdate>
            <volume>63</volume>
            <fpage>135</fpage>
            <lpage>159</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/s10994-006-6136-2</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Assessing predictive accuracy: how to compare Brier scores</p>
            </title>
            <aug>
               <au>
                  <snm>Redelmeier</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Bloch</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Hickam</snm>
                  <fnm>DH</fnm>
               </au>
            </aug>
            <source>J Clin Epidemiol</source>
            <pubdate>1991</pubdate>
            <volume>44</volume>
            <issue>11</issue>
            <fpage>1141</fpage>
            <lpage>1146</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0895-4356(91)90146-Z</pubid>
                  <pubid idtype="pmpid">1941009</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Data Mining</p>
            </title>
            <aug>
               <au>
                  <snm>Witten</snm>
                  <fnm>IH</fnm>
               </au>
               <au>
                  <snm>Frank</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <publisher>San Francisco (CA), Morgan Kaufmann</publisher>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B22">
            <title>
               <p>R: A Language for Data Analysis and Graphics</p>
            </title>
            <aug>
               <au>
                  <snm>Ihaka</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gentleman</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Journal of Computational and Graphical Statistics</source>
            <pubdate>1996</pubdate>
            <volume>5</volume>
            <issue>3</issue>
            <fpage>299</fpage>
            <lpage>314</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/1390807</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Defining aggressive prostate cancer using a 12-gene model</p>
            </title>
            <aug>
               <au>
                  <snm>Bismar</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Demichelis</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Riva</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Varambally</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kutok</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Aster</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Tang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kuefer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hofer</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Febbo</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Chinnaiyan</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Neoplasia</source>
            <pubdate>2006</pubdate>
            <volume>8</volume>
            <issue>1</issue>
            <fpage>59</fpage>
            <lpage>68</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1593/neo.05664</pubid>
                  <pubid idtype="pmpid" link="fulltext">16533427</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The polycomb group protein EZH2 is involved in progression of prostate cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Varambally</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dhanasekaran</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Barrette</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Kumar-Sinha</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sanda</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Pienta</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Sewalt</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Otte</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Chinnaiyan</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>419</volume>
            <issue>6907</issue>
            <fpage>624</fpage>
            <lpage>629</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01075</pubid>
                  <pubid idtype="pmpid" link="fulltext">12374981</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Meta-analysis of microarrays: interstudy validation of gene expression profiles reveals pathway dysregulation in prostate cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Rhodes</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Barrette</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chinnaiyan</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <source>Cancer Res</source>
            <pubdate>2002</pubdate>
            <volume>62</volume>
            <issue>15</issue>
            <fpage>4427</fpage>
            <lpage>4433</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12154050</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>alpha-Methylacyl coenzyme A racemase as a tissue biomarker for prostate cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Rubin</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dhanasekaran</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Varambally</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Barrette</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Sanda</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Pienta</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chinnaiyan</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <source>JAMA</source>
            <pubdate>2002</pubdate>
            <volume>287</volume>
            <issue>13</issue>
            <fpage>1662</fpage>
            <lpage>1670</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1001/jama.287.13.1662</pubid>
                  <pubid idtype="pmpid" link="fulltext">11926890</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Identification of differentially expressed genes in human prostate cancer using subtraction and microarray</p>
            </title>
            <aug>
               <au>
                  <snm>Xu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stolk</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Houghton</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Matsumura</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Vedvick</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Leslie</snm>
                  <fnm>KB</fnm>
               </au>
               <au>
                  <snm>Badaro</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Reed</snm>
                  <fnm>SG</fnm>
               </au>
            </aug>
            <source>Cancer Res</source>
            <pubdate>2000</pubdate>
            <volume>60</volume>
            <issue>6</issue>
            <fpage>1677</fpage>
            <lpage>1682</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10749139</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Decreased alpha-methylacyl CoA racemase expression in localized prostate cancer is associated with an increased rate of biochemical recurrence and cancer-specific death</p>
            </title>
            <aug>
               <au>
                  <snm>Rubin</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Bismar</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Andren</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Mucci</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Chinnaiyan</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Adami</snm>
                  <fnm>HO</fnm>
               </au>
               <au>
                  <snm>Kantoff</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Johansson</snm>
                  <fnm>JE</fnm>
               </au>
            </aug>
            <source>Cancer Epidemiol Biomarkers Prev</source>
            <pubdate>2005</pubdate>
            <volume>14</volume>
            <issue>6</issue>
            <fpage>1424</fpage>
            <lpage>1432</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1158/1055-9965.EPI-04-0801</pubid>
                  <pubid idtype="pmpid" link="fulltext">15941951</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Support vector classification with input data uncertainty</p>
            </title>
            <aug>
               <au>
                  <snm>Bi</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <publisher>NIPS</publisher>
            <editor>Zhang T</editor>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Neural Networks that learn from Fuzzy If-Then rules</p>
            </title>
            <aug>
               <au>
                  <snm>Ishibuchi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>al.</snm>
                  <fnm/>
               </au>
            </aug>
            <source>IEEE Trans Fuzzy Systems</source>
            <pubdate>1993</pubdate>
            <volume>1</volume>
            <issue>2</issue>
            <fpage>85</fpage>
            <lpage>97</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1109/91.227388</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Bayesian model averaging: development of an improved multi-class, gene selection and classification tool for microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Yeung</snm>
                  <fnm>KY</fnm>
               </au>
               <au>
                  <snm>Bumgarner</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Raftery</snm>
                  <fnm>AE</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>10</issue>
            <fpage>2394</fpage>
            <lpage>2402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti319</pubid>
                  <pubid idtype="pmpid" link="fulltext">15713736</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Robust sparse hyperplane classifiers: application to uncertain molecular profiling data</p>
            </title>
            <aug>
               <au>
                  <snm>Bhattacharyya</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Grate</snm>
                  <fnm>LR</fnm>
               </au>
               <au>
                  <snm>Jordan</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>El Ghaoui</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Mian</snm>
                  <fnm>IS</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2004</pubdate>
            <volume>11</volume>
            <issue>6</issue>
            <fpage>1073</fpage>
            <lpage>1089</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/cmb.2004.11.1073</pubid>
                  <pubid idtype="pmpid" link="fulltext">15662199</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Markov Chain Monte Carlo in Practice</p>
            </title>
            <aug>
               <au>
                  <snm>Gilks</snm>
                  <fnm>WR</fnm>
               </au>
            </aug>
            <publisher>Chapman &amp; Hall</publisher>
            <pubdate>1996</pubdate>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Hierarchical Naive Bayes Classifiers for uncertain data</p>
            </title>
            <aug>
               <au>
                  <snm>Bellazzi</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Demichelis</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Piergiorgi</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Magni</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>DIS Technical Report</source>
            <publisher>Pavia, I, Laboratory for Biomedical Informatics, University of Pavia</publisher>
            <pubdate>2006</pubdate>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Learning temporal probabilistic causal models from longitudinal data</p>
            </title>
            <aug>
               <au>
                  <snm>Riva</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bellazzi</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Artif Intell Med</source>
            <pubdate>1996</pubdate>
            <volume>8</volume>
            <issue>3</issue>
            <fpage>217</fpage>
            <lpage>234</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0933-3657(95)00034-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">8830923</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Tissue microarrays for high-throughput molecular profiling of tumor specimens</p>
            </title>
            <aug>
               <au>
                  <snm>Kononen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bubendorf</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kallioniemi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Barlund</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schraml</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Leighton</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Torhorst</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mihatsch</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Sauter</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kallioniemi</snm>
                  <fnm>OP</fnm>
               </au>
            </aug>
            <source>Nat Med</source>
            <pubdate>1998</pubdate>
            <volume>4</volume>
            <issue>7</issue>
            <fpage>844</fpage>
            <lpage>847</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nm0798-844</pubid>
                  <pubid idtype="pmpid">9662379</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Tissue microarray (TMA) technology: miniaturized pathology archives for high-throughput in situ studies</p>
            </title>
            <aug>
               <au>
                  <snm>Bubendorf</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Nocito</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Moch</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sauter</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>J Pathol</source>
            <pubdate>2001</pubdate>
            <volume>195</volume>
            <issue>1</issue>
            <fpage>72</fpage>
            <lpage>79</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/path.893</pubid>
                  <pubid idtype="pmpid" link="fulltext">11568893</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Neuroendocrine expression in metastatic prostate cancer: evaluation of high throughput tissue microarrays to detect heterogeneous protein expression</p>
            </title>
            <aug>
               <au>
                  <snm>Mucci</snm>
                  <fnm>NR</fnm>
               </au>
               <au>
                  <snm>Akdas</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Manely</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Hum Pathol</source>
            <pubdate>2000</pubdate>
            <volume>31</volume>
            <issue>4</issue>
            <fpage>406</fpage>
            <lpage>414</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1053/hp.2000.7295</pubid>
                  <pubid idtype="pmpid" link="fulltext">10821485</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Machine Learning</p>
            </title>
            <aug>
               <au>
                  <snm>Mitchell</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <publisher>New York, McGraw-Hill</publisher>
            <pubdate>1997</pubdate>
         </bibl>
      </refgrp>
   </bm>
</art>
