<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-8-67</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>Kernel-imbedded Gaussian processes for disease classification using microarray gene expression data</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Zhao</snm>
               <fnm>Xin</fnm>
               <insr iid="I1"/>
               <email>xinz@hawaii.edu</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Cheung</snm>
               <mnm>Wang-Kit</mnm>
               <fnm>Leo</fnm>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>lcheung@lumc.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Information and Computer Sciences, University of Hawaii, 1680 East-West Road, Honolulu, Hawaii, 96822 USA</p>
            </ins>
            <ins id="I2">
               <p>Bioinformatics Core, Stritch School of Medicine, Loyola University Medical Center, 2160 South First Avenue, Maywood, Illinois 60153 USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Preventive Medicine and Epidemiology, Stritch School of Medicine, Loyola University Medical Center, 2160 South First Avenue, Maywood, Illinois 60153 USA</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>67</fpage>
         <url>http://www.biomedcentral.com/1471-2105/8/67</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17328811</pubid>
               <pubid idtype="doi">10.1186/1471-2105-8-67</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>16</day>
               <month>6</month>
               <year>2006</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>28</day>
               <month>2</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>28</day>
               <month>2</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Zhao and Cheung; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Designing appropriate machine learning methods for identifying genes that have significant discriminating power for disease outcomes has become increasingly important for our understanding of diseases at the genomic level. Although many machine learning methods have been developed and applied to microarray gene expression data analysis, the majority of them are based on linear models, which are not necessarily appropriate for the underlying connection between the target disease and its associated explanatory genes. Linear model based methods also tend to bring in false positive significant features more easily. Furthermore, linear model based algorithms often involve calculating the inverse of a matrix that is possibly singular when the number of potentially important genes is relatively large. This leads to problems of numerical instability. To overcome these limitations, a few non-linear methods have recently been introduced to the area. Many of the existing non-linear methods have two critical problems, the model selection problem and the model parameter tuning problem, that remain unsolved or even untouched. In general, a unified framework that allows model parameters of both linear and non-linear models to be easily tuned is preferred in real-world applications. Kernel-induced learning methods form a class of approaches that show promise for achieving this goal.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>A hierarchical statistical model named kernel-imbedded Gaussian process (KIGP) is developed under a unified Bayesian framework for binary disease classification problems using microarray gene expression data. In particular, based on a probit regression setting, an adaptive algorithm with a cascading structure is designed to find the appropriate kernel, to discover the potentially significant genes, and to make the optimal class prediction accordingly. A Gibbs sampler is built as the core of the algorithm to make Bayesian inferences. Simulation studies showed that, even without any knowledge of the underlying generative model, the KIGP performed very close to the theoretical Bayesian bound not only in the case with a linear Bayesian classifier but also in the case with a highly non-linear Bayesian classifier. This suggests that it is broadly applicable to microarray data analysis problems, especially those for which linear methods work poorly. The KIGP was also applied to four published microarray datasets, and the results showed that the KIGP performed better than or at least as well as any of the referenced state-of-the-art methods in all of these cases.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Mathematically built on the kernel-induced feature space concept under a Bayesian framework, the KIGP method presented in this paper provides a unified machine learning approach to explore both the linear and the possibly non-linear underlying relationship between the target features of a given binary disease classification problem and the related explanatory gene expression data. More importantly, it incorporates the model parameter tuning into the framework. The model selection problem is addressed in the form of selecting a proper kernel type. The KIGP method also gives Bayesian probabilistic predictions for disease classification. These properties and features are beneficial to most real-world applications. The algorithm is naturally robust in numerical computation. The simulation studies and the published data studies demonstrated that the proposed KIGP performs satisfactorily and consistently.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>DNA microarray technology provides researchers with a high-throughput means to measure expression levels for thousands of genes in an experiment. Careful analyses of microarray gene expression data can help us better understand human health and disease and have very important implications in basic sciences as well as pharmaceutical and clinical research. Some existing methodologies for microarray gene expression data analysis, such as those introduced in <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp> and <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, have demonstrated their usefulness for a variety of class discovery or class prediction problems in biomedical applications. In a microarray study, we typically face the problem of analyzing thousands of genes from a relatively small number of available samples. This setting gives rise to a very high likelihood of finding many "false positives" with conventional statistical methods. Therefore, properly selecting the group of genes that are significantly related to a target disease has become one of the key challenges in microarray data analysis.</p>
         <p>The gene selection problem can be viewed as a variable selection problem associated with linear regression models. Classical variable selection methods and criteria include, among others, the <it>C</it><sub><it>p </it></sub>Criterion <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> (the ratio of the error sum of squares for the model with p variables to the error mean square of the full model, adjusted with a penalty for the number of variables), the Akaike Information Criterion or AIC <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, and the Bayesian Information Criterion or BIC <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. George and Foster <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> later suggested that these criteria correspond to a hierarchical Bayesian variable selection procedure under a particular class of priors. Following a similar setting with a slightly different prior specification, Yuan and Lin <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> provided another approach to this problem and showed that their algorithm was significantly faster and could potentially be used even when the predictor dimension is larger than the training sample size. Although both of these algorithms have been shown to improve the selection performance compared with classical methods such as <it>C</it><sub><it>p</it></sub>, AIC or BIC, they share a common disadvantage. That is, even after the hyperparameters are estimated, the variable selection criteria need to be evaluated on each candidate model for optimality. The number of candidate models usually grows exponentially with the number of variables, whereas the typical number of investigated genes in a microarray data analysis problem is in the thousands. This motivates the development of Markov Chain Monte Carlo (MCMC) algorithms under a Bayesian framework to attack the problem. One of the most widely used MCMC algorithms is the Gibbs sampler. 
For the microarray analysis problem, Lee et al. <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> suggested a Bayesian model based on a linear probit regression setting and proposed a Gibbs sampler to solve it. An extension of this method based on a multinomial probit regression setting has also been proposed <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Similarly, Zhou et al. (<abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>) developed another Bayesian approach to the gene selection problem, built upon a linear logistic regression model.</p>
         <p>The linear model based methods mentioned above have shown various levels of effectiveness in finding the set of significant genes in a wide range of real microarray experiments. However, they all share some common limitations. First, and most importantly, a linear model is not necessarily a good approximation of the underlying physical model. Second, linear model based methods are more likely to bring in false positives. Third, the computations of these linear model based algorithms usually involve calculating the inverse of a matrix that is possibly singular when the number of potentially important genes is relatively large. To overcome these disadvantages, Zhou et al. <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> introduced a non-linear term into the basic linear probit regression model and applied a bootstrapping procedure to enlarge the sample size. A technique called sequential Monte Carlo was adopted for the numerical Bayesian computation in their work. Some other models were also developed for tumor classification problems with gene expression profiling. For instance, based on the simple nearest centroid classifier and via a shrinking strategy, Tibshirani et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> offered the so-called "nearest shrunken centroids" (also known as "Prediction Analysis for Microarrays" or PAM) algorithm. By combining two ensemble schemes, i.e. bagging and boosting, Dettling <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> introduced the method "BagBoosting" as an enhanced version of the regular boosting algorithm. Both of these methods were shown to be very effective when applied to a few published datasets.</p>
         <p>Kernel-induced machine learning is one of the most promising approaches for exploring the potential non-linearity of a given classification or regression problem through the feature space concept. For example, kernel-induced support vector machines (SVMs) have been successfully applied to a number of learning tasks and are generally accepted as one of the state-of-the-art learning methods. Theoretically, Lin et al. (<abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>) proved that an SVM with an appropriately chosen kernel and model parameters can approach the Bayesian bound of a given problem when the training sample size is large enough. For the gene-selection problem, Guyon et al. <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> proposed the method "Recursive Feature Elimination" (RFE) to rank the genes with respect to a provided SVM, so that the SVM can be used for microarray data analysis. RFE was shown to be very effective with a linear kernel. However, when the number of genes is large (in the hundreds), RFE does not work well with a non-linear kernel. This limits the application of SVMs to the analysis of microarray data. Zhu and Hastie (<abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>) later proposed a framework called kernel logistic regression and suggested a method called "Import Vector Machine" to solve it. However, they also chose RFE as the strategy to select the significant genes.</p>
         <p>Because Bayesian probability theory can help construct a unified framework for modeling data and facilitates tuning of the involved parameters and/or hyperparameters, developing a proper Bayesian probabilistic model is usually beneficial for a machine learning method. MacKay <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> introduced a probabilistic evidence framework as a Bayesian learning paradigm for neural networks. Given the close relationship between neural network methods and kernel-induced learning methods, Kwok <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> and Gestel et al. <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> developed a Bayesian framework for SVMs and least squares support vector machines (LSSVMs) respectively, guided by the principle of the evidence framework. Neal <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> also showed that, as the number of hidden units increases in a Bayesian neural network, the prior over the network output converges to a Gaussian process (GP) if independent Gaussian distributions are used as the priors for the network weights and bias. LSSVMs are conceptually close to SVMs, except that they use equality constraints instead of inequality constraints and a squared error penalty function. Solving an LSSVM therefore only involves solving a set of linear equations; although this loses the sparseness featured in an SVM, it makes an LSSVM much easier to implement on-line. If we consider the similarity between the mapping from input nodes/data to hidden units in a neural network and the mapping from input data to a feature space conceptually embedded in an LSSVM, it is not surprising that, under the Gaussian noise assumption, the mean of the posterior prediction made by a GP coincides with the optimum decision function made by an LSSVM, whereas a GP offers a more approachable probabilistic model. 
This fact motivated us to develop a new Bayesian learning method named kernel-imbedded Gaussian process (KIGP) for microarray gene expression data analysis based on the Gaussian process theory.</p>
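         <p>As a minimal sketch of the kernel idea behind this observation, and not part of the KIGP implementation, the following Python fragment compares two routes to the same prediction for a linear kernel: ridge regression on the explicit features, and the GP regression posterior mean computed from the Gram matrix only. The toy data, the noise variance and all function names are assumptions made for illustration, and the bias term of an LSSVM is omitted.</p>
         <p><![CDATA[
```python
import numpy as np


def gp_posterior_mean(K_train, K_test_train, y, noise_var):
    """GP regression posterior mean: K_* (K + noise_var * I)^{-1} y."""
    n = K_train.shape[0]
    alpha = np.linalg.solve(K_train + noise_var * np.eye(n), y)
    return K_test_train @ alpha


def ridge_primal_prediction(X_train, X_test, y, noise_var):
    """Ridge regression on explicit features (identity map for a linear kernel)."""
    p = X_train.shape[1]
    w = np.linalg.solve(X_train.T @ X_train + noise_var * np.eye(p), X_train.T @ y)
    return X_test @ w


# Hypothetical toy data: 20 training samples, 2 features, labels in {-1, +1}.
rng = np.random.default_rng(0)
X_train = rng.normal(size=(20, 2))
y = np.sign(X_train[:, 0] + X_train[:, 1])
X_test = rng.normal(size=(5, 2))
noise_var = 1.0  # assumed noise variance

# Linear kernel: the Gram matrices are plain inner products.
K = X_train @ X_train.T
K_star = X_test @ X_train.T

mean_kernel = gp_posterior_mean(K, K_star, y, noise_var)
mean_primal = ridge_primal_prediction(X_train, X_test, y, noise_var)
```
]]></p>
         <p>The two predictions agree up to rounding error, which shows that only kernel evaluations are needed. With a non-linear kernel the explicit feature map may be unavailable or infinite-dimensional, and only the kernel route remains practical.</p>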
         <p>The general framework of the KIGP method is sketched in Fig. <figr fid="F1">1</figr>, where the box bounded by the dotted lines represents the proposed learning component of the method. Conceptually, a small group of the gene data is selected via a gene-selection procedure. Through a feature mapping function &#936;(&#183;), the selected gene data are mapped into a feature space where the optimal classification procedure is carried out. With the theory of kernel-induced feature space <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, we never compute the feature mapping explicitly. Instead, we train on the data via a kernel-imbedded Gaussian process by using a kernel function. At the output end, there are three consecutive phases: the "kernel parameter fitting phase", the "gene selection phase", and the "prediction phase". Given a kernel type, the KIGP algorithm finds the fitted kernel parameter(s) in the "kernel parameter fitting phase". After fixing the kernel parameter(s) at the fitted value(s), it continues with the "gene selection phase" and yields a group of significant genes under some given confidence level. Based on the fitted kernel parameter(s) and the selected significant gene data, the algorithm makes a probabilistic prediction for each testing sample in the "prediction phase". The details of the algorithm are discussed in the "Methods" section.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Schematic plot for the general framework of the proposed KIGP method</p>
            </caption>
            <text>
               <p>Schematic plot for the general framework of the proposed KIGP method.</p>
            </text>
            <graphic file="1471-2105-8-67-1"/>
         </fig>
         <p>The rest of this paper is organized as follows. In the "Results" section, we show the results from applying the proposed KIGP method to simulated datasets as well as real published microarray datasets. The conclusions and directions for further research are summarized in the "Discussions and Conclusions" section. In the "Methods" section, we provide the mathematical content of the methodology followed by a detailed description of the algorithm.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>Some terms and acronyms defined in the "Methods" section are used in this section. They include "gene-selection vector (<it>&#947;</it>)", "linear kernel (LK)", "polynomial kernel (PK)", "Gaussian kernel (GK)", "Normalized Log-Frequency (NLF)", "false discovery rate (fdr)", "kernel parameter fitting phase", "gene selection phase", "prediction phase", "misclassification rate (MR)", "average predictive probability (APP)", "leave-one-out cross-validation (LOOCV)" and "3-fold cross-validation (3-fold CV)". One can refer to the "Methods" section for the details.</p>
         <sec>
            <st>
               <p>Simulation studies</p>
            </st>
            <sec>
               <st>
                  <p>Example 1</p>
               </st>
               <p>This example was designed to illustrate all the key concepts, elements and procedures of the KIGP framework introduced in the "Methods" section. It consists of two cases. In the first case, the Bayesian classifier of the underlying generative model is linear, while in the second case, the Bayesian classifier takes a highly non-linear form. We set the number of significant/explanatory genes to two, so that we can display the Bayesian classifier and the relative performance of the KIGP method graphically. In both cases, the number of training samples is twenty. Ten training samples were generated from the class "1" and the other ten samples were generated from the class "-1". The number of testing samples is 5000. For each sample, the number of investigated genes is 200; the indices of the two underlying explanatory genes were preset as [23,57]. For each case, we independently generated 10 sets of training samples from the generative model and ran the simulation on each of them.</p>
            </sec>
            <sec>
               <st>
                  <p>(a) Case with a Linear Bayesian Classifier</p>
               </st>
               <p>In this linear case, the two preset significant genes were generated from the bivariate Gaussian distribution <m:math name="1471-2105-8-67-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>N</m:mi><m:mo stretchy="false">(</m:mo><m:mrow><m:mo>[</m:mo><m:mrow><m:mtable><m:mtr><m:mtd><m:mn>1</m:mn></m:mtd></m:mtr><m:mtr><m:mtd><m:mn>1</m:mn></m:mtd></m:mtr></m:mtable></m:mrow><m:mo>]</m:mo></m:mrow><m:mo>,</m:mo><m:mrow><m:mo>[</m:mo><m:mrow><m:mtable><m:mtr><m:mtd><m:mn>1</m:mn></m:mtd><m:mtd><m:mrow><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:mtd></m:mtr><m:mtr><m:mtd><m:mrow><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:mtd><m:mtd><m:mn>2</m:mn></m:mtd></m:mtr></m:mtable></m:mrow><m:mo>]</m:mo></m:mrow><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGobGtcqGGOaakdaWadaqaauaabeqaceaaaeaacqaIXaqmaeaacqaIXaqmaaaacaGLBbGaayzxaaGaeiilaWYaamWaaeaafaqabeGacaaabaGaeGymaedabaGaeyOeI0IaeGymaedabaGaeyOeI0IaeGymaedabaGaeGOmaidaaaGaay5waiaaw2faaiabcMcaPaaa@3BE0@</m:annotation></m:semantics></m:math> for the class "1" and from the bivariate Gaussian distribution <m:math name="1471-2105-8-67-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>N</m:mi><m:mo stretchy="false">(</m:mo><m:mrow><m:mo>[</m:mo><m:mrow><m:mtable><m:mtr><m:mtd><m:mrow><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:mtd></m:mtr><m:mtr><m:mtd><m:mrow><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:mtd></m:mtr></m:mtable></m:mrow><m:mo>]</m:mo></m:mrow><m:mo>,</m:mo><m:mrow><m:mo>[</m:mo><m:mrow><m:mtable><m:mtr><m:mtd><m:mn>1</m:mn></m:mtd><m:mtd><m:mrow><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:mtd></m:mtr><m:mtr><m:mtd><m:mrow><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:mtd><m:mtd><m:mn>2</m:mn></m:mtd></m:mtr></m:mtable></m:mrow><m:mo>]</m:mo></m:mrow><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGobGtcqGGOaakdaWadaqaauaabeqaceaaaeaacqGHsislcqaIXaqmaeaacqGHsislcqaIXaqmaaaacaGLBbGaayzxaaGaeiilaWYaamWaaeaafaqabeGacaaabaGaeGymaedabaGaeyOeI0IaeGymaedabaGaeyOeI0IaeGymaedabaGaeGOmaidaaaGaay5waiaaw2faaiabcMcaPaaa@3DBA@</m:annotation></m:semantics></m:math> for the class "-1". For those insignificant genes, each of them was independently generated from the standard normal distribution <it>N</it>(0,1). The probabilities for the class "1" and the class "-1" were equal. With this generative model, the Bayesian classifier for the two classes is a mathematical linear combination of the two prescribed significant genes.</p>
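               <p>The generative model of this linear case can be sketched in Python as follows. This is an illustrative reconstruction from the description above rather than the authors' simulation code; the function names, the random seed and the zero-based gene positions are assumptions. Because the two classes have equal prior probabilities and share one covariance matrix, the Bayes error is the standard normal CDF evaluated at minus half the Mahalanobis distance between the class means, which evaluates to about 0.013, consistent with the Bayesian bound quoted for this case in the results.</p>
               <p><![CDATA[
```python
import math

import numpy as np

MEAN_POS = np.array([1.0, 1.0])       # class "1" mean of the two significant genes
MEAN_NEG = np.array([-1.0, -1.0])     # class "-1" mean
COV = np.array([[1.0, -1.0], [-1.0, 2.0]])  # shared covariance matrix
SIGNIFICANT = [22, 56]                # zero-based positions of genes 23 and 57


def simulate(n_per_class, n_genes=200, seed=0):
    """Draw n_per_class samples per class; all other genes are N(0, 1) noise."""
    rng = np.random.default_rng(seed)
    y = np.concatenate([np.ones(n_per_class), -np.ones(n_per_class)])
    X = rng.standard_normal((2 * n_per_class, n_genes))
    X[:n_per_class, SIGNIFICANT] = rng.multivariate_normal(MEAN_POS, COV, n_per_class)
    X[n_per_class:, SIGNIFICANT] = rng.multivariate_normal(MEAN_NEG, COV, n_per_class)
    return X, y


def bayes_error():
    """Bayes error for two equiprobable Gaussians with a shared covariance."""
    diff = MEAN_POS - MEAN_NEG
    maha_sq = diff @ np.linalg.solve(COV, diff)   # squared Mahalanobis distance
    # Phi(-d / 2) written with the complementary error function.
    return 0.5 * math.erfc(math.sqrt(maha_sq) / (2.0 * math.sqrt(2.0)))


X_train, y_train = simulate(10)   # 20 training samples, 200 genes
```
]]></p>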
               <p>The KIGP method with a PK, a GK, or an LK was applied to each of the 10 training sets. The prior probability for <it>&#947;</it><sub><it>j </it></sub>= 1 for all <it>j </it>in the Gibbs sampling simulations was set at 0.01. For all the Gibbs sampling simulations in this example, we ran 5000 iterations in both the "kernel parameter fitting phase" and the "gene selection phase" and treated the first 1000 iterations as the burn-in period. In the "prediction phase", we ran 2000 iterations and treated the first 500 iterations as the burn-in period. The threshold for "fdr" in the "gene selection phase" was set at 0.05.</p>
               <p>For all of the 10 simulated training sets, when a PK was the kernel type for the KIGP method, the algorithm chose the PK(1) after the "kernel parameter fitting phase" and found both of the prescribed significant genes at the end of the "gene selection phase" (i.e. with no "false negative"). However, the KIGP with the PK(1) produced one "false positive" gene in 2 of the 10 sets. In the prediction phase, the average testing MR for the 8 sets in which the 2 preset significant genes were found with no "false positive" was 0.018, very close to the Bayesian bound (i.e. 0.013). However, the average testing MR for the 2 sets with one "false positive" was considerably worse, at 0.107. The average testing MR for all 10 sets was 0.036.</p>
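               <p>As a quick consistency check on the figures in the preceding paragraph, the overall testing MR is the average of the two group means weighted by the number of training sets in each group. The variable names in this short Python sketch are illustrative only.</p>
               <p><![CDATA[
```python
# Testing MR figures quoted in the preceding paragraph.
n_clean, mr_clean = 8, 0.018   # sets with no false positive
n_fp, mr_fp = 2, 0.107         # sets with one false positive

# Weighted average over all 10 training sets.
overall_mr = (n_clean * mr_clean + n_fp * mr_fp) / (n_clean + n_fp)
```
]]></p>
               <p>Rounded to three decimals, this reproduces the reported overall average of 0.036.</p>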
               <p>The results of the simulation studies with an LK were very similar to those of the simulations with the PK(1). In all the simulations, the KIGP found the 2 preset significant genes (i.e. with no "false negative"), but in 2 of the 10 sets, the algorithm produced one "false positive" as well. This result was exactly the same as that from the simulations with the PK(1). The average testing MR for the 10 sets with an LK was 0.037, almost the same as that with a PK.</p>
               <p>In the simulation studies with a GK, the algorithm found exactly the 2 prescribed significant genes in 6 of the 10 sets (i.e. no "false negative" and no "false positive"). In 3 other sets, the KIGP identified the 2 prescribed significant genes as well as one "false positive". In the remaining set, the KIGP identified only one of the two prescribed genes (i.e. with one "false negative") and one "false positive". The mean and the standard deviation of the fitted width of the GK for these 10 simulations were 1.95 and 0.31 respectively. The average testing MR for the 10 simulations with a GK was 0.104. Based on the testing MR measure, we should use the KIGP with either a polynomial kernel or a linear kernel for any further analysis of this problem.</p>
               <p>As an illustration, we display the results from applying the KIGP to one of the training sets, in which both a PK and a GK worked very well. For the simulation with the GK, the posterior probability density function (PDF) of the width parameter "r" is plotted in Fig. <figr fid="F2">2a</figr>, in which its mode was found at around 1.61. After the "kernel parameter fitting phase", the kernel was fixed as the GK(1.61). With the posterior samples obtained in the "gene selection phase", the NLF for each gene was calculated (Fig. <figr fid="F3">3c</figr>). Following the procedure described in the "Gene selection phase" subsection, the local fdr with respect to the NLF value was estimated (Fig. <figr fid="F2">2b</figr>). With the threshold for fdr set at 0.05, the cutoff value for NLF was 3.83, and only the two prescribed genes (indices: 23, 57) were found significant. The contours of the posterior predictive probabilities for the class "1" are plotted in Fig. <figr fid="F3">3d</figr>, where the X-axis is the value of the gene 23 and the Y-axis represents the value of the gene 57. In Fig. <figr fid="F3">3d</figr>, the numbers associated with the contour curves are probabilities; the asterisks denote the positive training samples and the circles represent the negative training samples; the dotted line shows the Bayesian classifier. The MR of the independent testing set for this simulation was 0.028. For the simulation with a PK, after the "kernel parameter fitting phase", the estimated posterior probability masses for the discrete degree parameter "d" were Prob(d = 1) = 0.797 and Prob(d = 2) = 0.203 respectively. With the highest estimated posterior mass at d = 1, we accordingly fixed the kernel as the PK(1). With the same gene-selection procedure described in the simulation with the GK, the two prescribed genes again were found as the only two significant genes (Fig. <figr fid="F3">3e</figr>). 
The contour plot of the posterior predictive probability for the class "1" is drawn in Fig. <figr fid="F3">3f</figr>. The testing MR was 0.017 for this simulation. The performance of the KIGP with the PK(1) was very similar to that of the KIGP with an LK (Fig. <figr fid="F3">3a</figr> and <figr fid="F3">3b</figr>). Both of them behaved like the linear Bayesian classifier. As a benchmark comparison, we further applied a regular SVM/RFE (SVM with RFE <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> as the gene selection strategy) to each of the 10 simulated training sets. In fact, other than using a cross-validation procedure, there is no effective way for an SVM/RFE to set the model parameter (such as the box constraint) and to select the number of significant genes. Technically, it is also important to mention that the SVM/RFE is not suitable for microarray data analysis with a kernel type having variable parameter(s), such as a Gaussian kernel. Nevertheless, for this linear example, we applied an SVM/RFE with an LK to the datasets and preset the box constraint to 1. The obtained results were similar to those of the KIGP with an LK. In 8 out of the 10 sets, the gene 23 and the gene 57 were ranked as the top 2 genes in the significant gene list. However, in the remaining 2 of the 10 sets, the gene 23 was ranked as the top significant gene but the gene 57 was ranked in the 3<sup>rd </sup>place and in the 5<sup>th </sup>place respectively. For the prediction with RFE, we used the top genes including the gene 23 and the gene 57. The resulting average testing MR for all 10 sets was 0.058. Even in this linear case, the KIGP with an LK or the PK(1) outperformed the SVM/RFE with an LK in an automatic fashion. More importantly, the SVM/RFE only made a binary prediction of the class for each testing sample, while the KIGP gave a probabilistic prediction on the certainty of the decision. 
Furthermore, the proposed KIGP framework offered the posterior distribution for each model parameter as well as a universal significance measure (NLF) for each investigated gene at the end.</p>
               <fig id="F2">
                  <title>
                     <p>Figure 2</p>
                  </title>
                  <caption>
                     <p>The results from applying the KIGP with a GK to one of the training sets of the simulated example 1, where (a) and (b) are for the linear case; (c) and (d) are for the non-linear case</p>
                  </caption>
                  <text>
                     <p>The results from applying the KIGP with a GK to one of the training sets of the simulated example 1, where (a) and (b) are for the linear case; (c) and (d) are for the non-linear case. (a) The estimated marginal posterior PDF of the width parameter of the GK (solid line) versus its prior PDF (dotted line). The mode of the posterior PDF is at around 1.61. (b) The local fdr with the GK(1.61) (with the standard normal as the density of NLF under the null hypothesis); the horizontal dotted line represents the threshold of the fdr (0.05); the vertical dotted line shows the resulting cutoff value for NLF (3.83). (c) The estimated marginal posterior PDF of the width parameter of the GK (solid line) versus its prior PDF (dotted line). The mode of the posterior PDF is at around 0.81. (d) The local fdr with the GK(0.81) (with the standard normal as the density of NLF under the null hypothesis); the horizontal dotted line represents the threshold of the fdr (0.05); the vertical dotted line shows the resulting cutoff value for NLF (3.68).</p>
                  </text>
                  <graphic file="1471-2105-8-67-2"/>
               </fig>
               <fig id="F3">
                  <title>
                     <p>Figure 3</p>
                  </title>
                  <caption>
                     <p>The results from applying the KIGP to one of the training sets of the linear case in the simulated example 1, where (a) and (b) are for the simulation with an LK; (c) and (d) are for the one with a GK; (e) and (f) are for the one with a PK</p>
                  </caption>
                  <text>
                     <p>The results from applying the KIGP to one of the training sets of the linear case in the simulated example 1, where (a) and (b) are for the simulation with an LK; (c) and (d) are for the one with a GK; (e) and (f) are for the one with a PK. (a) The NLF plot of each gene for the simulation with an LK; with the cutoff value for NLF (dotted line), two genes were found significant (the circles mark the preset significant genes). (b) The contours of the posterior predictive probability of the class "1" for the simulation with an LK, where the X-axis represents the value of gene 23 and the Y-axis represents the value of gene 57; the numbers associated with the contours are the probabilities; the asterisks denote the training samples from the class "1"; the circles denote the training samples from the class "-1"; the dotted line shows the Bayesian classifier. For this set of training samples, the testing MR is 0.022 (the Bayesian bound for MR is 0.013). (c) Same as (a) except that it is for the simulation with a GK. (d) Same as (b) except that it is for the simulation with a GK. The testing MR is 0.028. (e) Same as (a) except that it is for the simulation with a PK. (f) Same as (b) except that it is for the simulation with a PK. The testing MR is 0.017.</p>
                  </text>
                  <graphic file="1471-2105-8-67-3"/>
               </fig>
               <p>In the majority of the simulations, the KIGPs found the two preset significant genes in this linear case. They all performed very close to the Bayesian bound when the two preset genes were perfectly found. Since the KIGP with the PK(1) gave the best average testing MR, we should use it for any further analysis.</p>
            </sec>
            <sec>
               <st>
                  <p>(b) Case with a Non-linear Bayesian Classifier</p>
               </st>
               <p>In this non-linear case, the two preset significant genes were generated from a mixture Gaussian distribution with equal probability on <it>N</it>(<b>1</b><sub>2</sub>, <b>I</b><sub>2 </sub>*0.16) and <it>N</it>(-<b>1</b><sub>2</sub>, <b>I</b><sub>2 </sub>*0.16) for the class "1" and from an independent normal distribution <it>N</it>(0,0.16) for the class "-1". <b>1</b><sub>2 </sub>and <b>I</b><sub>2 </sub>denote the one-vector and the identity matrix respectively (defined in (7) of the "Methods" section). Each of the insignificant genes was independently drawn from the standard normal distribution <it>N</it>(0,1). The probabilities for the two classes were equal. The Bayesian classifier given the two significant genes looks like two parallel lines (Fig. <figr fid="F4">4</figr>) and the Bayesian bound for the MR is 0.055. We applied both the linear probit regression method proposed by Lee et al. <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> and a KIGP with an LK (such as in Fig. <figr fid="F4">4a</figr>) to the 10 training sets. Unsurprisingly, both of them failed badly in terms of finding the correct significant genes and making optimal class predictions for this non-linear case.</p>
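               <p>The "two parallel lines" shape of the Bayesian classifier and the size of the Bayesian bound can be checked in closed form. The sketch below is illustrative only and assumes that 0.16 is the per-gene variance (rather than the standard deviation); the function names are hypothetical and not part of the KIGP implementation. Under that assumption the likelihood ratio depends only on the sum of the two gene values.</p>

```python
import math

def normal_cdf(z):
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def bayes_error_nonlinear(var=0.16):
    """Bayes error for the two-gene non-linear case.

    Assumption: var is the per-gene variance (not the standard deviation).
    Class "1":  0.5*N(+1_2, var*I) + 0.5*N(-1_2, var*I)
    Class "-1": N(0_2, var*I), with equal class priors.
    """
    # The likelihood ratio depends only on s = x1 + x2:
    #   p(x | 1) / p(x | -1) = exp(-1/var) * cosh(s / var)
    # so the Bayes rule predicts class "1" when |s| exceeds s_star.
    s_star = var * math.acosh(math.exp(1.0 / var))
    sd_s = math.sqrt(2.0 * var)  # s is the sum of two independent genes

    # Class "-1": s ~ N(0, 2*var); an error occurs when |s| is above s_star.
    err_neg = 2.0 * (1.0 - normal_cdf(s_star / sd_s))
    # Class "1": by symmetry use the +1_2 component, s ~ N(2, 2*var);
    # an error occurs when s falls between -s_star and s_star.
    err_pos = normal_cdf((s_star - 2.0) / sd_s) - normal_cdf((-s_star - 2.0) / sd_s)
    return s_star, 0.5 * err_neg + 0.5 * err_pos

s_star, err = bayes_error_nonlinear()
print(f"boundary |x1 + x2| = {s_star:.3f}, Bayes error = {err:.3f}")
```

               <p>Under the stated assumption, the decision boundary is the pair of parallel lines on which the sum of the two gene values equals plus or minus the threshold, and the computed error should land close to the reported bound of 0.055.</p>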
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p>The results from applying the KIGP to one of the training sets for the non-linear case in the simulated example 1, where (a) and (b) are for the simulation with an LK; (c) and (d) are for the simulation with a GK; (e) and (f) are for the simulation with a PK</p>
                  </caption>
                  <text>
                     <p>The results from applying the KIGP to one of the training sets for the non-linear case in the simulated example 1, where (a) and (b) are for the simulation with an LK; (c) and (d) are for the simulation with a GK; (e) and (f) are for the simulation with a PK. All the legends are the same as those in Fig. 3. (a) The NLF plot of each gene for the simulation with an LK; with the cutoff value for NLF (dotted line), none of the true preset significant genes was found (2 false negatives). Three false positive genes were misclassified as significant. (b) The contours of the posterior predictive probability of the class "1" for the simulation with an LK (given the two true preset significant genes). For this set of training samples, the testing MR is 0.5 (the Bayesian bound is 0.055). (c) Same as (a) except that it is for the simulation with a GK. (d) Same as (b) except that it is for the simulation with a GK. The testing MR is 0.063. (e) Same as (a) except that it is for the simulation with a PK. (f) Same as (b) except that it is for the simulation with a PK. The testing MR is 0.060.</p>
                  </text>
                  <graphic file="1471-2105-8-67-4"/>
               </fig>
               <p>For the 10 simulated training sets, when a PK was the kernel type for the KIGP method, the algorithm chose the PK(1) for 5 sets and the PK(2) for the other 5 sets after the "kernel parameter fitting phase". In only 2 of the 5 sets with the PK(2) did the KIGP find the two prescribed genes as the only significant genes. The average testing MR over these 10 sets was correspondingly poor. However, for the two sets in which the two preset significant genes were correctly found, the testing MRs were both fairly close to the Bayesian bound.</p>
               <p>The results of the simulations with a GK were much better. For all of the 10 sets, the KIGP successfully found the 2 preset significant genes (i.e. with no "false negative"). For 2 of the sets, the KIGP additionally selected one "false positive". The mean and the standard deviation of the fitted width of the GK over these 10 sets were 0.71 and 0.08 respectively. In the "prediction phase", the average testing MR was 0.065 for the 8 sets in which only the 2 preset significant genes were found, which is very close to the Bayesian bound (i.e. 0.055). The average testing MR was 0.171 for the 2 sets with one "false positive". The average testing MR for all 10 sets was 0.086.</p>
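               <p>As a consistency check, the overall average follows from the two subgroup averages weighted by the number of training sets in each subgroup:</p>

```latex
\overline{\mathrm{MR}}_{\text{all}}
  = \frac{8 \times 0.065 + 2 \times 0.171}{10}
  = \frac{0.520 + 0.342}{10}
  = 0.0862 \approx 0.086
```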
               <p>As an illustration, we depict the results from applying the KIGP to one of the training sets, in which both a PK and a GK worked well. The procedure, all settings of the simulations, and the legends of the figures were the same as described in the linear case. We first applied a KIGP with a GK to the training set. The mode of the posterior PDF of the width parameter was found at around 0.81 after the "kernel parameter fitting phase" (Fig. <figr fid="F2">2c</figr>). With the GK(0.81), the cutoff value of 3.68 for NLF was obtained at the end of the "gene selection phase". Based on the NLF statistic, the two prescribed genes were successfully retrieved (Fig. <figr fid="F4">4c</figr>) and the KIGP performed well with MR = 0.063 (Fig. <figr fid="F4">4d</figr>), which is very close to the Bayesian bound. For the simulation with a PK, the posterior probability masses of the degree parameter were Prob(d = 1) = 0.229 and Prob(d = 2) = 0.771 respectively. The NLF plot for each gene and the corresponding cutoff line for the NLF are both displayed in Fig. <figr fid="F4">4e</figr>. The two prescribed genes were discovered. The performance of the KIGP with the PK(2) was very good, with MR = 0.060 (Fig. <figr fid="F4">4f</figr>), which is also very close to the Bayesian bound.</p>
               <p>We tried to apply the regular SVM with RFE to this example as we did in the linear case, but the SVM/RFE failed to work with an LK, with a GK (with any width), or with a PK. The key problem might be the large dimension (i.e. 200) of this example. Comparing the KIGP method to the SVM/RFE in this non-linear case, besides the beneficial properties of the KIGP that we already observed in the linear case, the KIGP method shows in particular its better adaptability to non-linear problems. In summary, owing to the non-linear setting of this case, none of the linear methods was applicable. The regular SVM/RFE approach also did not work. In contrast, in terms of the testing MR measure, the KIGP with a GK provided a performance very close to the Bayesian bound. By comparison, the KIGP with a PK seems to be less robust and consistent than the KIGP with a GK for non-linear problems in general.</p>
               <p>As a side note, it is worth pointing out that, when one applies the KIGP with a GK, the posterior PDF of the width parameter seems to reveal something about the nature of a dataset for a classification problem. For instance, we observed that if the underlying Bayesian classifier can be well approximated by a linear function, the mode (peak) of the PDF of the width parameter moves noticeably to the right side of the value 1 (Fig. <figr fid="F2">2a</figr>); whereas if the Bayesian classifier is very non-linear, it moves to the left side of the value 1 (Fig. <figr fid="F2">2c</figr>).</p>
            </sec>
            <sec>
               <st>
                  <p>Example 2</p>
               </st>
               <p>We further designed this example to demonstrate the effectiveness of the proposed KIGP method when the number of investigated genes is large, especially for a problem with a very non-linear Bayesian classifier. A total of 1000 genes were included in a simulated microarray experiment, and 10 of them were preset as the significant genes with indices [64,237,243,449,512,573,783,818,890,961]. These 10 significant genes were generated from the mixture Gaussian distribution with equal probability on <it>N</it>(<b>1</b><sub>10</sub>, <b>I</b><sub>10 </sub>*0.1) and <it>N</it>(-<b>1</b><sub>10</sub>, <b>I</b><sub>10 </sub>*0.1) for the class "1" and from the Gaussian distribution <it>N</it>(<b>0</b><sub>10</sub>, <b>I</b><sub>10 </sub>*0.1) for the class "-1", where <b>0</b><sub>10 </sub>denotes a vector with 10 "0" elements. The probabilities for the two classes were equal. The remaining insignificant genes were independently generated from the standard normal distribution <it>N</it>(0,1). Similar to the first example, the number of training samples was 20, of which 10 were generated from the class "1" and the other 10 from the class "-1"; the number of testing samples was 5000; we independently generated 10 sets of training samples from the model and ran the simulation on each of them.</p>
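               <p>The data-generating model described above can be sketched in a few lines. This is an illustrative reconstruction rather than the code used for the study: it assumes that 0.1 is a variance, that the listed gene indices are 1-based, and that the mixture component is drawn once per class "1" sample; the function name and the seed are hypothetical.</p>

```python
import random

# Indices of the preset significant genes, as listed in the text (assumed 1-based).
SIGNIFICANT = [64, 237, 243, 449, 512, 573, 783, 818, 890, 961]

def simulate_example2(n_per_class=10, n_genes=1000, var=0.1, seed=0):
    """Draw one training set; returns (X, y) with X as a list of rows."""
    rng = random.Random(seed)
    sd = var ** 0.5  # assumption: 0.1 is a variance, so sd is its square root
    sig_cols = [j - 1 for j in SIGNIFICANT]  # convert to 0-based columns
    X, y = [], []
    for label in [1] * n_per_class + [-1] * n_per_class:
        # Insignificant genes: independent N(0, 1).
        row = [rng.gauss(0.0, 1.0) for _ in range(n_genes)]
        if label == 1:
            # One mixture component is shared by all 10 significant genes of a sample.
            mean = rng.choice([1.0, -1.0])
        else:
            mean = 0.0
        for j in sig_cols:
            row[j] = rng.gauss(mean, sd)
        X.append(row)
        y.append(label)
    return X, y

X, y = simulate_example2()
print(len(X), len(X[0]), y.count(1), y.count(-1))  # prints: 20 1000 10 10
```

               <p>Because each class "1" sample sits near either the all-ones or the all-minus-ones corner while class "-1" sits at the origin, the per-gene class means nearly coincide, which is why a linear rule is expected to struggle here.</p>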
               <p>The procedure for this example is the same as in the non-linear case of the first example. The prior probability for <it>&#947;</it><sub><it>j </it></sub>= 1 was set at 0.01. For both the "kernel parameter fitting phase" and the "gene selection phase", we ran 20000 iterations and treated the first 10000 as the burn-in period, and for the "prediction phase", we ran 5000 iterations and treated the first 1000 as the burn-in period.</p>
               <p>For the 10 simulated training sets, when a PK was the kernel type for the KIGP method, the algorithm chose the PK(2) in 7 out of 10 sets. In only 2 of these 7 sets with the PK(2) did the algorithm find all 10 significant genes. In contrast, with a GK, all 10 prescribed genes were found in each of the 10 sets, and one "false positive" was brought into the significant group in one set. With the GK, there was almost no error on the testing samples, so the testing MR was extremely close to the Bayesian bound.</p>
               <p>In Fig. <figr fid="F5">5</figr>, we show the simulation results from applying the KIGP method to one of the training sets. Fig. <figr fid="F5">5a</figr> and <figr fid="F5">5b</figr> are for the simulation with a PK, whereas Fig. <figr fid="F5">5c</figr> and <figr fid="F5">5d</figr> are for the simulation with a GK. Based on Fig. <figr fid="F5">5a</figr>, the PK(2) was chosen after the "kernel parameter fitting phase". After the "gene selection phase", with the resulting cutoff line for the NLF, the KIGP found all 10 prescribed significant genes and one "false positive" (Fig. <figr fid="F5">5b</figr>). The MR of the testing set was 0.991. In the simulation with a GK, the mode of the posterior PDF for the width was found at around 0.64 (Fig. <figr fid="F5">5c</figr>). With the GK(0.64), after the "gene selection phase", all 10 prescribed genes were correctly found with no "false positive". With the significant genes found, we did not observe any testing error in the "prediction phase". Based on the testing MR, we should choose the GK for further analysis. This example not only illustrates the usefulness of the proposed algorithm for problems with a very large number of investigated genes, but also reinforces all the arguments we have made for the Bayesian KIGP framework in the last example.</p>
               <fig id="F5">
                  <title>
                     <p>Figure 5</p>
                  </title>
                  <caption>
                     <p>The results from applying the KIGP to one of the training sets of the simulated example 2, where (a) and (b) are for the simulation with a PK; (c) and (d) are for the simulation with a GK</p>
                  </caption>
                  <text>
                     <p>The results from applying the KIGP to one of the training sets of the simulated example 2, where (a) and (b) are for the simulation with a PK; (c) and (d) are for the simulation with a GK. (a) The estimated marginal posterior PMF of the degree parameter d. (b) The NLF plot of each gene for the simulation with the PK(2); the dots mark the prescribed significant genes. For this training set, all 10 preset significant genes and 1 false positive gene were found. (c) The estimated marginal posterior PDF of the width parameter r (solid line) versus its prior PDF (dotted line). The mode of the posterior PDF is at around 0.64. (d) The NLF plot for each gene for the simulation with the GK(0.64). The legends are the same as those in (b). For this training set, all 10 preset significant genes were found with no false positive result.</p>
                  </text>
                  <graphic file="1471-2105-8-67-5"/>
               </fig>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Real data studies</p>
            </st>
            <p>Following a procedure similar to that used in the simulated studies, the KIGP was applied to four published microarray gene expression datasets. A brief summary of these datasets is provided in Table <tblr tid="T1">1</tblr> and the experimental details are described below.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Summary of the real datasets studied in this paper</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Dataset</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Publication</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>p</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>n</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>M</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Response</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Leukemia</p>
                     </c>
                     <c ca="left">
                        <p>Golub et al. (1999) [1]</p>
                     </c>
                     <c ca="center">
                        <p>7129</p>
                     </c>
                     <c ca="center">
                        <p>38</p>
                     </c>
                     <c ca="center">
                        <p>34</p>
                     </c>
                     <c ca="left">
                        <p>ALL/AML</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>SRBCT</p>
                     </c>
                     <c ca="left">
                        <p>Khan et al. (2001) [27]</p>
                     </c>
                     <c ca="center">
                        <p>2308</p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>EWS/NB</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Breast Cancer</p>
                     </c>
                     <c ca="left">
                        <p>Hedenfalk et al. (2001) [28]</p>
                     </c>
                     <c ca="center">
                        <p>3226</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>BRCA1/BRCA2 or sporadic</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Colon</p>
                     </c>
                     <c ca="left">
                        <p>Alon et al. (1999) [30]</p>
                     </c>
                     <c ca="center">
                        <p>2000</p>
                     </c>
                     <c ca="center">
                        <p>62</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>Tumor/Normal tissue</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <sec>
               <st>
                  <p>Acute leukemia data</p>
               </st>
               <p>The leukemia dataset was originally published by Golub et al. <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, in which bone marrow or peripheral blood samples were taken from 72 patients with either acute myeloid leukemia (AML) or acute lymphoblastic leukemia (ALL). The data were divided into two independent sets: a training set and a testing set. The training set consists of 38 samples, of which 27 are ALL and 11 are AML. The testing set consists of 34 samples, of which 20 are ALL and 14 are AML. This dataset contains expression levels for 7129 human genes produced by Affymetrix high-density oligonucleotide microarrays. The scores in the dataset represent the intensity of gene expression after rescaling. Using a weighted voting scheme, Golub et al. made predictions for all 34 testing samples, and 5 of them were reported as misclassified.</p>
               <p>The KIGP was applied to the training dataset (including all investigated genes) with a GK, a PK, and an LK in turn. The prior parameter <it>&#960;</it><sub><it>j </it></sub>for all <it>j </it>was uniformly set at 0.001. In both the "kernel parameter fitting phase" and the "gene selection phase", we ran 30000 iterations and treated the first 15000 iterations as the burn-in period; and in the "prediction phase", we ran 5000 iterations and treated the first 1000 iterations as the burn-in period.</p>
               <p>For the simulation with a PK, the resulting posterior probability masses of the degree parameter d are Prob(d = 1) = 0.985 and Prob(d = 2) = 0.015. With the PK(1), 20 genes were identified as "significant" at the 0.05 significance level (Table <tblr tid="T3">3</tblr>). Using the PK(1) and the significant genes found, we made predictions for the 34 testing samples. We then ran a leave-one-out cross-validation (LOOCV) for the 38 training samples. This "loose" LOOCV procedure, however, involved only the "prediction phase". Since the fitted kernel parameter and the significant genes chosen in the first two phases already contained most of the information in the whole training dataset, it was not a proper validation measure for kernel type competition. As a more appropriate measure, we further ran a rigorous 3-fold cross-validation (3-fold CV) that included all 3 phases of the proposed algorithm (the details are described in the "Kernel type competition" subsection). This whole procedure was then repeated for the simulation with a GK and with an LK respectively. All the results are summarized in Table <tblr tid="T2">2</tblr>.</p>
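               <p>The difference between the "loose" and the "rigorous" cross-validation can be illustrated with a small stand-alone sketch. It does not implement the KIGP: the gene ranking (absolute difference of class means) and the classifier (nearest class centroid) are simple stand-ins chosen for this illustration, and all function names, sizes, and the seed are hypothetical. The only point is where gene selection happens relative to the held-out samples.</p>

```python
import random

def select_genes(X, y, k):
    """Stand-in gene selection: top-k genes by absolute difference of class means."""
    pos = [row for row, lab in zip(X, y) if lab == 1]
    neg = [row for row, lab in zip(X, y) if lab == -1]
    def score(j):
        return abs(sum(r[j] for r in pos) / len(pos) - sum(r[j] for r in neg) / len(neg))
    return sorted(range(len(X[0])), key=score, reverse=True)[:k]

def predict(X_train, y_train, x, genes):
    """Stand-in classifier: nearest class centroid on the selected genes."""
    best_label, best_dist = None, float("inf")
    for lab in (1, -1):
        rows = [row for row, lab_i in zip(X_train, y_train) if lab_i == lab]
        dist = sum((x[j] - sum(r[j] for r in rows) / len(rows)) ** 2 for j in genes)
        if best_dist > dist:
            best_label, best_dist = lab, dist
    return best_label

def cv_errors(X, y, folds, k, reselect):
    """Count held-out errors; reselect=True redoes gene selection inside every fold."""
    fixed = select_genes(X, y, k)  # selection on ALL samples (the "loose" variant)
    errors = 0
    for test_idx in folds:
        held_out = set(test_idx)
        train_idx = [i for i in range(len(y)) if i not in held_out]
        X_tr = [X[i] for i in train_idx]
        y_tr = [y[i] for i in train_idx]
        genes = select_genes(X_tr, y_tr, k) if reselect else fixed
        errors += sum(predict(X_tr, y_tr, X[i], genes) != y[i] for i in test_idx)
    return errors

# Pure-noise toy data: the labels carry no signal, so an honest estimate is near 50%.
rng = random.Random(1)
n, p = 38, 500
X = [[rng.gauss(0.0, 1.0) for _ in range(p)] for _ in range(n)]
y = [1 if i % 2 == 0 else -1 for i in range(n)]

loo_folds = [[i] for i in range(n)]
three_folds = [list(range(f, n, 3)) for f in range(3)]
loose = cv_errors(X, y, loo_folds, k=20, reselect=False)
rigorous = cv_errors(X, y, three_folds, k=20, reselect=True)
print(f"loose LOOCV errors: {loose}/{n}, rigorous 3-fold errors: {rigorous}/{n}")
```

               <p>On noise data of this kind, the loose variant tends to report too few errors because the held-out sample has already influenced which genes were kept. This is the reason the rigorous CV, which repeats every phase within each fold, is the more suitable basis for comparing kernel types.</p>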
               <tbl id="T2">
                  <title>
                     <p>Table 2</p>
                  </title>
                  <caption>
                     <p>Summary of the results from applying the proposed KIGP to the leukemia dataset.</p>
                  </caption>
                  <tblbdy cols="10">
                     <r>
                        <c ca="center">
                           <p>
                              <b>Performance Measure</b>
                           </p>
                        </c>
                        <c cspan="3" ca="center">
                           <p>
                              <b>Test</b>
                           </p>
                        </c>
                        <c cspan="3" ca="center">
                           <p>
                              <b>CV (3-fold)</b>
                           </p>
                        </c>
                        <c cspan="3" ca="center">
                           <p>
                              <b>LOOCV (fixed genes)</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c cspan="9">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>
                              <b>PK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>GK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>LK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>PK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>GK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>LK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>PK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>GK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>LK</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="10">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>
                              <b>ERR #</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>2/34</p>
                        </c>
                        <c ca="center">
                           <p>1/34</p>
                        </c>
                        <c ca="center">
                           <p>1/34</p>
                        </c>
                        <c ca="center">
                           <p>2/38</p>
                        </c>
                        <c ca="center">
                           <p>1/38</p>
                        </c>
                        <c ca="center">
                           <p>1/38</p>
                        </c>
                        <c ca="center">
                           <p>0/38</p>
                        </c>
                        <c ca="center">
                           <p>0/38</p>
                        </c>
                        <c ca="center">
                           <p>0/38</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>
                              <b>APP</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.858</p>
                        </c>
                        <c ca="center">
                           <p>0.835</p>
                        </c>
                        <c ca="center">
                           <p>0.923</p>
                        </c>
                        <c ca="center">
                           <p>0.844</p>
                        </c>
                        <c ca="center">
                           <p>0.819</p>
                        </c>
                        <c ca="center">
                           <p>0.875</p>
                        </c>
                        <c ca="center">
                           <p>0.995</p>
                        </c>
                        <c ca="center">
                           <p>0.928</p>
                        </c>
                        <c ca="center">
                           <p>0.993</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>The columns labeled by "Test" are for the independent tests.</p>
                     <p>The columns labeled "CV (3-fold)" are for the rigorous 3-fold CVs (each CV involves all three phases of a KIGP).</p>
                     <p>The columns labeled "LOOCV (fixed genes)" are for the loose LOOCVs (each CV only involves the "prediction phase" of a KIGP).</p>
                  </tblfn>
               </tbl>
               <tbl id="T3">
                  <title>
                     <p>Table 3</p>
                  </title>
                  <caption>
                     <p>Summary of the genes found by applying the KIGP with the LK to the leukemia dataset</p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c ca="center">
                           <p>
                              <b>Index</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>NLF</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Accession #</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>Gene Description</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>4847</p>
                        </c>
                        <c ca="center">
                           <p>11.47</p>
                        </c>
                        <c ca="center">
                           <p>X95735</p>
                        </c>
                        <c ca="left">
                           <p>Zyxin</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>3320</p>
                        </c>
                        <c ca="center">
                           <p>10.36</p>
                        </c>
                        <c ca="center">
                           <p>U50136</p>
                        </c>
                        <c ca="left">
                           <p>Leukotriene C4 synthase (LTC4S) gene</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>2020</p>
                        </c>
                        <c ca="center">
                           <p>9.79</p>
                        </c>
                        <c ca="center">
                           <p>M55150</p>
                        </c>
                        <c ca="left">
                           <p>FAH Fumarylacetoacetate</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>5039</p>
                        </c>
                        <c ca="center">
                           <p>9.63</p>
                        </c>
                        <c ca="center">
                           <p>Y12670</p>
                        </c>
                        <c ca="left">
                           <p>LEPR Leptin receptor</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1834</p>
                        </c>
                        <c ca="center">
                           <p>9.22</p>
                        </c>
                        <c ca="center">
                           <p>M23197</p>
                        </c>
                        <c ca="left">
                           <p>CD33 CD33 antigen (differentiation antigen)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>4499*</p>
                        </c>
                        <c ca="center">
                           <p>6.79</p>
                        </c>
                        <c ca="center">
                           <p>X70297</p>
                        </c>
                        <c ca="left">
                           <p>CHRNA7 Cholinergic receptor, nicotinic, alpha polypeptide 7</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1745</p>
                        </c>
                        <c ca="center">
                           <p>6.46</p>
                        </c>
                        <c ca="center">
                           <p>M16038</p>
                        </c>
                        <c ca="left">
                           <p>LYN V-yes-1 Yamaguchi sarcoma viral related oncogene homolog</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>3847</p>
                        </c>
                        <c ca="center">
                           <p>5.32</p>
                        </c>
                        <c ca="center">
                           <p>U82759</p>
                        </c>
                        <c ca="left">
                           <p>GB DEF = Homeodomain protein HoxA9 mRNA</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>4196</p>
                        </c>
                        <c ca="center">
                           <p>5.21</p>
                        </c>
                        <c ca="center">
                           <p>X17042</p>
                        </c>
                        <c ca="left">
                           <p>PRG1 Proteoglycan 1, secretory granule</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1779*</p>
                        </c>
                        <c ca="center">
                           <p>5.08</p>
                        </c>
                        <c ca="center">
                           <p>M19507</p>
                        </c>
                        <c ca="left">
                           <p>MPO Myeloperoxidase</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>6539</p>
                        </c>
                        <c ca="center">
                           <p>4.98</p>
                        </c>
                        <c ca="center">
                           <p>X85116</p>
                        </c>
                        <c ca="left">
                           <p>Epb72 gene exon 1</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>6376</p>
                        </c>
                        <c ca="center">
                           <p>4.80</p>
                        </c>
                        <c ca="center">
                           <p>M83652</p>
                        </c>
                        <c ca="left">
                           <p>PFC Properdin P factor, complement</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>3258</p>
                        </c>
                        <c ca="center">
                           <p>4.73</p>
                        </c>
                        <c ca="center">
                           <p>U46751</p>
                        </c>
                        <c ca="left">
                           <p>Phosphotyrosine independent ligand p62 for the Lck SH2 domain mRNA</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>2111</p>
                        </c>
                        <c ca="center">
                           <p>4.64</p>
                        </c>
                        <c ca="center">
                           <p>M62762</p>
                        </c>
                        <c ca="left">
                           <p>ATP6C Vacuolar H+ ATPase proton channel subunit</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1882</p>
                        </c>
                        <c ca="center">
                           <p>4.64</p>
                        </c>
                        <c ca="center">
                           <p>M27891</p>
                        </c>
                        <c ca="left">
                           <p>CST3 Cystatin C (amyloid angiopathy and cerebral hemorrhage)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1829*</p>
                        </c>
                        <c ca="center">
                           <p>4.59</p>
                        </c>
                        <c ca="center">
                           <p>M22960</p>
                        </c>
                        <c ca="left">
                           <p>PPGB Protective protein for beta-galactosidase (galactosialidosis)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1249</p>
                        </c>
                        <c ca="center">
                           <p>4.49</p>
                        </c>
                        <c ca="center">
                           <p>L08246</p>
                        </c>
                        <c ca="left">
                           <p>INDUCED MYELOID LEUKEMIA CELL DIFFERENTIATION PROTEIN MCL1</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>2121</p>
                        </c>
                        <c ca="center">
                           <p>4.41</p>
                        </c>
                        <c ca="center">
                           <p>M63138</p>
                        </c>
                        <c ca="left">
                           <p>CTSD Cathepsin D (lysosomal aspartyl protease)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>2288</p>
                        </c>
                        <c ca="center">
                           <p>4.28</p>
                        </c>
                        <c ca="center">
                           <p>M84526</p>
                        </c>
                        <c ca="left">
                           <p>DF D component of complement (adipsin)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1924*</p>
                        </c>
                        <c ca="center">
                           <p>4.28</p>
                        </c>
                        <c ca="center">
                           <p>M31158</p>
                        </c>
                        <c ca="left">
                           <p>PRKAR2B Protein kinase, cAMP-dependent, regulatory, type II, beta</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>*: Index of a gene not reported in [1].</p>
                  </tblfn>
               </tbl>
               <p>In Table <tblr tid="T2">2</tblr>, the KIGP with an LK gave the best testing performance, with only 1 testing error. Several publications (e.g. <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B12">12</abbr></abbrgrp> and <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>) reported the same testing error for this dataset. Only Zhou et al. <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> reported 0 testing errors. However, based on the results of <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, the testing APP was only 0.83, which is considerably worse than that of the KIGP with an LK (testing APP = 0.923). We suspect that the testing sample misclassified by the KIGP/LK may have been phenotyped incorrectly.</p>
               <p>The significant genes found by the KIGP with an LK are reported in Table <tblr tid="T3">3</tblr>, and the NLF plot is shown in Fig. <figr fid="F8">8a</figr>. In Table <tblr tid="T3">3</tblr>, the genes with asterisks (gene indices 4499, 1779, 1829 and 1924) are those not reported by the original paper <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. The heat map of the significant genes for all the samples (Fig. <figr fid="F6">6</figr>) exhibits very good consistency between the training set and the testing set (including the genes with asterisks). The posterior PDF of the width parameter of a GK can reveal the nature of the feature space for a given dataset and problem; Fig. <figr fid="F9">9a</figr> illustrates the dominant linearity of this case. Another issue that needs to be addressed is that, if the number of available samples is small (often true for a typical microarray application), the "number of testing errors" measure may have noticeable bias. Under this scenario, the APP is a more reliable measure than the number of testing errors. In this case, Table <tblr tid="T2">2</tblr> shows that the APP of the rigorous 3-fold CV is very consistent with that of the independent testing, whereas that of the "loose" LOOCV is not. This is a good example of how a "loose" LOOCV introduces the so-called "gene-selection bias".</p>
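               <p>The contrast between the two measures can be illustrated with a minimal sketch. It assumes that the APP is the average predictive probability assigned to the true class label of each sample, as defined earlier in the article; the function names, the 0.5 decision threshold and all numbers below are hypothetical and are not taken from any of the datasets.</p>
               <p>
```python
def error_count(probs_class1, labels, threshold=0.5):
    """Count samples whose predicted class differs from the true label (+1 or -1)."""
    errors = 0
    for p, y in zip(probs_class1, labels):
        predicted = 1 if p >= threshold else -1
        if predicted != y:
            errors += 1
    return errors


def average_predictive_probability(probs_class1, labels):
    """Average probability that the classifier assigns to each sample's true label.

    Assumption: this mirrors the APP measure used in the article.
    """
    total = 0.0
    for p, y in zip(probs_class1, labels):
        total += p if y == 1 else 1.0 - p
    return total / len(labels)


# Hypothetical predictive probabilities of class +1 for six test samples.
labels = [1, 1, 1, -1, -1, -1]
confident = [0.99, 0.98, 0.97, 0.02, 0.03, 0.60]  # one sample on the wrong side
hesitant = [0.55, 0.60, 0.58, 0.45, 0.42, 0.40]   # all correct, but barely

for name, probs in (("confident", confident), ("hesitant", hesitant)):
    errs = error_count(probs, labels)
    app = average_predictive_probability(probs, labels)
    print(name, "errors:", errs, "APP:", round(app, 3))
```
               </p>
               <p>In this made-up example the classifier with one error still has the higher APP, because the error count ignores how confidently each remaining sample is classified.</p>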
               <fig id="F6">
                  <title>
                     <p>Figure 6</p>
                  </title>
                  <caption>
                     <p>The heat map of the gene expression levels of the 20 found significant genes for the acute leukemia dataset</p>
                  </caption>
                  <text>
                     <p>The heat map of the gene expression levels of the 20 found significant genes for the acute leukemia dataset. The panel to the left of the solid line represents the training samples and the panel to the right shows the testing samples. The two dotted lines separate the two classes (ALL and AML).</p>
                  </text>
                  <graphic file="1471-2105-8-67-6"/>
               </fig>
               <fig id="F8">
                  <title>
                     <p>Figure 8</p>
                  </title>
                  <caption>
                     <p>The NLF plots for all 4 real data studies with found kernels</p>
                  </caption>
                  <text>
                     <p>The NLF plots for all 4 real data studies with found kernels. The legends are the same for all four plots. (a) The NLF plot of each gene with the LK for the leukemia dataset; the dots mark the 20 found significant genes, the details of which are listed in Table 3. (b) The NLF plot of each gene with the PK(1) for the SRBCT dataset; the details of the 15 found significant genes are listed in Table 5. (c) The NLF plot of each gene with the GK(3.19) for the breast cancer dataset; the details of the 9 found significant genes are listed in Table 7. (d) The NLF plot of each gene with the GK(2.38) for the colon dataset; the details of the 8 found significant genes are listed in Table 9.</p>
                  </text>
                  <graphic file="1471-2105-8-67-8"/>
               </fig>
               <fig id="F9">
                  <title>
                     <p>Figure 9</p>
                  </title>
                  <caption>
                     <p>The estimated marginal posterior PDF of the width parameter of a GK for each real data study case (dotted lines represent the prior PDF)</p>
                  </caption>
                  <text>
                     <p>The estimated marginal posterior PDF of the width parameter of a GK for each real data study case (dotted lines represent the prior PDF). (a) For the leukemia dataset, the mode of the posterior PDF is at around 2.79. (b) For the SRBCT dataset, the mode of the posterior PDF is at around 2.36. (c) For the breast cancer dataset, the mode of the posterior PDF is at around 3.19 (there is also a local peak on the left, at around 0.56). (d) For the colon dataset, the mode of the posterior PDF is at around 2.38.</p>
                  </text>
                  <graphic file="1471-2105-8-67-9"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Small round blue-cell tumor (SRBCT) data</p>
               </st>
               <p>The SRBCT data was originally published by Khan et al. <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. The tumor types include the Ewing family of tumors (EWS), rhabdomyosarcoma (RMS), neuroblastoma (NB) and non-Hodgkin lymphoma (NHL). The dataset of the four tumor types is composed of 2308 genes and 63 samples, and 25 blinded testing samples are also available. In this study, we focused on only two classes, EWS and NB. Thus, there are only 35 training samples (23 EWS and 12 NB) and 12 testing samples (6 EWS and 6 NB).</p>
               <p>We applied the same procedure to this dataset as in the leukemia data case. The computational settings were also almost the same, except that <it>&#960;</it><sub><it>j </it></sub>for all <it>j </it>was set at 0.003. The overall performance report is given in Table <tblr tid="T4">4</tblr>. The KIGP with the PK(1) performed best with respect to both the independent testing and the rigorous 3-fold CV. The 15 significant genes found by the KIGP with the PK(1) are listed in Table <tblr tid="T5">5</tblr>. The NLF plot is shown in Fig. <figr fid="F8">8b</figr>. The heat map of the significant genes for all samples is shown in Fig. <figr fid="F7">7</figr>. The posterior PDF of the width parameter of the GK is depicted in Fig. <figr fid="F9">9b</figr>. In Table <tblr tid="T5">5</tblr>, the genes with asterisks (gene indices 976, 823, 842, 437 and 1700) are those not reported by the original paper <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Based on the heat map (Fig. <figr fid="F7">7</figr>), with the exception of gene 823, the other 4 genes (gene indices 976, 842, 437 and 1700) are consistent between the training samples and the testing samples.</p>
               <tbl id="T4">
                  <title>
                     <p>Table 4</p>
                  </title>
                  <caption>
                     <p>Summary of the results from applying the proposed KIGP to the SRBCT dataset</p>
                  </caption>
                  <tblbdy cols="11">
                     <r>
                        <c ca="center">
                           <p>
                              <b>Performance Measure</b>
                           </p>
                        </c>
                        <c cspan="4" ca="center">
                           <p>
                              <b>Test</b>
                           </p>
                        </c>
                        <c cspan="3" ca="center">
                           <p>
                              <b>CV (3-fold)</b>
                           </p>
                        </c>
                        <c cspan="3" ca="center">
                           <p>
                              <b>LOOCV (fixed genes)</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c cspan="10">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>
                              <b>ANN</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>PK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>GK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>LK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>PK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>GK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>LK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>PK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>GK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>LK</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="11">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>
                              <b>ERR #</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0/12</p>
                        </c>
                        <c ca="center">
                           <p>0/12</p>
                        </c>
                        <c ca="center">
                           <p>0/12</p>
                        </c>
                        <c ca="center">
                           <p>0/12</p>
                        </c>
                        <c ca="center">
                           <p>0/35</p>
                        </c>
                        <c ca="center">
                           <p>2/35</p>
                        </c>
                        <c ca="center">
                           <p>0/35</p>
                        </c>
                        <c ca="center">
                           <p>0/35</p>
                        </c>
                        <c ca="center">
                           <p>0/35</p>
                        </c>
                        <c ca="center">
                           <p>0/35</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>
                              <b>APP</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.923</p>
                        </c>
                        <c ca="center">
                           <p>0.945</p>
                        </c>
                        <c ca="center">
                           <p>0.781</p>
                        </c>
                        <c ca="center">
                           <p>0.865</p>
                        </c>
                        <c ca="center">
                           <p>0.875</p>
                        </c>
                        <c ca="center">
                           <p>0.794</p>
                        </c>
                        <c ca="center">
                           <p>0.823</p>
                        </c>
                        <c ca="center">
                           <p>0.998</p>
                        </c>
                        <c ca="center">
                           <p>0.909</p>
                        </c>
                        <c ca="center">
                           <p>0.997</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>All the captions are the same as in Table 2.</p>
                     <p>"ANN" stands for the "artificial neural network" method used in [27].</p>
                  </tblfn>
               </tbl>
               <tbl id="T5">
                  <title>
                     <p>Table 5</p>
                  </title>
                  <caption>
                     <p>Summary of the genes found by applying the KIGP with PK(1) to the SRBCT dataset</p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c ca="center">
                           <p>
                              <b>Index</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>NLF</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Image ID</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>Gene Description</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>255</p>
                        </c>
                        <c ca="center">
                           <p>11.36</p>
                        </c>
                        <c ca="center">
                           <p>325182</p>
                        </c>
                        <c ca="left">
                           <p>cadherin 2, N-cadherin (neuronal)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>976*</p>
                        </c>
                        <c ca="center">
                           <p>10.88</p>
                        </c>
                        <c ca="center">
                           <p>786084</p>
                        </c>
                        <c ca="left">
                           <p>chromobox homolog 1 (Drosophila HP1 beta)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1389</p>
                        </c>
                        <c ca="center">
                           <p>10.19</p>
                        </c>
                        <c ca="center">
                           <p>770394</p>
                        </c>
                        <c ca="left">
                           <p>Fc fragment of IgG, receptor, transporter, alpha</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>742</p>
                        </c>
                        <c ca="center">
                           <p>9.28</p>
                        </c>
                        <c ca="center">
                           <p>812105</p>
                        </c>
                        <c ca="left">
                           <p>transmembrane protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>2144</p>
                        </c>
                        <c ca="center">
                           <p>8.12</p>
                        </c>
                        <c ca="center">
                           <p>308231</p>
                        </c>
                        <c ca="left">
                           <p>Homo sapiens incomplete cDNA for a mutated allele of a myosin class I, myh-1c</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>823*</p>
                        </c>
                        <c ca="center">
                           <p>7.53</p>
                        </c>
                        <c ca="center">
                           <p>134748</p>
                        </c>
                        <c ca="left">
                           <p>glycine cleavage system protein H (aminomethyl carrier)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>2050</p>
                        </c>
                        <c ca="center">
                           <p>6.61</p>
                        </c>
                        <c ca="center">
                           <p>295985</p>
                        </c>
                        <c ca="left">
                           <p>ESTs</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>842*</p>
                        </c>
                        <c ca="center">
                           <p>6.08</p>
                        </c>
                        <c ca="center">
                           <p>810057</p>
                        </c>
                        <c ca="left">
                           <p>cold shock domain protein A</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>545</p>
                        </c>
                        <c ca="center">
                           <p>5.27</p>
                        </c>
                        <c ca="center">
                           <p>1435862</p>
                        </c>
                        <c ca="left">
                           <p>antigen identified by monoclonal antibodies 12E7, F21 and O13</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>867</p>
                        </c>
                        <c ca="center">
                           <p>5.22</p>
                        </c>
                        <c ca="center">
                           <p>784593</p>
                        </c>
                        <c ca="left">
                           <p>ESTs</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>481</p>
                        </c>
                        <c ca="center">
                           <p>5.15</p>
                        </c>
                        <c ca="center">
                           <p>825411</p>
                        </c>
                        <c ca="left">
                           <p>N-acetylglucosamine receptor 1 (thyroid)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1662</p>
                        </c>
                        <c ca="center">
                           <p>4.82</p>
                        </c>
                        <c ca="center">
                           <p>377048</p>
                        </c>
                        <c ca="left">
                           <p>Homo sapiens incomplete cDNA for a mutated allele of a myosin class I, myh-1c</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1601</p>
                        </c>
                        <c ca="center">
                           <p>4.81</p>
                        </c>
                        <c ca="center">
                           <p>629896</p>
                        </c>
                        <c ca="left">
                           <p>microtubule-associated protein 1B</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>437*</p>
                        </c>
                        <c ca="center">
                           <p>4.42</p>
                        </c>
                        <c ca="center">
                           <p>448386</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1700*</p>
                        </c>
                        <c ca="center">
                           <p>4.20</p>
                        </c>
                        <c ca="center">
                           <p>796475</p>
                        </c>
                        <c ca="left">
                           <p>ESTs, Moderately similar to skeletal muscle LIM-protein FHL3 [H. sapiens]</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>*: Index of a gene not reported in [27].</p>
                  </tblfn>
               </tbl>
               <fig id="F7">
                  <title>
                     <p>Figure 7</p>
                  </title>
                  <caption>
                     <p>The heat map of the gene expression levels of the 15 found significant genes for the SRBCT dataset</p>
                  </caption>
                  <text>
                     <p>The heat map of the gene expression levels of the 15 found significant genes for the SRBCT dataset. All the legends are the same as those in Fig. 6, except that the two classes are EWS and NB.</p>
                  </text>
                  <graphic file="1471-2105-8-67-7"/>
               </fig>
               <p>As in the leukemia data case, the APP of the rigorous 3-fold CV is very consistent with that of the independent testing, while that of the "loose" LOOCV is rather biased. We also found that the KIGP with the PK(1) outperformed the artificial neural network (ANN) method <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> in terms of APP (both methods gave 0 testing errors).</p>
            </sec>
            <sec>
               <st>
                  <p>Breast cancer data</p>
               </st>
               <p>The hereditary breast cancer data used in this example were published by Hedenfalk et al. <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, who used cDNA microarrays in conjunction with classification algorithms to show the feasibility of using differences in global gene expression profiles to separate BRCA1 tumors from BRCA2/sporadic tumors. In total, 22 breast cancer tumors were examined: 7 with BRCA1, 8 with BRCA2 and 7 considered sporadic. For each sample, 3226 genes were investigated. We labeled the samples with BRCA1 as class "1" and the others as class "-1".</p>
               <p>The computational procedure and settings of this example are the same as those in the SRBCT case, except that there is no independent testing. To highlight the "gene-selection bias" problem, in addition to running a rigorous 3-fold CV procedure to measure the performance of each kernel type, we added a "loose" 3-fold CV procedure (like the "loose" LOOCV, this CV was run only in the "prediction phase"). The overall performance is reported in Table <tblr tid="T6">6</tblr>. Based on the rigorous 3-fold CV, we selected the GK(3.19) as the fitted kernel for this dataset. The posterior PDF of the width parameter is shown in Fig. <figr fid="F9">9c</figr>. We list the 9 significant genes found by the GK(3.19) in Table <tblr tid="T7">7</tblr>. Two genes (gene 1851 and gene 2893, marked with asterisks in Table <tblr tid="T7">7</tblr>) were not reported in the original paper <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. The NLF plot is shown in Fig. <figr fid="F8">8c</figr>.</p>
               <tbl id="T6">
                  <title>
                     <p>Table 6</p>
                  </title>
                  <caption>
                     <p>Summary of the results from applying the KIGP to the breast cancer dataset</p>
                  </caption>
                  <tblbdy cols="10">
                     <r>
                        <c ca="center">
                           <p>
                              <b>Performance Measure</b>
                           </p>
                        </c>
                        <c cspan="3" ca="center">
                           <p>
                              <b>CV (3-fold)</b>
                           </p>
                        </c>
                        <c cspan="3" ca="center">
                           <p>
                              <b>CV (3-fold, fixed genes)</b>
                           </p>
                        </c>
                        <c cspan="3" ca="center">
                           <p>
                              <b>LOOCV (fixed genes)</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c cspan="9">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>
                              <b>PK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>GK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>LK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>PK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>GK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>LK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>PK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>GK</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>LK</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="10">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>
                              <b>ERR #</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>4/22</p>
                        </c>
                        <c ca="center">
                           <p>3/22</p>
                        </c>
                        <c ca="center">
                           <p>5/22</p>
                        </c>
                        <c ca="center">
                           <p>0/22</p>
                        </c>
                        <c ca="center">
                           <p>0/22</p>
                        </c>
                        <c ca="center">
                           <p>0/22</p>
                        </c>
                        <c ca="center">
                           <p>0/22</p>
                        </c>
                        <c ca="center">
                           <p>0/22</p>
                        </c>
                        <c ca="center">
                           <p>0/22</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>
                              <b>APP</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.685</p>
                        </c>
                        <c ca="center">
                           <p>0.739</p>
                        </c>
                        <c ca="center">
                           <p>0.662</p>
                        </c>
                        <c ca="center">
                           <p>0.855</p>
                        </c>
                        <c ca="center">
                           <p>0.878</p>
                        </c>
                        <c ca="center">
                           <p>0.929</p>
                        </c>
                        <c ca="center">
                           <p>0.903</p>
                        </c>
                        <c ca="center">
                           <p>0.889</p>
                        </c>
                        <c ca="center">
                           <p>0.995</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>All the captions are the same as in Table 2, except that there is no independent testing.</p>
                     <p>The columns labeled "CV (3-fold, fixed genes)" are for the loose 3-fold CVs (each CV only involves the "prediction phase" of a KIGP).</p>
                  </tblfn>
               </tbl>
               <tbl id="T7">
                  <title>
                     <p>Table 7</p>
                  </title>
                  <caption>
                     <p>Summary of the genes found by applying the KIGP with GK(3.19) to the breast cancer dataset</p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c ca="center">
                           <p>
                              <b>Index</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>NLF</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Clone ID</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>Gene Description</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1999</p>
                        </c>
                        <c ca="center">
                           <p>4.44</p>
                        </c>
                        <c ca="center">
                           <p>247818</p>
                        </c>
                        <c ca="left">
                           <p>ESTs</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>2734</p>
                        </c>
                        <c ca="center">
                           <p>4.21</p>
                        </c>
                        <c ca="center">
                           <p>46019</p>
                        </c>
                        <c ca="left">
                           <p>minichromosome maintenance deficient (S. cerevisiae) 7</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1851*</p>
                        </c>
                        <c ca="center">
                           <p>4.06</p>
                        </c>
                        <c ca="center">
                           <p>293977</p>
                        </c>
                        <c ca="left">
                           <p>ESTs</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>585</p>
                        </c>
                        <c ca="center">
                           <p>3.89</p>
                        </c>
                        <c ca="center">
                           <p>293104</p>
                        </c>
                        <c ca="left">
                           <p>phytanoyl-CoA hydroxylase (Refsum disease)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>2423</p>
                        </c>
                        <c ca="center">
                           <p>3.85</p>
                        </c>
                        <c ca="center">
                           <p>26082</p>
                        </c>
                        <c ca="left">
                           <p>very low density lipoprotein receptor</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1443</p>
                        </c>
                        <c ca="center">
                           <p>3.85</p>
                        </c>
                        <c ca="center">
                           <p>566887</p>
                        </c>
                        <c ca="left">
                           <p>chromobox homolog 3 (Drosophila HP1 gamma)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>2893*</p>
                        </c>
                        <c ca="center">
                           <p>3.81</p>
                        </c>
                        <c ca="center">
                           <p>32790</p>
                        </c>
                        <c ca="left">
                           <p>mutS (E. coli) homolog 2 (colon cancer, nonpolyposis type 1)</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1068</p>
                        </c>
                        <c ca="center">
                           <p>3.81</p>
                        </c>
                        <c ca="center">
                           <p>840702</p>
                        </c>
                        <c ca="left">
                           <p>SELENOPHOSPHATE SYNTHETASE ; Human selenium donor protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1008</p>
                        </c>
                        <c ca="center">
                           <p>3.81</p>
                        </c>
                        <c ca="center">
                           <p>897781</p>
                        </c>
                        <c ca="left">
                           <p>keratin 8</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>*: Index of the genes not reported in [28].</p>
                  </tblfn>
               </tbl>
               <p>It is not surprising that the overall performance of the KIGP with an LK or the PK(1) was not good, since there is an unusual local peak on the left side of the posterior PDF of the width parameter r (Fig. <figr fid="F9">9c</figr>). Such a local peak usually implies the existence of non-linearity in the data for the given problem. A plausible explanation for this phenomenon can be found in <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, in which Efron showed, based on a large-scale simultaneous 2-sample t-test, that the empirical null of this dataset was significantly different from its theoretical null; he argued that this was probably because the experimental methodology used in the original paper had induced substantial correlations among the various microarrays.</p>
               <p>This example also illustrates the "gene-selection bias" problem well. In Table <tblr tid="T6">6</tblr>, with the significant genes selected by training on all the available samples, the performance of the KIGP with an LK in the "loose" 3-fold CV was much better than that of the KIGP with a GK. However, in the rigorous 3-fold CV, the KIGP with an LK gave very poor predictive performance, while the KIGP with a GK still worked reasonably well.</p>
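               <p>The difference between the two CV protocols can be illustrated with a minimal sketch. It is not the KIGP itself: a simple t-like score stands in for the "gene selection phase", a nearest-centroid rule stands in for the "prediction phase", and the stratified fold construction, the function names and the pure-noise data are all assumptions made for illustration. In the "loose" CV the genes are selected once from all samples; in the rigorous CV they are re-selected from the training part of every fold.</p>

```python
import random
from statistics import mean, pvariance

def top_genes(X, y, k):
    """Indices of the k genes (columns) with the largest absolute t-like score."""
    pos = [row for row, label in zip(X, y) if label == 1]
    neg = [row for row, label in zip(X, y) if label == -1]
    scores = []
    for g in range(len(X[0])):
        a = [row[g] for row in pos]
        b = [row[g] for row in neg]
        spread = (pvariance(a) / len(a) + pvariance(b) / len(b)) ** 0.5 + 1e-12
        scores.append(abs(mean(a) - mean(b)) / spread)
    return sorted(range(len(scores)), key=lambda g: -scores[g])[:k]

def centroid_predict(X_train, y_train, x, genes):
    """Nearest-centroid rule on the selected genes (stand-in classifier)."""
    dist = {}
    for label in (1, -1):
        rows = [r for r, l in zip(X_train, y_train) if l == label]
        dist[label] = sum((x[g] - mean(r[g] for r in rows)) ** 2 for g in genes)
    return 1 if dist[-1] > dist[1] else -1

def stratified_folds(y, n_folds, rng):
    """Deal the shuffled samples of each class round-robin into the folds."""
    folds = [[] for _ in range(n_folds)]
    for label in (1, -1):
        idx = [i for i, l in enumerate(y) if l == label]
        rng.shuffle(idx)
        for j, i in enumerate(idx):
            folds[j % n_folds].append(i)
    return folds

def cv_errors(X, y, folds, k, rigorous):
    """Count CV errors; gene selection is inside the loop only if rigorous."""
    fixed = None if rigorous else top_genes(X, y, k)  # loose: uses all samples
    errors = 0
    for test_idx in folds:
        held_out = set(test_idx)
        train = [i for i in range(len(y)) if i not in held_out]
        X_tr = [X[i] for i in train]
        y_tr = [y[i] for i in train]
        genes = top_genes(X_tr, y_tr, k) if rigorous else fixed
        for i in test_idx:
            errors += centroid_predict(X_tr, y_tr, X[i], genes) != y[i]
    return errors

# Pure-noise toy data with the class sizes of the breast cancer example.
rng = random.Random(0)
y = [1] * 7 + [-1] * 15
X = [[rng.gauss(0.0, 1.0) for _ in range(300)] for _ in y]
folds = stratified_folds(y, 3, rng)
print("loose errors:   ", cv_errors(X, y, folds, 9, rigorous=False))
print("rigorous errors:", cv_errors(X, y, folds, 9, rigorous=True))
```

               <p>Because the toy genes carry no class information, any apparent accuracy of the loose protocol on such data comes from selecting genes with knowledge of the held-out labels, which is the bias discussed above.</p>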
            </sec>
            <sec>
               <st>
                  <p>Colon data</p>
               </st>
               <p>This dataset was originally published by Alon et al. <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, and Dettling <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> has reported the performance of many state-of-the-art learning methods applied to it. We applied the KIGP method to this dataset to allow a side-by-side performance comparison with other methods. Dettling <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> pre-filtered the genes based on the Wilcoxon test statistic and ran all the simulations within a 200-gene pool. However, based on the reported procedure, this should not introduce much gene-selection bias. Therefore, it is a good dataset for comparing different microarray data analysis methods.</p>
               <p>We applied the proposed KIGP to the whole dataset without any pre-filtering, to preclude any possible gene-selection bias. The computational procedure and settings of this example are very similar to those in the SRBCT case, except that there is no independent testing. For the cross-validation procedure, we ran 5 independent simulations and reported the average of the results to reduce possible data-split bias. The data-splitting procedure is described in the "Kernel type competition" subsection. We did this for each kernel type: PK, GK and LK. The resulting performance and the performance of the other methods (reported by <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>) are summarized in Table <tblr tid="T8">8</tblr>. We found that the MR of the KIGP with a GK is very close to that of the best classifier in the list (PAM in this case). It is worth mentioning that PAM's performance was ranked from average to significantly worse than that of the 6 other methods, especially compared to kernel-induced methods such as the SVM, on other published real datasets (such as the leukemia dataset, the prostate dataset and the lymphoma dataset <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>). The KIGP method with an appropriate kernel is at least no worse than the SVM.</p>
               <tbl id="T8">
                  <title>
                     <p>Table 8</p>
                  </title>
                  <caption>
                     <p>Summary of the performance comparison on applying different classifiers to the colon dataset</p>
                  </caption>
                  <tblbdy cols="2">
                     <r>
                        <c ca="left">
                           <p>
                              <b>Classifier</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>MR</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="2">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>KIGP(PK)</p>
                        </c>
                        <c ca="center">
                           <p>0.166</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>KIGP(GK)</p>
                        </c>
                        <c ca="center">
                           <p>0.129</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>KIGP(LK)</p>
                        </c>
                        <c ca="center">
                           <p>0.198</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>BagBoost</p>
                        </c>
                        <c ca="center">
                           <p>0.161</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Boosting</p>
                        </c>
                        <c ca="center">
                           <p>0.191</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>RF</p>
                        </c>
                        <c ca="center">
                           <p>0.149</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>SVM</p>
                        </c>
                        <c ca="center">
                           <p>0.151</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PAM</p>
                        </c>
                        <c ca="center">
                           <p>0.119</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>DLDA</p>
                        </c>
                        <c ca="center">
                           <p>0.129</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>kNN</p>
                        </c>
                        <c ca="center">
                           <p>0.164</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>For the KIGP with each of the three kernel types (PK, GK, LK), we ran 5 independent rigorous 3-fold CVs on the 62 samples (each CV involves all 3 phases of a KIGP) and reported the average MR. For the 7 reference classifiers, the results and the experimental details were originally reported by [16].</p>
                     <p>RF: "Random Forests"</p>
                     <p>PAM: "Nearest shrunken centroids"</p>
                     <p>DLDA: "diagonal linear discriminant analysis"</p>
                  </tblfn>
               </tbl>
               <p>Based on the MR of the rigorous 3-fold CV, we selected the GK as the winning kernel type. We then ran the KIGP with a GK on all the available samples. After the "kernel parameter fitting phase", based on the posterior PDF of the width parameter (Fig. <figr fid="F9">9d</figr>), we fixed the kernel as the GK(2.38). The resulting NLF plot with the GK(2.38) after the "gene selection phase" is depicted in Fig. <figr fid="F8">8d</figr>. The indices of the 8 identified significant genes are provided in Table <tblr tid="T9">9</tblr>.</p>
               <tbl id="T9">
                  <title>
                     <p>Table 9</p>
                  </title>
                  <caption>
                     <p>Summary of the genes found by applying the KIGP with GK(2.38) to the colon dataset</p>
                  </caption>
                  <tblbdy cols="2">
                     <r>
                        <c ca="center">
                           <p>
                              <b>Index</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>NLF</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="2">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>377</p>
                        </c>
                        <c ca="center">
                           <p>6.54</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>493</p>
                        </c>
                        <c ca="center">
                           <p>6.25</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>249</p>
                        </c>
                        <c ca="center">
                           <p>5.48</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>267</p>
                        </c>
                        <c ca="center">
                           <p>4.51</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>245</p>
                        </c>
                        <c ca="center">
                           <p>4.34</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>765</p>
                        </c>
                        <c ca="center">
                           <p>4.29</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>513</p>
                        </c>
                        <c ca="center">
                           <p>3.94</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>14</p>
                        </c>
                        <c ca="center">
                           <p>3.88</p>
                        </c>
                     </r>
                  </tblbdy>
               </tbl>
               <p>Another interesting finding of this experiment is that, based on the results of the "loose" CV, the KIGP/LK performed better than the KIGP/GK for this dataset. However, with multiple rigorous 3-fold CVs, the KIGP/GK turned out to be the more reliable choice for this problem. When we checked the heat map of the significant gene set identified by the KIGP/GK (Table <tblr tid="T9">9</tblr>), we found that a few samples, in particular samples #18, #20, #45, #49 and #56, are significantly different from the other samples in their labeled class, yet very consistent with the samples in the opposite class. In fact, these samples were also almost always misclassified by the KIGP in the multiple rigorous 3-fold CV tests. We therefore suspect that these samples were mistakenly phenotyped. We think that this is probably why none of the other learning methods listed in Table <tblr tid="T8">8</tblr> performs well on this colon dataset. This also supports the view that a KIGP/GK is less sensitive to mislabeled training samples than a KIGP/LK.</p>
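               <p>The check for samples that are "almost always misclassified" across repeated CV runs can be sketched as a simple tally. The function name, the 0.8 threshold and the toy input below are assumptions for illustration only; they are not the paper's results or its actual procedure.</p>

```python
from collections import Counter

def flag_suspect_samples(misclassified_per_run, threshold=0.8):
    """Return sample ids misclassified in at least `threshold` of the CV runs."""
    n_runs = len(misclassified_per_run)
    # Count each sample at most once per run.
    counts = Counter(s for run in misclassified_per_run for s in set(run))
    return sorted(s for s, c in counts.items() if c / n_runs >= threshold)

# Toy input: ids misclassified in each of 5 hypothetical CV runs.
runs = [[3, 7], [3, 7, 11], [3], [3, 7], [3, 7, 2]]
print(flag_suspect_samples(runs))  # [3, 7]
```

               <p>Samples flagged in this way are only candidates for re-examination; the tally alone cannot distinguish a mislabeled sample from one that is simply hard to classify.</p>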
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussions and Conclusion</p>
         </st>
         <p>This work was motivated by the data analysis challenges posed by microarray gene expression experiments and by the mathematical beauty of the kernel-imbedding approach in its ability to solve a non-linear classification problem in the feature space rather than in the observation space. We have presented a unified supervised learning model named kernel-imbedded Gaussian process (KIGP) under a hierarchical Bayesian framework. This model was specifically designed for automatic learning and profiling of microarray gene expression patterns. In the simulated examples, without any knowledge of the underlying generative model, the proposed KIGP method has been shown to perform very close to the Bayesian bound, not only in the linear case but also in the non-linear case.</p>
         <p>With a probit regression setting and the introduction of latent variables, the KIGP model was set up for a binary disease classification problem. An algorithm with a cascading structure was proposed to solve this problem, and a Gibbs sampler was built as the mechanical core for the Bayesian inference. Given a kernel type such as a Gaussian kernel or a polynomial kernel, the algorithm takes the training data as input and outputs the fitted parameter(s) of the kernel type and a set of significant genes. The algorithm also offers a probabilistic class prediction for each sample. The proposed KIGP can explore not only the linear but also the potential non-linear relationship between the target disease and its associated explanatory genes. Compared to the regular SVM (a very popular kernel-induced learning method), the proposed KIGP has two advantages. First, the probabilistic class prediction from the KIGP could be insightful for borderline cases in real-world applications. Second, the KIGP method implements a specific procedure for tuning the kernel parameter(s) (such as the width parameter of a Gaussian kernel or the degree parameter of a polynomial kernel) and the model parameters (such as the variance of the noise term). Tuning parameters has always been one of the key issues for non-linear parametric learning methods. The results of the simulated examples show that the KIGP significantly outperformed the regular SVM method with RFE as a gene selection strategy in a non-linear case, and that it also provided more useful information, such as the posterior PDF of the parameters, for further prediction and analysis. Computationally, the KIGP has also proven to be robust, and it is therefore well suited to a Gibbs sampling system. Both the simulated examples and the real data studies have demonstrated the effectiveness of the proposed method.</p>
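         <p>As a rough illustration of the probit setting with latent variables, the sketch below uses the standard data-augmentation construction: a label is the sign of a latent variable that equals a function value plus Gaussian noise. The function names, the unit default noise and the rejection sampler are assumptions for illustration; the exact conditional distributions used by the KIGP are those given in the "Methods" section, not this code.</p>

```python
import math
import random

def probit_probability(f, noise_sd=1.0):
    """P(y = 1) when y = sign(f + e) and e ~ N(0, noise_sd^2)."""
    return 0.5 * (1.0 + math.erf(f / (noise_sd * math.sqrt(2.0))))

def draw_latent(f, y, rng, noise_sd=1.0):
    """Draw z ~ N(f, noise_sd^2) restricted to the side of zero given by y.

    Plain rejection sampling: simple, but slow when f lies far on the
    wrong side of zero for the observed label y.
    """
    while True:
        z = rng.gauss(f, noise_sd)
        if z * y > 0:
            return z

rng = random.Random(1)
print(round(probit_probability(1.0), 3))  # 0.841
print(draw_latent(0.3, -1, rng))          # a negative latent value
```

         <p>A probability near 0.5 marks a borderline sample, which is the kind of information a hard class label alone does not convey.</p>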
         <p>There are still a few interesting problems left for future research. For example, although the KIGP in this study was developed only for binary classification, it can easily be extended to multi-class classification based on a multinomial probit regression setting. Other problems, however, are not only challenging but also critical. First, kernel type competition is still a tough issue. The predictive fit measure discussed in the "Methods" section is simple to formulate, but it may be problematic when an independent testing set is not available and/or there are many candidate kernel types. We are currently addressing this issue by implementing a reversible jump Markov Chain Monte Carlo (RJMCMC) algorithm as a simultaneous integrative approach for kernel type selection within the KIGP framework. Another important problem is the independent prior assumption on the elements of the gene-selection vector <b>&#947; </b>and the "component-wise drawing" strategy used to sample it. Although this will eventually lead to convergence according to MCMC theory, it may take a very long time if the true underlying explanatory genes are highly correlated with each other. Therefore, a proper kernel-induced clustering algorithm under a suitable generative model would be helpful in this regard. Furthermore, if a more appropriate prior for <b>&#947; </b>can be found, the dependency between genes can be incorporated into the whole framework by sampling <b>&#947; </b>in a block-wise rather than a component-wise fashion. This should then substantially speed up convergence.</p>
         <p>Interestingly, building a kernel based on the features of the given data and the classification problem is the ideal way to take full advantage of a kernel-induced learning algorithm. For example, if an appropriate generative model is available for the given dataset, a class of kernels named "natural kernels" is applicable in this context. This problem and the pre-clustering problem mentioned above seemingly share many fundamental elements. However, further investigation of this is beyond the scope of this paper.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Problem formulation</p>
            </st>
            <p>We consider a binary classification problem. Suppose there are <it>n </it>training samples, and let <b>y </b>= [<it>y</it><sub>1</sub>, <it>y</it><sub>2</sub>,...,<it>y</it><sub><it>n</it></sub>]' denote the class labels, where <it>y</it><sub><it>i </it></sub>= 1 indicates that sample <it>i </it>belongs to class "I" and <it>y</it><sub><it>i </it></sub>= -1 indicates that it does not, for <it>i </it>= 1,2,...,<it>n</it>. For each sample, <it>p </it>genes are investigated, and we define the gene expression matrix <b>X </b>as</p>
            <p>
               <m:math name="1471-2105-8-67-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>X</m:mi>
                        <m:mo>=</m:mo>
                        <m:mrow>
                           <m:mo>[</m:mo>
                           <m:mrow>
                              <m:mtable>
                                 <m:mtr>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:mtext>Gene&#160;</m:mtext>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:mtext>Gene&#160;</m:mtext>
                                          <m:mn>2</m:mn>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mo>&#8230;</m:mo>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:mtext>Gene&#160;</m:mtext>
                                          <m:mi>p</m:mi>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>X</m:mi>
                                             <m:mrow>
                                                <m:mn>11</m:mn>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>X</m:mi>
                                             <m:mrow>
                                                <m:mn>12</m:mn>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mo>&#8230;</m:mo>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>X</m:mi>
                                             <m:mrow>
                                                <m:mn>1</m:mn>
                                                <m:mi>p</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr>
                                    <m:mtd>
                                       <m:mo>&#8942;</m:mo>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mo>&#8942;</m:mo>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mo>&#8945;</m:mo>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mo>&#8942;</m:mo>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>X</m:mi>
                                             <m:mrow>
                                                <m:mi>n</m:mi>
                                                <m:mn>1</m:mn>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>X</m:mi>
                                             <m:mrow>
                                                <m:mi>n</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mo>&#8943;</m:mo>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>X</m:mi>
                                             <m:mrow>
                                                <m:mi>n</m:mi>
                                                <m:mi>p</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                           </m:mrow>
                           <m:mo>]</m:mo>
                        </m:mrow>
                        <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mn>1</m:mn>
                           <m:mo>)</m:mo>
                        </m:mrow>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacqWFybawcqGH9aqpdaWadaqaauaabeqaeqaaaaaabaGaee4raCKaeeyzauMaeeOBa4MaeeyzauMaeeiiaaIaeGymaedabaGaee4raCKaeeyzauMaeeOBa4MaeeyzauMaeeiiaaIaeGOmaidabaGaeSOjGSeabaGaee4raCKaeeyzauMaeeOBa4MaeeyzauMaeeiiaaIaemiCaahabaGaemiwaG1aaSbaaSqaaiabigdaXiabigdaXaqabaaakeaacqWGybawdaWgaaWcbaGaeGymaeJaeGOmaidabeaaaOqaaiablAcilbqaaiabdIfaynaaBaaaleaacqaIXaqmcqWGWbaCaeqaaaGcbaGaeSO7I0eabaGaeSO7I0eabaGaeSy8I8eabaGaeSO7I0eabaGaemiwaG1aaSbaaSqaaiabd6gaUjabigdaXaqabaaakeaacqWGybawdaWgaaWcbaGaemOBa4MaeGOmaidabeaaaOqaaiabl+UimbqaaiabdIfaynaaBaaaleaacqWGUbGBcqWGWbaCaeqaaaaaaOGaay5waiaaw2faaiaaxMaacaWLjaWaaeWaaeaacqaIXaqmaiaawIcacaGLPaaaaaa@6BCC@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>The data matrix <b>X </b>should usually be normalized gene by gene (i.e., for each column of <b>X</b>). To handle the gene selection problem, we further define the gene-selection vector <b>&#947; </b>as:</p>
            <p>
               <m:math name="1471-2105-8-67-i4" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>&#947;</m:mi>
                        <m:mo>=</m:mo>
                        <m:msup>
                           <m:mrow>
                              <m:mo stretchy="false">[</m:mo>
                              <m:msub>
                                 <m:mi>&#947;</m:mi>
                                 <m:mn>1</m:mn>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>&#947;</m:mi>
                                 <m:mn>2</m:mn>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:mn>...</m:mn>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>&#947;</m:mi>
                                 <m:mi>p</m:mi>
                              </m:msub>
                              <m:mo stretchy="false">]</m:mo>
                           </m:mrow>
                           <m:mo>&#8242;</m:mo>
                        </m:msup>
                        <m:mo>,</m:mo>
                        <m:mtext>&#160;where&#160;</m:mtext>
                        <m:msub>
                           <m:mi>&#947;</m:mi>
                           <m:mi>j</m:mi>
                        </m:msub>
                        <m:mo>=</m:mo>
                        <m:mrow>
                           <m:mo>{</m:mo>
                           <m:mrow>
                              <m:mtable>
                                 <m:mtr>
                                    <m:mtd>
                                       <m:mn>1</m:mn>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:mtext>if&#160;the&#160;jth&#160;gene&#160;is&#160;selected</m:mtext>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr>
                                    <m:mtd>
                                       <m:mn>0</m:mn>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:mtext>otherwise</m:mtext>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                              <m:mo>,</m:mo>
                              <m:mi>j</m:mi>
                              <m:mo>=</m:mo>
                              <m:mn>1</m:mn>
                              <m:mo>,</m:mo>
                              <m:mo>&#8230;</m:mo>
                              <m:mo>,</m:mo>
                              <m:mi>p</m:mi>
                           </m:mrow>
                        </m:mrow>
                        <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mn>2</m:mn>
                           <m:mo>)</m:mo>
                        </m:mrow>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiqacqWFZoWzcqGH9aqpcqGGBbWwiiGacqGFZoWzdaWgaaWcbaGaeGymaedabeaakiabcYcaSiab+n7aNnaaBaaaleaacqaIYaGmaeqaaOGaeiilaWIaeiOla4IaeiOla4IaeiOla4IaeiilaWIae43SdC2aaSbaaSqaaiabdchaWbqabaGccqGGDbqxdaahaaWcbeqaaOGamai0gkdiIcaacqGGSaalcqqGGaaicqqG3bWDcqqGObaAcqqGLbqzcqqGYbGCcqqGLbqzcqqGGaaicqGFZoWzdaWgaaWcbaGaemOAaOgabeaakiabg2da9maaceqabaqbaeqabiGaaaqaaiabigdaXaqaaiabbMgaPjabbAgaMjabbccaGiabbsha0jabbIgaOjabbwgaLjabbccaGiabbQgaQjabbsha0jabbIgaOjabbccaGiabbEgaNjabbwgaLjabb6gaUjabbwgaLjabbccaGiabbMgaPjabbohaZjabbccaGiabbohaZjabbwgaLjabbYgaSjabbwgaLjabbogaJjabbsha0jabbwgaLjabbsgaKbqaaiabicdaWaqaaiabb+gaVjabbsha0jabbIgaOjabbwgaLjabbkhaYjabbEha3jabbMgaPjabbohaZjabbwgaLbaacqGGSaalcqWGQbGAcqGH9aqpcqaIXaqmcqGGSaalcqqGGaaicqGGUaGlcqGGUaGlcqWGWbaCaiaawUhaaiaaxMaacaWLjaWaaeWaaeaacqaIYaGmaiaawIcacaGLPaaaaaa@8E8E@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p><b>X</b><sub><b>&#947; </b></sub>is defined as the gene expression matrix restricted to the genes selected by the gene-selection vector <b>&#947;</b>, that is,</p>
            <p>
               <m:math name="1471-2105-8-67-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:msub>
                           <m:mi>X</m:mi>
                           <m:mi>&#947;</m:mi>
                        </m:msub>
                        <m:mo>=</m:mo>
                        <m:mrow>
                           <m:mo>[</m:mo>
                           <m:mrow>
                              <m:mtable columnalign="left">
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>X</m:mi>
                                             <m:mrow>
                                                <m:mi>&#947;</m:mi>
                                                <m:mo>,</m:mo>
                                                <m:mn>11</m:mn>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo>,</m:mo>
                                          <m:msub>
                                             <m:mi>X</m:mi>
                                             <m:mrow>
                                                <m:mi>&#947;</m:mi>
                                                <m:mo>,</m:mo>
                                                <m:mn>12</m:mn>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo>,</m:mo>
                                          <m:mn>...</m:mn>
                                          <m:mo>,</m:mo>
                                          <m:msub>
                                             <m:mi>X</m:mi>
                                             <m:mrow>
                                                <m:mi>&#947;</m:mi>
                                                <m:mo>,</m:mo>
                                                <m:mn>1</m:mn>
                                                <m:mi>q</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mo>&#8942;</m:mo>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>X</m:mi>
                                             <m:mrow>
                                                <m:mi>&#947;</m:mi>
                                                <m:mo>,</m:mo>
                                                <m:mi>n</m:mi>
                                                <m:mn>1</m:mn>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo>,</m:mo>
                                          <m:msub>
                                             <m:mi>X</m:mi>
                                             <m:mrow>
                                                <m:mi>&#947;</m:mi>
                                                <m:mo>,</m:mo>
                                                <m:mi>n</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo>,</m:mo>
                                          <m:mn>...</m:mn>
                                          <m:mo>,</m:mo>
                                          <m:msub>
                                             <m:mi>X</m:mi>
                                             <m:mrow>
                                                <m:mi>&#947;</m:mi>
                                                <m:mo>,</m:mo>
                                                <m:mi>n</m:mi>
                                                <m:mi>q</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                           </m:mrow>
                           <m:mo>]</m:mo>
                        </m:mrow>
                        <m:mo>=</m:mo>
                        <m:mrow>
                           <m:mo>[</m:mo>
                           <m:mrow>
                              <m:mtable columnalign="left">
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>x</m:mi>
                                             <m:mrow>
                                                <m:mi>&#947;</m:mi>
                                                <m:mn>1</m:mn>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mo>&#8942;</m:mo>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>x</m:mi>
                                             <m:mrow>
                                                <m:mi>&#947;</m:mi>
                                                <m:mi>n</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                           </m:mrow>
                           <m:mo>]</m:mo>
                        </m:mrow>
                        <m:mo>,</m:mo>
                        <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mn>3</m:mn>
                           <m:mo>)</m:mo>
                        </m:mrow>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacqWFybawdaWgaaWcbaacciGae43SdCgabeaakiabg2da9maadmaabaqbaeaabmqaaaqaaiabdIfaynaaBaaaleaacqGFZoWzcqGGSaalcqaIXaqmcqaIXaqmaeqaaOGaeiilaWIaemiwaG1aaSbaaSqaaiab+n7aNjabcYcaSiabigdaXiabikdaYaqabaGccqGGSaalcqGGUaGlcqGGUaGlcqGGUaGlcqGGSaalcqWGybawdaWgaaWcbaGae43SdCMaeiilaWIaeGymaeJaemyCaehabeaaaOqaaiabl6UinbqaaiabdIfaynaaBaaaleaacqGFZoWzcqGGSaalcqWGUbGBcqaIXaqmaeqaaOGaeiilaWIaemiwaG1aaSbaaSqaaiab+n7aNjabcYcaSiabd6gaUjabikdaYaqabaGccqGGSaalcqGGUaGlcqGGUaGlcqGGUaGlcqGGSaalcqWGybawdaWgaaWcbaGae43SdCMaeiilaWIaemOBa4MaemyCaehabeaaaaaakiaawUfacaGLDbaacqGH9aqpdaWadaqaauaabaqadeaaaeaacqWF4baEdaWgaaWcbaGae43SdCMaeGymaedabeaaaOqaaiabl6Uinbqaaiab=Hha4naaBaaaleaacqGFZoWzcqWGUbGBaeqaaaaaaOGaay5waiaaw2faaiabcYcaSiaaxMaacaWLjaWaaeWaaeaacqaIZaWmaiaawIcacaGLPaaaaaa@7715@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>where the <it>j </it>th column of <b>X</b><sub><b>&#947; </b></sub>is the <it>i </it>th column of the matrix <b>X</b>, with <it>i </it>being the index of the <it>j </it>th non-zero element of the vector <b>&#947;</b>. In formula (3), <it>q </it>genes are selected from the total of <it>p </it>genes, and <it>q </it>&lt;&lt;<it>p </it>in a typical gene selection problem. Formulating the problem in a regression setting, we introduce <it>n </it>latent variables <it>z</it><sub>1</sub>, <it>z</it><sub>2</sub>,..., <it>z</it><sub><it>n</it></sub>, such that</p>
            <p><it>z</it><sub><it>i </it></sub>= <it>g</it>(<b>x</b><sub><b>&#947;</b><it>i</it></sub>) + <it>b </it>+ <it>e</it><sub><it>i </it></sub>= <it>t</it><sub><it>i </it></sub>+ <it>b </it>+ <it>e</it><sub><it>i</it></sub>, and</p>
            <p>
               <m:math name="1471-2105-8-67-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:msub>
                           <m:mi>y</m:mi>
                           <m:mi>i</m:mi>
                        </m:msub>
                        <m:mo>=</m:mo>
                        <m:mrow>
                           <m:mo>{</m:mo>
                           <m:mrow>
                              <m:mtable>
                                 <m:mtr>
                                    <m:mtd>
                                       <m:mn>1</m:mn>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:mtext>if&#160;</m:mtext>
                                          <m:msub>
                                             <m:mi>z</m:mi>
                                             <m:mi>i</m:mi>
                                          </m:msub>
                                          <m:mo>&#8805;</m:mo>
                                          <m:mn>0</m:mn>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd>
                                       <m:mrow>
                                          <m:mtext>if&#160;</m:mtext>
                                          <m:msub>
                                             <m:mi>z</m:mi>
                                             <m:mi>i</m:mi>
                                          </m:msub>
                                          <m:mo>&lt;</m:mo>
                                          <m:mn>0</m:mn>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                              <m:mo>,</m:mo>
                              <m:mi>i</m:mi>
                              <m:mo>=</m:mo>
                              <m:mn>1</m:mn>
                              <m:mo>,</m:mo>
                              <m:mn>2</m:mn>
                              <m:mo>,</m:mo>
                              <m:mn>...</m:mn>
                              <m:mo>,</m:mo>
                              <m:mi>n</m:mi>
                           </m:mrow>
                        </m:mrow>
                        <m:mo>,</m:mo>
                        <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mn>4</m:mn>
                           <m:mo>)</m:mo>
                        </m:mrow>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG5bqEdaWgaaWcbaGaemyAaKgabeaakiabg2da9maaceqabaqbaeqabiGaaaqaaiabigdaXaqaaiabbMgaPjabbAgaMjabbccaGiabdQha6naaBaaaleaacqWGPbqAaeqaaOGaeyyzImRaeGimaadabaGaeyOeI0IaeGymaedabaGaeeyAaKMaeeOzayMaeeiiaaIaemOEaO3aaSbaaSqaaiabdMgaPbqabaGccqGH8aapcqaIWaamaaGaeiilaWIaemyAaKMaeyypa0JaeGymaeJaeiilaWIaeGOmaiJaeiilaWIaeiOla4IaeiOla4IaeiOla4IaeiilaWIaemOBa4gacaGL7baacqGGSaalcaWLjaGaaCzcamaabmaabaGaeGinaqdacaGLOaGaayzkaaaaaa@56D5@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>where <b>x</b><sub><it>&#947;i </it></sub>denotes the <it>i </it>th row of the matrix <b>X</b><sub><it>&#947;</it></sub>; <it>e</it><sub><it>i </it></sub>represents the independent noise term, which is assumed to be Gaussian distributed with zero mean and variance <it>&#963;</it><sup>2</sup>; <it>b </it>is the intercept term; and the link function <it>g</it>(&#183;) is assumed to be chosen from a class of real-valued functions whose outputs form a Gaussian process. In vector form, we define <b>z </b>= [<it>z</it><sub>1</sub>, <it>z</it><sub>2</sub>,..., <it>z</it><sub><it>n</it></sub>]', <b>t </b>= [<it>t</it><sub>1</sub>, <it>t</it><sub>2</sub>,..., <it>t</it><sub><it>n</it></sub>]' and <b>e </b>= [<it>e</it><sub>1</sub>, <it>e</it><sub>2</sub>,..., <it>e</it><sub><it>n</it></sub>]'. Note that if <it>g</it>(&#183;) is restricted to a linear function and <it>&#963;</it><sup>2 </sup>is fixed at 1, model (4) is very similar to a linear probit regression setting.</p>
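            <p>The notation of formulas (1) to (4) can be illustrated with a minimal NumPy sketch. It is not the implementation used in this study: the function names, the simulated data, the weight vector w and the intercept value are all hypothetical, and g(&#183;) is taken to be linear with unit noise variance purely to mirror the probit-like special case noted above.</p>

```python
import numpy as np

def normalize_genes(X):
    """Standardize each gene (column of X) to zero mean and unit variance."""
    mean = X.mean(axis=0)
    std = X.std(axis=0)
    std[std == 0] = 1.0  # leave constant genes unscaled to avoid division by zero
    return (X - mean) / std

def select_genes(X, gamma):
    """Return X_gamma: the columns of X whose entry in gamma equals 1 (formula 3)."""
    gamma = np.asarray(gamma)
    return X[:, gamma == 1]

def labels_from_latent(z):
    """Map latent variables z to class labels +1 / -1 as in formula (4)."""
    return np.where(np.asarray(z) >= 0, 1, -1)

rng = np.random.default_rng(0)
n, p = 6, 10                               # n samples, p genes (simulated)
X = normalize_genes(rng.normal(size=(n, p)))

gamma = np.zeros(p, dtype=int)
gamma[[1, 4, 7]] = 1                       # q = 3 selected genes (arbitrary choice)
X_gamma = select_genes(X, gamma)           # shape (n, q)

# Assumption: a linear g(.) with sigma^2 = 1, used only for illustration.
w = np.array([1.0, -0.5, 2.0])             # hypothetical weights
b = 0.1                                    # hypothetical intercept
t = X_gamma @ w                            # t_i = g(x_gamma_i)
z = t + b + rng.normal(scale=1.0, size=n)  # z_i = t_i + b + e_i
y = labels_from_latent(z)
```

            <p>In this sketch the columns of X_gamma keep the order in which the selected genes appear in <b>X</b>, which matches the column correspondence described for formula (3).</p>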
         </sec>
         <sec>
            <st>
               <p>Kernel-Imbedded Gaussian Processes (KIGPs)</p>
            </st>
            <p>In general, a continuous stochastic process is a collection of random variables, each of which takes real values according to a probability distribution. If we consider the outputs of a learning function <it>g</it>(&#183;), where <it>g </it>is chosen according to some distribution <it>D </it>defined over a class of real-valued functions, then the collection of these outputs is also a stochastic process, and the distribution <it>D </it>represents our prior belief about how likely each function is.</p>
            <p>A Gaussian process is a continuous stochastic process such that the marginal distribution for any finite subset of the collection of its outputs is a zero-mean Gaussian distribution. In this paper, as defined in formula (4), <it>t</it><sub><it>i </it></sub>= <it>g</it>(<b>x</b><sub><it>&#947;i</it></sub>), where <b>x</b><sub><b>&#947;i </b></sub>= [<it>x</it><sub><it>&#947;,i</it>1</sub>, <it>x</it><sub><it>&#947;,i</it>2</sub>,..., <it>x</it><sub><it>&#947;,iq</it></sub>], <it>i </it>= 1,2,..., <it>n</it>; and we assume</p>
            <p><it>P</it><sub><it>g</it>~<it>D</it></sub>([<it>g</it>(<b>x</b><sub><b>&#947;1</b></sub>), <it>g</it>(<b>x</b><sub><b>&#947;2</b></sub>),..., <it>g</it>(<b>x</b><sub><b>&#947;n</b></sub>)] = [<it>t</it><sub>1</sub>, <it>t</it><sub>2</sub>,..., <it>t</it><sub><it>n</it></sub>]) &#8733; <m:math name="1471-2105-8-67-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>exp</m:mi><m:mo>&#8289;</m:mo><m:mo stretchy="false">(</m:mo><m:mo>&#8722;</m:mo><m:mfrac><m:mn>1</m:mn><m:mn>2</m:mn></m:mfrac><m:mi>t</m:mi><m:mo>'</m:mo><m:msup><m:mi>K</m:mi><m:mrow><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:msup><m:mi>t</m:mi><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacyGGLbqzcqGG4baEcqGGWbaCcqGGOaakcqGHsisldaWcaaqaaiabigdaXaqaaiabikdaYaaaieqacqWF0baDcqGGNaWjcqWFlbWsdaahaaWcbeqaaiabgkHiTiabigdaXaaakiab=rha0jabcMcaPaaa@3C5A@</m:annotation></m:semantics></m:math>, where</p>
            <p><b>K</b><sub><it>ij </it></sub>= <it>K</it>(<b>x</b><sub><b>&#947;i</b></sub>, <b>x</b><sub><b>&#947;j</b></sub>), <it>i</it>, <it>j </it>= 1,2,..., <it>n</it>. &#160;&#160;&#160; (5)</p>
            <p>In (5), <it>K</it>(<b>x</b><sub><b>&#947;i</b></sub>, <b>x</b><sub><b>&#947;j</b></sub>) is a function defined in the observation space that conceptually represents the inner product of the sample vectors <b>x</b><sub><b>&#947;i </b></sub>and <b>x</b><sub><b>&#947;j </b></sub>in the feature space, &#10216;&#936;(<b>x</b><sub><b>&#947;i</b></sub>), &#936;(<b>x</b><sub><b>&#947;j</b></sub>)&#10217; (assuming &#936;(&#183;) is the mapping function from the observation space to the feature space). <b>K </b>is a kernel matrix called the Mercer kernel. Formula (5) expresses our prior belief about the learning model, and the kernel function <it>K</it>(&#183;,&#183;) uniquely determines the properties of our learning functions. Some of the most commonly used kernel functions include:</p>
            <p>Linear kernel: <it>K</it>(<b>x</b><sub><it>&#947;</it><b>i</b></sub>, <b>x</b><sub><it>&#947;</it><b>j</b></sub>) = &#10216;<b>x</b><sub><it>&#947;</it><b>i</b></sub>, <b>x</b><sub><it>&#947;</it><b>j</b></sub>&#10217; &#160;&#160;&#160; (6a)</p>
            <p>Polynomial kernel: <it>K</it>(<b>x</b><sub><it>&#947;</it><b>i</b></sub>, <b>x</b><sub><it>&#947;</it><b>j</b></sub>) = (&#10216;<b>x</b><sub><it>&#947;</it><b>i</b></sub>, <b>x</b><sub><it>&#947;</it><b>j</b></sub>&#10217; + 1)<sup><it>d</it></sup>, where <it>d </it>= 1,2,... is the degree parameter. &#160;&#160;&#160; (6b)</p>
            <p>Gaussian kernel: <it>K</it>(<b>x</b><sub><it>&#947;</it><b>i</b></sub>, <b>x</b><sub><it>&#947;</it><b>j</b></sub>) = <m:math name="1471-2105-8-67-i8" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>exp</m:mi><m:mo>&#8289;</m:mo><m:mo stretchy="false">(</m:mo><m:mo>&#8722;</m:mo><m:mfrac><m:mrow><m:msup><m:mrow><m:mrow><m:mo>&#8214;</m:mo><m:mrow><m:msub><m:mi>x</m:mi><m:mrow><m:mi>&#947;</m:mi><m:mi>i</m:mi></m:mrow></m:msub><m:mo>&#8722;</m:mo><m:msub><m:mi>x</m:mi><m:mrow><m:mi>&#947;</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:mo>&#8214;</m:mo></m:mrow></m:mrow><m:mn>2</m:mn></m:msup></m:mrow><m:mrow><m:mn>2</m:mn><m:msup><m:mi>r</m:mi><m:mn>2</m:mn></m:msup></m:mrow></m:mfrac><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacyGGLbqzcqGG4baEcqGGWbaCcqGGOaakcqGHsisldaWcaaqaamaafmaabaacbeGae8hEaG3aaSbaaSqaaGGaciab+n7aNjab=LgaPbqabaGccqGHsislcqWF4baEdaWgaaWcbaGae43SdCMae8NAaOgabeaaaOGaayzcSlaawQa7amaaCaaaleqabaGaeGOmaidaaaGcbaGaeGOmaiJaemOCai3aaWbaaSqabeaacqaIYaGmaaaaaOGaeiykaKcaaa@45B2@</m:annotation></m:semantics></m:math>, where <it>r </it>> 0 is the width parameter. &#160;&#160;&#160; (6c)</p>
            <p>In (6a) and (6b), the term &#10216;<b>x</b><sub><it>&#947;</it><b>i</b></sub>, <b>x</b><sub><it>&#947;</it><b>j</b></sub>&#10217; denotes the inner product between the vectors <b>x</b><sub><it>&#947;i </it></sub>and <b>x</b><sub><it>&#947;j</it></sub>. When one uses the linear kernel, the feature space is the same as the observation space. In this paper, we refer to the linear kernel as LK, the polynomial kernel with degree "d" as PK(d), and the Gaussian kernel with width "r" as GK(r). We primarily focus on the KIGP method with the Gaussian kernel and the polynomial kernel, and discuss them in parallel.</p>
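            <p>As an illustration only, the three kernels (6a), (6b) and (6c) can be sketched in Python with NumPy. The function names, the toy matrix X_gamma (rows are samples restricted to the selected genes) and the parameter values are assumptions made for this example and are not taken from the KIGP implementation.</p>

```python
import numpy as np


def linear_kernel(X, Y):
    """Linear kernel (6a): inner products between the rows of X and Y."""
    return X @ Y.T


def polynomial_kernel(X, Y, d=2):
    """Polynomial kernel (6b): (inner product + 1) raised to the degree d."""
    return (X @ Y.T + 1.0) ** d


def gaussian_kernel(X, Y, r=1.0):
    """Gaussian kernel (6c): exp(-||x - y||^2 / (2 r^2)) with width r > 0."""
    # Squared Euclidean distances via ||x||^2 + ||y||^2 - 2 x.y
    sq = (X ** 2).sum(axis=1)[:, None] + (Y ** 2).sum(axis=1)[None, :] - 2.0 * X @ Y.T
    sq = np.maximum(sq, 0.0)  # guard against tiny negative round-off
    return np.exp(-sq / (2.0 * r ** 2))


# Toy data (hypothetical): n = 4 samples in rows, 3 selected genes in columns.
rng = np.random.default_rng(0)
X_gamma = rng.standard_normal((4, 3))

K_lk = linear_kernel(X_gamma, X_gamma)             # LK
K_pk2 = polynomial_kernel(X_gamma, X_gamma, d=2)   # PK(2)
K_gk = gaussian_kernel(X_gamma, X_gamma, r=1.5)    # GK(1.5)
```

            <p>Each call returns an <it>n </it>&#215; <it>n </it>matrix whose (<it>i</it>, <it>j</it>) entry is the kernel value of samples <it>i </it>and <it>j</it>, which is the form required by (5).</p>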
            <p>In model (4), we have the latent vector <b>z </b>= <b>t </b>+ <b>e </b>+ <it>b</it><b>1<sub>n</sub></b>, where <b>e </b>~ <it>N</it>(<b>0</b>, <it>&#963;</it><sup>2</sup><b>I<sub>n</sub></b>), <b>I</b><sub><b>n </b></sub>denotes the <it>n </it>&#215; <it>n </it>identity matrix, and <b>1<sub>n </sub></b>denotes the <it>n </it>&#215; 1 vector with all elements equal to 1; <it>N</it>(&#183;,&#183;) denotes the multivariate normal distribution. Hence,</p>
            <p><it>P</it>(<b>z</b>|<b>t</b>) &#8733; exp(-<m:math name="1471-2105-8-67-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mfrac><m:mn>1</m:mn><m:mn>2</m:mn></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaWcaaqaaiabigdaXaqaaiabikdaYaaaaaa@2E9E@</m:annotation></m:semantics></m:math>(<b>z </b>- <b>t </b>- <it>b</it><b>1<sub>n</sub></b>)'&#937;<sup>-1</sup>(<b>z </b>- <b>t </b>- <it>b</it><b>1<sub>n</sub></b>)), where &#937; = <it>&#963;</it><sup>2</sup><b>I<sub>n</sub></b>. &#160;&#160;&#160; (7)</p>
            <p>With the Bayes rule, we have</p>
            <p>
               <m:math name="1471-2105-8-67-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>P</m:mi>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mover accent="true">
                           <m:mi>t</m:mi>
                           <m:mo>&#732;</m:mo>
                        </m:mover>
                        <m:mo>,</m:mo>
                        <m:mi>t</m:mi>
                        <m:mo>|</m:mo>
                        <m:msub>
                           <m:mover accent="true">
                              <m:mi>x</m:mi>
                              <m:mo>&#732;</m:mo>
                           </m:mover>
                           <m:mi>&#947;</m:mi>
                        </m:msub>
                        <m:mo>,</m:mo>
                        <m:msub>
                           <m:mi>X</m:mi>
                           <m:mi>&#947;</m:mi>
                        </m:msub>
                        <m:mo>,</m:mo>
                        <m:mi>z</m:mi>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>=</m:mo>
                        <m:mfrac>
                           <m:mrow>
                              <m:mi>P</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mi>z</m:mi>
                              <m:mo>|</m:mo>
                              <m:mi>t</m:mi>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>x</m:mi>
                                    <m:mo>&#732;</m:mo>
                                 </m:mover>
                                 <m:mi>&#947;</m:mi>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>X</m:mi>
                                 <m:mi>&#947;</m:mi>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mi>P</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mover accent="true">
                                 <m:mi>t</m:mi>
                                 <m:mo>&#732;</m:mo>
                              </m:mover>
                              <m:mo>,</m:mo>
                              <m:mi>t</m:mi>
                              <m:mo>|</m:mo>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>x</m:mi>
                                    <m:mo>&#732;</m:mo>
                                 </m:mover>
                                 <m:mi>&#947;</m:mi>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>X</m:mi>
                                 <m:mi>&#947;</m:mi>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                           </m:mrow>
                           <m:mrow>
                              <m:mi>P</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mi>z</m:mi>
                              <m:mo>|</m:mo>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>x</m:mi>
                                    <m:mo>&#732;</m:mo>
                                 </m:mover>
                                 <m:mi>&#947;</m:mi>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>X</m:mi>
                                 <m:mi>&#947;</m:mi>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                           </m:mrow>
                        </m:mfrac>
                        <m:mo>&#8733;</m:mo>
                        <m:mi>P</m:mi>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mi>z</m:mi>
                        <m:mo>|</m:mo>
                        <m:mi>t</m:mi>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mi>P</m:mi>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mover accent="true">
                           <m:mi>t</m:mi>
                           <m:mo>&#732;</m:mo>
                        </m:mover>
                        <m:mo>,</m:mo>
                        <m:mi>t</m:mi>
                        <m:mo>|</m:mo>
                        <m:msub>
                           <m:mover accent="true">
                              <m:mi>x</m:mi>
                              <m:mo>&#732;</m:mo>
                           </m:mover>
                           <m:mi>&#947;</m:mi>
                        </m:msub>
                        <m:mo>,</m:mo>
                        <m:msub>
                           <m:mi>X</m:mi>
                           <m:mi>&#947;</m:mi>
                        </m:msub>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>,</m:mo>
                        <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mn>8</m:mn>
                           <m:mo>)</m:mo>
                        </m:mrow>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGqbaucqGGOaakcuWG0baDgaacaiabcYcaSGqabiab=rha0jabcYha8jqb=Hha4zaaiaWaaSbaaSqaaGGaciab+n7aNbqabaGccqGGSaalcqWFybawdaWgaaWcbaGae43SdCgabeaakiabcYcaSiab=Pha6jabcMcaPiabg2da9maalaaabaGaemiuaaLaeiikaGIae8NEaONaeiiFaWNae8hDaqNaeiilaWIaf8hEaGNbaGaadaWgaaWcbaGae43SdCgabeaakiabcYcaSiab=HfaynaaBaaaleaacqGFZoWzaeqaaOGaeiykaKIaemiuaaLaeiikaGIafmiDaqNbaGaacqGGSaalcqWF0baDcqGG8baFcuWF4baEgaacamaaBaaaleaacqGFZoWzaeqaaOGaeiilaWIae8hwaG1aaSbaaSqaaiab+n7aNbqabaGccqGGPaqkaeaacqWGqbaucqGGOaakcqWF6bGEcqGG8baFcuWF4baEgaacamaaBaaaleaacqGFZoWzaeqaaOGaeiilaWIae8hwaG1aaSbaaSqaaiab+n7aNbqabaGccqGGPaqkaaGaeyyhIuRaemiuaaLaeiikaGIae8NEaONaeiiFaWNae8hDaqNaeiykaKIaemiuaaLaeiikaGIafmiDaqNbaGaacqGGSaalcqWF0baDcqGG8baFcuWF4baEgaacamaaBaaaleaacqGFZoWzaeqaaOGaeiilaWIae8hwaG1aaSbaaSqaaiab+n7aNbqabaGccqGGPaqkcqGGSaalcaWLjaGaaCzcamaabmaabaGaeGioaGdacaGLOaGaayzkaaaaaa@886D@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>where <m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947; </it></sub>is the new predictor associated with the given gene-selection vector <b>&#947; </b>and <m:math name="1471-2105-8-67-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>t</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG0baDgaacaaaa@2E2C@</m:annotation></m:semantics></m:math> is the posterior output (without intercept <it>b</it>) with respect to <m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>, provided the matrix <b>X</b><sub><it>&#947; </it></sub>and the latent output <b>z</b>. With a kernel such as defined by (6) and assuming an intercept <it>b </it>and a variance of noise <it>&#963;</it><sup>2 </sup>are both given, plugging (5) and (7) into (8) and integrating out <b>t</b>, the marginal distribution of <m:math name="1471-2105-8-67-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>t</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG0baDgaacaaaa@2E2C@</m:annotation></m:semantics></m:math> given <m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>, <b>X</b><sub><it>&#947; </it></sub>and <b>z </b>yields a Gaussian distribution as follows <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>:</p>
            <p><m:math name="1471-2105-8-67-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>t</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG0baDgaacaaaa@2E2C@</m:annotation></m:semantics></m:math>|<m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>, <b>X</b><sub><it>&#947;</it></sub>, <b>z </b>~ <it>N</it>(<it>f</it>(<m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>, <b>X</b><sub><it>&#947;</it></sub>, <b>z</b>), <it>V</it>(<m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>, <b>X</b><sub><it>&#947;</it></sub>, <b>z</b>)), where</p>
            <p><it>f</it>(<m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>, <b>X</b><sub><it>&#947;</it></sub>, <b>z</b>) = (<b>z </b>- <it>b</it><b>1<sub>n</sub></b>)' (<b>K</b><sub><it>&#947; </it></sub>+ <it>&#963;</it><sup>2</sup><b>I</b>)<sup>-1 </sup><b>k</b><sub><it>&#947;</it></sub>, <it>V</it>(<m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>, <b>X</b><sub><it>&#947;</it></sub>, <b>z</b>) = <it>K</it>(<m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>, <m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>) - <b>k</b><sub><it>&#947;</it></sub>'(<b>K</b><sub><it>&#947; </it></sub>+ <it>&#963;</it><sup>2</sup><b>I</b>)<sup>-1</sup><b>k</b><sub><it>&#947;</it></sub>,</p>
            <p><b>K</b><sub><it>&#947;,ij </it></sub>= <it>K</it>(<b>x</b><sub><it>&#947;</it><b>i</b></sub>, <b>x</b><sub><it>&#947;</it><b>j</b></sub>), <b>k</b><sub><it>&#947;i </it></sub>= <it>K</it>(<m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>, <b>x</b><sub><it>&#947;i</it></sub>), <it>i</it>, <it>j </it>= 1,2,..., <it>n</it>. &#160;&#160;&#160; (9)</p>
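            <p>The predictive mean <it>f </it>and variance <it>V </it>in (9) can be sketched numerically as follows. This is a minimal example under stated assumptions: it uses NumPy, a Gaussian kernel, and randomly generated toy data; the function gp_predict and all numerical values are hypothetical and serve only to show how the terms of (9) fit together.</p>

```python
import numpy as np


def gaussian_kernel(X, Y, r=1.0):
    """Gaussian kernel (6c) between the rows of X and the rows of Y."""
    diff = X[:, None, :] - Y[None, :, :]
    return np.exp(-(diff ** 2).sum(axis=2) / (2.0 * r ** 2))


def gp_predict(x_new, X_gamma, z, b, sigma2, kernel):
    """Mean f and variance V of (9) for one new sample x_new."""
    n = X_gamma.shape[0]
    K = kernel(X_gamma, X_gamma)                  # K_gamma, n x n
    k = kernel(x_new[None, :], X_gamma).ravel()   # k_gamma, length n
    A = K + sigma2 * np.eye(n)                    # K_gamma + sigma^2 I
    # Solve linear systems instead of forming the inverse explicitly.
    f = k @ np.linalg.solve(A, z - b)             # (z - b 1_n)' A^{-1} k_gamma
    k_xx = kernel(x_new[None, :], x_new[None, :])[0, 0]
    V = k_xx - k @ np.linalg.solve(A, k)
    return f, V


rng = np.random.default_rng(1)
X_gamma = rng.standard_normal((5, 3))   # 5 training samples, 3 selected genes
z = rng.standard_normal(5)              # latent outputs (toy values)
x_new = rng.standard_normal(3)          # new sample on the same genes
f, V = gp_predict(x_new, X_gamma, z, b=0.2, sigma2=0.5, kernel=gaussian_kernel)
```

            <p>Because <b>K</b><sub><it>&#947; </it></sub>+ <it>&#963;</it><sup>2</sup><b>I </b>is symmetric, the order of the factors in the mean does not matter, and solving the linear system is generally preferable to an explicit matrix inverse.</p>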
         </sec>
         <sec>
            <st>
               <p>Supervised Microarray Data Analysis using KIGP</p>
            </st>
            <sec>
               <st>
                  <p>Prior specification</p>
               </st>
               <p><b>(1) </b><it>&#947;</it><sub><it>j </it></sub>is assumed to be a priori independent for all <it>j</it>, and</p>
               <p>Pr(<it>&#947;</it><sub><it>j </it></sub>= 1) = <it>&#960;</it><sub><it>j</it></sub>, for <it>j </it>= 1, 2,..., <it>p</it>, &#160;&#160;&#160; (10)</p>
               <p>where the prior probability <it>&#960;</it><sub><it>j </it></sub>reflects prior knowledge of the importance of the <it>j</it>th gene.</p>
               <p><b>(2) </b>A non-informative prior is applied for the intercept <it>b</it>:</p>
               <p><it>P</it>(<it>b</it>) &#8733; 1. &#160;&#160;&#160; (11a)</p>
                <p>This is not a proper probability density function (PDF), but it leads to a proper posterior PDF.</p>
               <p><b>(3) </b>The inverted gamma (IG) distribution is applied as the prior for the variance of noise <it>&#963;</it><sup>2</sup>. Specifically, we assume:</p>
               <p><it>P</it>(<it>&#963;</it><sup>2</sup>) ~ <it>IG</it>(1,1) &#160;&#160;&#160; (11b)</p>
                <p><b>(4) </b>For the width of a Gaussian kernel (i.e. a scaling parameter), an inverted gamma distribution is also a reasonable choice of prior. To preclude values of <it>r </it>that are too small or too large (which would make the system numerically unstable), we apply IG(1,1) as the prior for <it>r</it><sup>2</sup>, that is</p>
               <p><it>P</it>(<it>r</it><sup>2</sup>) ~ <it>IG</it>(1,1) &#160;&#160;&#160; (11c)</p>
               <p><b>(5) </b>For the degree of a polynomial kernel, we assume a uniform distribution. In this paper, we only consider the PK(1) and the PK(2) to avoid the issue of overfitting for most practical cases. Therefore, we have <it>P</it>(<it>d </it>= 1) = <it>P</it>(<it>d </it>= 2) = 0.5.</p>
                <p><b>(6) </b>We assume that <b>&#947; </b>and <it>b </it>are a priori independent of each other, that is <it>P</it>(<b>&#947;</b>, <it>b</it>) = <it>P</it>(<b>&#947;</b>)<it>P</it>(<it>b</it>).</p>
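                <p>The priors listed above can be written down in a few lines of Python. The sketch below assumes NumPy; the function names and the example values of <it>&#960;</it><sub><it>j </it></sub>are hypothetical. It evaluates the log of the independent prior (10), evaluates the IG(1,1) log density used in (11b) and (11c), and draws one sample of <it>&#963;</it><sup>2</sup>, <it>r</it><sup>2 </sup>and <it>d</it>. The flat prior (11a) on <it>b </it>is improper and therefore is not sampled.</p>

```python
import numpy as np


def log_prior_gamma(gamma, pi):
    """Log of the independent Bernoulli prior (10) on the selection vector."""
    gamma = np.asarray(gamma, dtype=float)
    pi = np.asarray(pi, dtype=float)
    return float(np.sum(gamma * np.log(pi) + (1.0 - gamma) * np.log1p(-pi)))


def log_ig_1_1(x):
    """Log density of IG(1, 1) at x > 0, i.e. log of x^(-2) * exp(-1 / x)."""
    return -2.0 * np.log(x) - 1.0 / x


def sample_priors(rng):
    """Draw sigma^2 and r^2 from IG(1, 1) and the degree d from {1, 2}."""
    sigma2 = 1.0 / rng.gamma(shape=1.0, scale=1.0)   # reciprocal of Gamma(1, 1)
    r2 = 1.0 / rng.gamma(shape=1.0, scale=1.0)
    d = int(rng.choice([1, 2]))                      # P(d = 1) = P(d = 2) = 0.5
    return sigma2, r2, d


rng = np.random.default_rng(2)
sigma2, r2, d = sample_priors(rng)
# Hypothetical example: 4 genes, each with prior inclusion probability 0.1.
lp = log_prior_gamma([1, 0, 0, 1], pi=[0.1, 0.1, 0.1, 0.1])
```

                <p>Working with log densities, as in this sketch, avoids numerical underflow when <it>p </it>is large.</p>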
            </sec>
            <sec>
               <st>
                  <p>Bayesian inferences for model parameters</p>
               </st>
                <p>Based on model (4), the label <b>y </b>depends only on <b>z</b>; therefore, all other model parameters are conditionally independent of <b>y </b>given <b>z</b>. For convenience, we drop the notation of the training set <b>X </b>in the following derivation and drop <b>y </b>as well when <b>z </b>is given. We also assume the kernel type is given and denote the associated kernel parameter by <b>&#952;</b>.</p>
               <p><b>(I) Sampling from &#947;</b>|<b>z</b>, <it>b</it>, <it>&#963;</it><sup>2</sup>, <b>&#952;</b></p>
               <p>Here, we drop the notation of the given parameters <it>b</it>, <it>&#963;</it><sup>2 </sup>and <b>&#952;</b>. With the model described in (2), (5) and (7), we have</p>
               <p>
                  <m:math name="1471-2105-8-67-i13" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>P</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>z</m:mi>
                           <m:mo>|</m:mo>
                           <m:mi>&#947;</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:mrow>
                                 <m:msub>
                                    <m:mo>&#8747;</m:mo>
                                    <m:mi>t</m:mi>
                                 </m:msub>
                                 <m:mrow>
                                    <m:mi>P</m:mi>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mi>z</m:mi>
                                    <m:mo>|</m:mo>
                                    <m:mi>t</m:mi>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mi>P</m:mi>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mi>t</m:mi>
                                    <m:mo>|</m:mo>
                                    <m:msub>
                                       <m:mi>X</m:mi>
                                       <m:mi>&#947;</m:mi>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mi>d</m:mi>
                                    <m:mi>t</m:mi>
                                    <m:mo>~</m:mo>
                                    <m:mi>N</m:mi>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mi>b</m:mi>
                                    <m:msub>
                                       <m:mn>1</m:mn>
                                       <m:mi>n</m:mi>
                                    </m:msub>
                                    <m:mo>,</m:mo>
                                    <m:msub>
                                       <m:mi>K</m:mi>
                                       <m:mi>&#947;</m:mi>
                                    </m:msub>
                                    <m:mo>+</m:mo>
                                    <m:mi>&#937;</m:mi>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGqbaucqGGOaakieqacqWF6bGEcqGG8baFiiGacqGFZoWzcqGGPaqkcqGH9aqpdaWdraqaaiabdcfaqjabcIcaOiab=Pha6jabcYha8jab=rha0jabcMcaPiabdcfaqjabcIcaOiab=rha0jabcYha8jab=HfaynaaBaaaleaacqGFZoWzaeqaaOGaeiykaKIaemizaqMae8hDaqNaeiOFa4NaemOta4KaeiikaGIaemOyaiMae8xmaeZaaSbaaSqaaiab=5gaUbqabaGccqGGSaalcqWFlbWsdaWgaaWcbaGae43SdCgabeaakiabgUcaRGGabiab9L6axjabcMcaPaWcbaGae8hDaqhabeqdcqGHRiI8aaaa@59F2@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
                <p><b>K</b><sub><it>&#947;</it>,<it>ij </it></sub>= <it>K</it>(<b>x</b><sub><it>&#947;i</it></sub>, <b>x</b><sub><it>&#947;j</it></sub>), <it>i</it>, <it>j </it>= 1,2,..., <it>n</it>; <b>&#937; </b>= <it>&#963;</it><sup>2</sup><b>I<sub>n</sub></b>, <b>I</b><sub><b>n </b></sub>and <b>1</b><sub><b>n </b></sub>are defined in (7). &#160;&#160;&#160; (12)</p>
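                <p>For a fixed <b>&#947;</b>, the density in (12) is an ordinary multivariate normal density and can be evaluated on the log scale. The following Python sketch assumes NumPy; the function name log_marginal_z and the toy inputs are hypothetical, and the kernel matrix is passed in directly rather than computed from data.</p>

```python
import numpy as np


def log_marginal_z(z, K_gamma, b, sigma2):
    """Log density of z under N(b 1_n, K_gamma + sigma^2 I_n), as in (12)."""
    n = z.shape[0]
    A = K_gamma + sigma2 * np.eye(n)
    resid = z - b                          # z - b 1_n
    sign, logdet = np.linalg.slogdet(A)    # stable log-determinant of A
    quad = resid @ np.linalg.solve(A, resid)
    return -0.5 * (logdet + quad + n * np.log(2.0 * np.pi))


# Toy check with a diagonal kernel matrix, where the density factorizes.
z = np.array([0.3, -1.2, 0.8])
K_gamma = np.eye(3)
value = log_marginal_z(z, K_gamma, b=0.1, sigma2=1.0)
```

                <p>With this diagonal toy matrix, the result equals the sum of three univariate normal log densities with mean 0.1 and variance 2, which gives a simple way to check the function.</p>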
                <p>The detailed derivation for (12) is provided in the Appendix. After inserting the prior given by (10), we have</p>
               <p>
                  <m:math name="1471-2105-8-67-i14" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable columnalign="left">
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mi>P</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>&#947;</m:mi>
                                       <m:mo>|</m:mo>
                                       <m:mi>z</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>&#8733;</m:mo>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">[</m:mo>
                                             <m:mi>det</m:mi>
                                             <m:mo>&#8289;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>K</m:mi>
                                                <m:mi>&#947;</m:mi>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:msup>
                                                <m:mi>&#963;</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:msup>
                                             <m:msub>
                                                <m:mi>I</m:mi>
                                                <m:mi>n</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">]</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mfrac>
                                                <m:mn>1</m:mn>
                                                <m:mn>2</m:mn>
                                             </m:mfrac>
                                          </m:mrow>
                                       </m:msup>
                                       <m:mo>&#215;</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mtext/>
                                       <m:mi>exp</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mo>{</m:mo>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mfrac>
                                          <m:mn>1</m:mn>
                                          <m:mn>2</m:mn>
                                       </m:mfrac>
                                       <m:mo stretchy="false">[</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>z</m:mi>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mi>b</m:mi>
                                       <m:msub>
                                          <m:mn>1</m:mn>
                                          <m:mi>n</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>'</m:mo>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>K</m:mi>
                                                <m:mi>&#947;</m:mi>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:msup>
                                                <m:mi>&#963;</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:msup>
                                             <m:msub>
                                                <m:mi>I</m:mi>
                                                <m:mi>n</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mn>1</m:mn>
                                          </m:mrow>
                                       </m:msup>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>z</m:mi>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mi>b</m:mi>
                                       <m:msub>
                                          <m:mn>1</m:mn>
                                          <m:mi>n</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo stretchy="false">]</m:mo>
                                       <m:mo>}</m:mo>
                                       <m:mstyle displaystyle="true">
                                          <m:munderover>
                                             <m:mo>&#8719;</m:mo>
                                             <m:mrow>
                                                <m:mi>j</m:mi>
                                                <m:mo>=</m:mo>
                                                <m:mn>1</m:mn>
                                             </m:mrow>
                                             <m:mi>p</m:mi>
                                          </m:munderover>
                                          <m:mrow>
                                             <m:msubsup>
                                                <m:mi>&#960;</m:mi>
                                                <m:mi>j</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>&#947;</m:mi>
                                                      <m:mi>j</m:mi>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:msubsup>
                                             <m:msup>
                                                <m:mrow>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mn>1</m:mn>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:msub>
                                                      <m:mi>&#960;</m:mi>
                                                      <m:mi>j</m:mi>
                                                   </m:msub>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                                <m:mrow>
                                                   <m:mn>1</m:mn>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:msub>
                                                      <m:mi>&#947;</m:mi>
                                                      <m:mi>j</m:mi>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:msup>
                                          </m:mrow>
                                       </m:mstyle>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                           <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:mn>13</m:mn>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaafaqaaeGabaaabaGaemiuaaLaeiikaGccceGae83SdCMaeiiFaWhcbeGae4NEaONaeiykaKIaeyyhIuRaei4waSLagiizaqMaeiyzauMaeiiDaqNaeiikaGIae43saS0aaSbaaSqaaGGaciab9n7aNbqabaGccqGHRaWkcqqFdpWCdaahaaWcbeqaaiabikdaYaaakiab+LeajnaaBaaaleaacqGFUbGBaeqaaOGaeiykaKIaeiyxa01aaWbaaSqabeaacqGHsisldaWcaaqaaiabigdaXaqaaiabikdaYaaaaaGccqGHxdaTaeaacaWLjaGagiyzauMaeiiEaGNaeiiCaaNaei4EaSNaeyOeI0YaaSaaaeaacqaIXaqmaeaacqaIYaGmaaGaei4waSLaeiikaGIae4NEaONaeyOeI0IaemOyaiMae4xmaeZaaSbaaSqaaiab+5gaUbqabaGccqGGPaqkcqGGNaWjcqGGOaakcqGFlbWsdaWgaaWcbaGae03SdCgabeaakiabgUcaRiab9n8aZnaaCaaaleqabaGaeGOmaidaaOGae4xsaK0aaSbaaSqaaiab+5gaUbqabaGccqGGPaqkdaahaaWcbeqaaiabgkHiTiabigdaXaaakiabcIcaOiab+Pha6jabgkHiTiabdkgaIjab+fdaXmaaBaaaleaacqGFUbGBaeqaaOGaeiykaKIaeiyxa0LaeiyFa03aaebCaeaacqqFapaCdaqhaaWcbaGaemOAaOgabaGae03SdC2aaSbaaWqaaiabdQgaQbqabaaaaOGaeiikaGIaeGymaeJaeyOeI0Iae0hWda3aaSbaaSqaaiabdQgaQbqabaGccqGGPaqkdaahaaWcbeqaaiabigdaXiabgkHiTiab9n7aNnaaBaaameaacqWGQbGAaeqaaaaaaSqaaiabdQgaQjabg2da9iabigdaXaqaaiabdchaWbqdcqGHpis1aaaakiaaxMaacaWLjaWaaeWaaeaacqaIXaqmcqaIZaWmaiaawIcacaGLPaaaaaa@95C0@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>In practice, rather than sampling <b>&#947; </b>as a vector, we sample it component-wise from</p>
               <p>
                  <m:math name="1471-2105-8-67-i15" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable columnalign="left">
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mi>P</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>&#947;</m:mi>
                                          <m:mi>j</m:mi>
                                       </m:msub>
                                       <m:mo>|</m:mo>
                                       <m:mi>z</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>&#8733;</m:mo>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">[</m:mo>
                                             <m:mi>det</m:mi>
                                             <m:mo>&#8289;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>K</m:mi>
                                                <m:mi>&#947;</m:mi>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:msup>
                                                <m:mi>&#963;</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:msup>
                                             <m:msub>
                                                <m:mi>I</m:mi>
                                                <m:mi>n</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">]</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mfrac>
                                                <m:mn>1</m:mn>
                                                <m:mn>2</m:mn>
                                             </m:mfrac>
                                          </m:mrow>
                                       </m:msup>
                                       <m:mo>&#215;</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mtext/>
                                       <m:mi>exp</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mo>{</m:mo>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mfrac>
                                          <m:mn>1</m:mn>
                                          <m:mn>2</m:mn>
                                       </m:mfrac>
                                       <m:mo stretchy="false">[</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>z</m:mi>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mi>b</m:mi>
                                       <m:msub>
                                          <m:mn>1</m:mn>
                                          <m:mi>n</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>'</m:mo>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>K</m:mi>
                                                <m:mi>&#947;</m:mi>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:msup>
                                                <m:mi>&#963;</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:msup>
                                             <m:msub>
                                                <m:mi>I</m:mi>
                                                <m:mi>n</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mn>1</m:mn>
                                          </m:mrow>
                                       </m:msup>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>z</m:mi>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mi>b</m:mi>
                                       <m:msub>
                                          <m:mn>1</m:mn>
                                          <m:mi>n</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo stretchy="false">]</m:mo>
                                       <m:mo>}</m:mo>
                                       <m:msubsup>
                                          <m:mi>&#960;</m:mi>
                                          <m:mi>j</m:mi>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>&#947;</m:mi>
                                                <m:mi>j</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                       </m:msubsup>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mn>1</m:mn>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>&#960;</m:mi>
                                                <m:mi>j</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mn>1</m:mn>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>&#947;</m:mi>
                                                <m:mi>j</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                       </m:msup>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                           <m:mo>,</m:mo>
                           <m:mtext>&#160;for&#160;</m:mtext>
                           <m:mi>j</m:mi>
                           <m:mo>=</m:mo>
                           <m:mn>1</m:mn>
                           <m:mo>,</m:mo>
                           <m:mn>2</m:mn>
                           <m:mo>,</m:mo>
                           <m:mo>...</m:mo>
                           <m:mo>,</m:mo>
                           <m:mi>p</m:mi>
                           <m:mo>.</m:mo>
                           <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:mn>14</m:mn>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaafaqaaeGabaaabaGaemiuaaLaeiikaGccceGae83SdC2aaSbaaSqaaGqaciab+PgaQbqabaGccqGG8baFieqacqqF6bGEcqGGPaqkcqGHDisTcqGGBbWwcyGGKbazcqGGLbqzcqGG0baDcqGGOaakcqqFlbWsdaWgaaWcbaacciGaeW3SdCgabeaakiabgUcaRiab8n8aZnaaCaaaleqabaGaeGOmaidaaOGae0xsaK0aaSbaaSqaaiab95gaUbqabaGccqGGPaqkcqGGDbqxdaahaaWcbeqaaiabgkHiTmaalaaabaGaeGymaedabaGaeGOmaidaaaaakiabgEna0cqaaiaaxMaacyGGLbqzcqGG4baEcqGGWbaCcqGG7bWEcqGHsisldaWcaaqaaiabigdaXaqaaiabikdaYaaacqGGBbWwcqGGOaakcqqF6bGEcqGHsislcqWGIbGycqqFXaqmdaWgaaWcbaGae0NBa4gabeaakiabcMcaPiabcEcaNiabcIcaOiab9TealnaaBaaaleaacqaFZoWzaeqaaOGaey4kaSIaeW3Wdm3aaWbaaSqabeaacqaIYaGmaaGccqqFjbqsdaWgaaWcbaGae0NBa4gabeaakiabcMcaPmaaCaaaleqabaGaeyOeI0IaeGymaedaaOGaeiikaGIae0NEaONaeyOeI0IaemOyaiMae0xmaeZaaSbaaSqaaiab95gaUbqabaGccqGGPaqkcqGGDbqxcqGG9bqFcqaFapaCdaqhaaWcbaGaemOAaOgabaGaeW3SdC2aaSbaaWqaaiabdQgaQbqabaaaaOGaeiikaGIaeGymaeJaeyOeI0IaeWhWda3aaSbaaSqaaiabdQgaQbqabaGccqGGPaqkdaahaaWcbeqaaiabigdaXiabgkHiTiab8n7aNnaaBaaameaacqWGQbGAaeqaaaaaaaGccqGGSaalcqqGGaaicqqGMbGzcqqGVbWBcqqGYbGCcqqGGaaicqWGQbGAcqGH9aqpcqaIXaqmcqGGSaalcqaIYaGmcqGGSaalcqGGUaGlcqGGUaGlcqGGUaGlcqGGSaalcqWGWbaCcqGGUaGlcaWLjaGaaCzcamaabmaabaGaeGymaeJaeGinaqdacaGLOaGaayzkaaaaaa@A2C8@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>In both (13) and (14), <b>K</b><sub><it>&#947; </it></sub>is defined in (12).</p>
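               <p>As an illustration only, one component-wise sweep over <b>&#947; </b>based on (14) might be sketched as follows. The function names, the Gaussian kernel form and its <it>width</it> argument are hypothetical stand-ins, since the kernel in (6c) and the matrix in (12) are defined elsewhere in the paper; the sketch also assumes that <b>K</b><sub><it>&#947; </it></sub>is computed from the columns of the data matrix with <it>&#947;</it><sub><it>j </it></sub>= 1.</p>

```python
import numpy as np
from scipy.special import expit

def gaussian_kernel(X, width):
    # Hypothetical Gaussian kernel exp(-||x - x'||^2 / (2 width^2)).
    # With no selected columns this returns a matrix of ones (an assumption).
    sq = np.sum(X**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.exp(-np.maximum(d2, 0.0) / (2.0 * width**2))

def log_post_gamma(gamma, X, z, b, sigma2, pi, width):
    # Logarithm of the right-hand side of Eq. (13), up to an additive constant.
    n = X.shape[0]
    K = gaussian_kernel(X[:, gamma.astype(bool)], width)
    C = K + sigma2 * np.eye(n)
    r = z - b
    _, logdet = np.linalg.slogdet(C)
    quad = r @ np.linalg.solve(C, r)
    log_prior = np.sum(gamma * np.log(pi) + (1 - gamma) * np.log1p(-pi))
    return -0.5 * logdet - 0.5 * quad + log_prior

def gibbs_sweep_gamma(gamma, X, z, b, sigma2, pi, width, rng):
    # Update each gamma_j in turn from its two-point conditional, Eq. (14).
    gamma = gamma.copy()
    for j in range(gamma.size):
        logp = np.empty(2)
        for v in (0, 1):
            gamma[j] = v
            logp[v] = log_post_gamma(gamma, X, z, b, sigma2, pi, width)
        p1 = expit(logp[1] - logp[0])  # P(gamma_j = 1 | everything else)
        gamma[j] = rng.binomial(1, p1)
    return gamma
```

               <p>Evaluating the full expression (13) for both values of <it>&#947;</it><sub><it>j </it></sub>is equivalent to using (14), because the prior terms of the other components cancel in the ratio. The sketch favors clarity over speed: each update recomputes the kernel matrix from scratch.</p>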
               <p><b>(II) Sampling from t</b>|<b>&#947;</b>, <it>b</it>, <b>z</b>, <it>&#963;</it><sup>2</sup>, <b>&#952;</b></p>
               <p>As shown by Eq. (A6) in the Appendix, the conditional distribution <it>P</it>(<b>t</b>|<b>z</b>, <it>b</it>) is Gaussian:</p>
               <p><b>t</b>|<b>z</b>, <it>b </it>~ <it>N</it>((<b>I</b><sub><b>n </b></sub>- <b>&#937;</b>(<b>&#937; </b>+ <b>K</b><sub><b>&#947;</b></sub>)<sup>-1</sup>)(<b>z </b>- <it>b</it><b>1</b><sub><b>n</b></sub>), <b>&#937; </b>- <b>&#937;</b>(<b>&#937; </b>+ <b>K</b><sub><b>&#947;</b></sub>)<sup>-1</sup><b>&#937;</b>), &#160;&#160;&#160; (15)</p>
               <p>where <b>K</b><sub><b>&#947; </b></sub>and <b>&#937; </b>are defined in Eq. (12).</p>
               <p>We can thus draw <b>t </b>from this Gaussian distribution given <b>z </b>and <it>b</it>.</p>
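               <p>A minimal sketch of the draw in (15) is given below. It assumes <b>&#937; </b>= <it>&#963;</it><sup>2</sup><b>I</b><sub><b>n</b></sub>, which is our reading of Eq. (12) rather than something restated in this subsection, and the function names are hypothetical.</p>

```python
import numpy as np

def t_conditional(K, z, b, sigma2):
    # Mean and covariance of t given z and b, following Eq. (15).
    # Assumption: Omega = sigma2 * I_n.
    n = z.size
    Omega = sigma2 * np.eye(n)
    # A = Omega (Omega + K)^{-1}; both matrices are symmetric, so a
    # transposed linear solve avoids forming an explicit inverse.
    A = np.linalg.solve(Omega + K, Omega).T
    mean = (np.eye(n) - A) @ (z - b)
    cov = Omega - A @ Omega
    cov = 0.5 * (cov + cov.T)  # symmetrize against round-off
    return mean, cov

def sample_t(K, z, b, sigma2, rng):
    # One draw of the latent vector t from its Gaussian conditional.
    mean, cov = t_conditional(K, z, b, sigma2)
    return rng.multivariate_normal(mean, cov)
```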
               <p><b>(III) Sampling from z</b>|<b>t</b>, <it>b</it>, <it>&#963;</it><sup>2</sup>, <b>y</b></p>
               <p>Given the class label vector <b>y</b>, the conditional distribution of <b>z </b>given <b>t </b>is a truncated Gaussian distribution, and we have the following formula for <it>i </it>= 1,2,..., <it>n</it>:</p>
               <p><it>z</it><sub><it>i</it></sub>|<it>t</it><sub><it>i</it></sub>, <it>b</it>, <it>&#963;</it><sup>2</sup>, <it>y</it><sub><it>i </it></sub>= 1 ~ <it>N</it>(<it>t</it><sub><it>i </it></sub>+ <it>b</it>, <it>&#963;</it><sup>2</sup>) truncated at the left by 0,</p>
               <p><it>z</it><sub><it>i</it></sub>|<it>t</it><sub><it>i</it></sub>, <it>b</it>, <it>&#963;</it><sup>2</sup>, <it>y</it><sub><it>i </it></sub>= -1 ~ <it>N</it>(<it>t</it><sub><it>i </it></sub>+ <it>b</it>, <it>&#963;</it><sup>2</sup>) truncated at the right by 0. &#160;&#160;&#160; (16)</p>
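               <p>The truncated Gaussian draws in (16) could, for example, be implemented with <it>scipy.stats.truncnorm</it>, which expects its truncation bounds in standardized units. The function name <it>sample_z</it> is hypothetical, and labels are assumed to be coded as 1 and -1 as in the text.</p>

```python
import numpy as np
from scipy.stats import truncnorm

def sample_z(t, b, sigma2, y, rng):
    # Draw each z_i from N(t_i + b, sigma2), truncated to be positive
    # when y_i = 1 and negative when y_i = -1, as in Eq. (16).
    mu = t + b
    sd = np.sqrt(sigma2)
    cut = (0.0 - mu) / sd  # the truncation point 0 in standardized units
    lower = np.where(y == 1, cut, -np.inf)
    upper = np.where(y == 1, np.inf, cut)
    return truncnorm.rvs(lower, upper, loc=mu, scale=sd,
                         size=mu.shape, random_state=rng)
```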
               <p><b>(IV) Sampling from </b><it>b</it>|<b>z</b>, <b>t</b>, <it>&#963;</it><sup>2</sup></p>
               <p>When <b>z </b>and <b>t </b>are both given, this reduces to an ordinary linear regression with only an intercept term. Under the non-informative prior given by (11a), this yields</p>
               <p><it>b</it>|<b>z</b>, <b>t</b>, <it>&#963;</it><sup>2 </sup>~ <it>N</it>(<it>&#956;</it>, <it>&#963;</it><sup>2</sup>/<it>n</it>), where <m:math name="1471-2105-8-67-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>&#956;</m:mi><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mi>n</m:mi></m:mfrac><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mi>z</m:mi><m:mi>i</m:mi></m:msub><m:mo>&#8722;</m:mo><m:msub><m:mi>t</m:mi><m:mi>i</m:mi></m:msub><m:mo stretchy="false">)</m:mo></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWF8oqBcqGH9aqpdaWcaaqaaiabigdaXaqaaiabd6gaUbaadaaeWbqaaiabcIcaOiabdQha6naaBaaaleaacqWGPbqAaeqaaOGaeyOeI0IaemiDaq3aaSbaaSqaaiabdMgaPbqabaGccqGGPaqkaSqaaiabdMgaPjabg2da9iabigdaXaqaaiabd6gaUbqdcqGHris5aaaa@417B@</m:annotation></m:semantics></m:math>. &#160;&#160;&#160; (17a)</p>
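               <p>Step (IV) is a single univariate Gaussian draw. A sketch under the notation of (17a), with a hypothetical function name:</p>

```python
import numpy as np

def sample_b(z, t, sigma2, rng):
    # Draw the intercept b from N(mu, sigma2 / n), where mu is the
    # mean of the residuals z_i - t_i, as in Eq. (17a).
    n = z.size
    mu = np.mean(z - t)
    return rng.normal(mu, np.sqrt(sigma2 / n))
```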
               <p><b>(V) Sampling from </b><it>&#963;</it><sup>2</sup>|<b>z</b>, <b>t</b>, <it>b</it></p>
               <p>With <it>IG</it>(<it>&#945;</it>, <it>&#946;</it>), <it>&#945; </it>> 0, <it>&#946; </it>> 0, as the prior for <it>&#963;</it><sup>2</sup>, the conditional posterior distribution for <it>&#963;</it><sup>2 </sup>is also an inverted gamma distribution. That is,</p>
               <p><it>&#963;</it><sup>2</sup>|<b>z</b>, <b>t</b>, <it>b </it>~ <it>IG</it>(<it>&#945; </it>+ <it>n</it>/2, <it>&#946; </it>+ <it>ns</it><sup>2</sup>/2), where <m:math name="1471-2105-8-67-i17" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msup><m:mi>s</m:mi><m:mn>2</m:mn></m:msup><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mi>n</m:mi></m:mfrac><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover><m:mrow><m:msup><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mi>z</m:mi><m:mi>i</m:mi></m:msub><m:mo>&#8722;</m:mo><m:msub><m:mi>t</m:mi><m:mi>i</m:mi></m:msub><m:mo>&#8722;</m:mo><m:mi>b</m:mi><m:mo stretchy="false">)</m:mo></m:mrow><m:mn>2</m:mn></m:msup></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGZbWCdaahaaWcbeqaaiabikdaYaaakiabg2da9maalaaabaGaeGymaedabaGaemOBa4gaamaaqahabaGaeiikaGIaemOEaO3aaSbaaSqaaiabdMgaPbqabaGccqGHsislcqWG0baDdaWgaaWcbaGaemyAaKgabeaakiabgkHiTiabdkgaIjabcMcaPmaaCaaaleqabaGaeGOmaidaaaqaaiabdMgaPjabg2da9iabigdaXaqaaiabd6gaUbqdcqGHris5aaaa@45A4@</m:annotation></m:semantics></m:math>. &#160;&#160;&#160; (17b)</p>
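               <p>The draw in (17b) can be sketched as below. The sketch assumes the shape-scale parameterization of the inverted gamma distribution, in which the density of <it>IG</it>(<it>&#945;</it>, <it>&#946;</it>) at <it>x</it> is proportional to <it>x</it><sup>-<it>&#945;</it>-1</sup>exp(-<it>&#946;</it>/<it>x</it>); if the prior specification uses a different parameterization, the second argument must be adapted. The function name is hypothetical.</p>

```python
import numpy as np

def sample_sigma2(z, t, b, alpha, beta, rng):
    # Draw sigma2 from IG(alpha + n/2, beta + n s^2 / 2), Eq. (17b).
    # If G follows Gamma(shape, 1), then scale / G follows IG(shape, scale).
    n = z.size
    resid = z - t - b
    shape = alpha + 0.5 * n
    scale = beta + 0.5 * np.sum(resid**2)  # np.sum(resid**2) equals n s^2
    return scale / rng.gamma(shape)
```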
            </sec>
            <sec>
               <st>
                  <p>Kernel parameter tuning</p>
               </st>
               <p>One of the major advantages of kernel-induced learning methods is that one can explore non-linear features of the underlying model for a given learning problem by applying different kernels. It is therefore necessary to discuss the issue of kernel parameter tuning. With the KIGP framework constructed above, this turns out to be rather straightforward.</p>
               <p>As in the previous section, we denote the kernel parameter(s) as <b>&#952;</b>, which can be either a scalar (e.g. the width parameter of a GK or the degree parameter of a PK) or a vector. For algorithmic convenience, we work with the logarithm of the conditional likelihood for the parameter <b>&#952;</b>:</p>
               <p>
                  <m:math name="1471-2105-8-67-i18" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>L</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>&#952;</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>=</m:mo>
                                       <m:mi>log</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>P</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>z</m:mi>
                                       <m:mo>|</m:mo>
                                       <m:mi>&#947;</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mi>b</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:msup>
                                          <m:mi>&#963;</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                       <m:mi>&#952;</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>=</m:mo>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mfrac>
                                          <m:mn>1</m:mn>
                                          <m:mn>2</m:mn>
                                       </m:mfrac>
                                       <m:mi>log</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>det</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>K</m:mi>
                                          <m:mi>&#947;</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>&#952;</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>+</m:mo>
                                       <m:msup>
                                          <m:mi>&#963;</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msup>
                                       <m:msub>
                                          <m:mi>I</m:mi>
                                          <m:mi>n</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>&#8722;</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mfrac>
                                          <m:mn>1</m:mn>
                                          <m:mn>2</m:mn>
                                       </m:mfrac>
                                       <m:mo stretchy="false">[</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>z</m:mi>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mi>b</m:mi>
                                       <m:msub>
                                          <m:mn>1</m:mn>
                                          <m:mi>n</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>'</m:mo>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>K</m:mi>
                                                <m:mi>&#947;</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>&#952;</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:msup>
                                                <m:mi>&#963;</m:mi>
                                                <m:mn>2</m:mn>
                                             </m:msup>
                                             <m:msub>
                                                <m:mi>I</m:mi>
                                                <m:mi>n</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mn>1</m:mn>
                                          </m:mrow>
                                       </m:msup>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>z</m:mi>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mi>b</m:mi>
                                       <m:msub>
                                          <m:mn>1</m:mn>
                                          <m:mi>n</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo stretchy="false">]</m:mo>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mfrac>
                                          <m:mi>n</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:mfrac>
                                       <m:mi>log</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>2</m:mn>
                                       <m:mi>&#960;</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                           <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:mn>18</m:mn>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaafaqabeGabaaabaGaemitaWKaeiikaGccceGae8hUdeNaeiykaKIaeyypa0JagiiBaWMaei4Ba8Maei4zaCMaeiikaGIaemiuaaLaeiikaGccbeGae4NEaONaeiiFaWNae83SdCMaeiilaWIaemOyaiMaeiilaWccciGae03Wdm3aaWbaaSqabeaacqaIYaGmaaGccqGGSaalcqWF4oqCcqGGPaqkcqGGPaqkcqGH9aqpcqGHsisldaWcaaqaaiabigdaXaqaaiabikdaYaaacyGGSbaBcqGGVbWBcqGGNbWzcqGGOaakcyGGKbazcqGGLbqzcqGG0baDcqGGOaakcqGFlbWsdaWgaaWcbaGae03SdCgabeaakiabcIcaOiab=H7aXjabcMcaPiabgUcaRiab9n8aZnaaCaaaleqabaGaeGOmaidaaOGae4xsaK0aaSbaaSqaaiab+5gaUbqabaGccqGGPaqkcqGGPaqkcqGHsislaeaadaWcaaqaaiabigdaXaqaaiabikdaYaaacqGGBbWwcqGGOaakcqGF6bGEcqGHsislcqWGIbGycqGFXaqmdaWgaaWcbaGae4NBa4gabeaakiabcMcaPiabcEcaNiabcIcaOiab+TealnaaBaaaleaacqqFZoWzaeqaaOGaeiikaGIae8hUdeNaeiykaKIaey4kaSIae03Wdm3aaWbaaSqabeaacqaIYaGmaaGccqGFjbqsdaWgaaWcbaGae4NBa4gabeaakiabcMcaPmaaCaaaleqabaGaeyOeI0IaeGymaedaaOGaeiikaGIae4NEaONaeyOeI0IaemOyaiMae4xmaeZaaSbaaSqaaiab+5gaUbqabaGccqGGPaqkcqGGDbqxcqGHsisldaWcaaqaaiabd6gaUbqaaiabikdaYaaacyGGSbaBcqGGVbWBcqGGNbWzcqGGOaakcqaIYaGmcqqFapaCcqGGPaqkaaGaaCzcaiaaxMaadaqadaqaaiabigdaXiabiIda4aGaayjkaiaawMcaaaaa@9B75@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>With a proper prior distribution for <b>&#952;</b>, <it>P</it>(<b>&#952;</b>), we have:</p>
               <p><it>P</it>(<b>&#952;</b>|<b>z</b>, <b>&#947;</b>, <it>b</it>, <it>&#963;</it><sup>2</sup>) &#8733; exp (<it>L</it>(<b>&#952;</b>))* <it>P</it>(<b>&#952;</b>), &#160;&#160;&#160; (19)</p>
               <p>where <it>L</it>(<b>&#952;</b>) is defined in (18). In this paper, we focus on two kernel types: the polynomial kernel and the Gaussian kernel, as defined in (6b) and (6c) respectively. For a GK, with the prior for the width parameter given in the "Prior specification" subsection, the resulting posterior distribution given by (19) is non-regular. We therefore apply the Metropolis-Hastings algorithm (details can be found in <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>) to draw the sample. For a PK, we simply calculate the likelihood with respect to each <it>d </it>by (18) and sample <it>d </it>accordingly. Sometimes, one may need the gradient of <it>L</it>(<b>&#952;</b>) with respect to <b>&#952; </b>(assume <b>&#952; </b>= [<it>&#952;</it><sub>1</sub>,..., <it>&#952;</it><sub><it>J</it></sub>]') when adopting other plausible algorithms:</p>
               <p>
                  <m:math name="1471-2105-8-67-i19" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mo>&#8706;</m:mo>
                                 <m:mi>L</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>&#952;</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mo>&#8706;</m:mo>
                                 <m:msub>
                                    <m:mi>&#952;</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>=</m:mo>
                           <m:mo>&#8722;</m:mo>
                           <m:mfrac>
                              <m:mn>1</m:mn>
                              <m:mn>2</m:mn>
                           </m:mfrac>
                           <m:mi>T</m:mi>
                           <m:mi>r</m:mi>
                           <m:mi>a</m:mi>
                           <m:mi>c</m:mi>
                           <m:mi>e</m:mi>
                           <m:mo stretchy="false">[</m:mo>
                           <m:msub>
                              <m:mi>&#937;</m:mi>
                              <m:mi>&#947;</m:mi>
                           </m:msub>
                           <m:msup>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>&#952;</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:msup>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mo>&#8706;</m:mo>
                                 <m:msub>
                                    <m:mi>&#937;</m:mi>
                                    <m:mi>&#947;</m:mi>
                                 </m:msub>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>&#952;</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mo>&#8706;</m:mo>
                                 <m:msub>
                                    <m:mi>&#952;</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo stretchy="false">]</m:mo>
                           <m:mo>+</m:mo>
                           <m:mfrac>
                              <m:mn>1</m:mn>
                              <m:mn>2</m:mn>
                           </m:mfrac>
                           <m:mo stretchy="false">[</m:mo>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>z</m:mi>
                           <m:mo>&#8722;</m:mo>
                           <m:mi>b</m:mi>
                           <m:msub>
                              <m:mn>1</m:mn>
                              <m:mi>n</m:mi>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>'</m:mo>
                           <m:msub>
                              <m:mi>&#937;</m:mi>
                              <m:mi>&#947;</m:mi>
                           </m:msub>
                           <m:msup>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>&#952;</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:msup>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mo>&#8706;</m:mo>
                                 <m:msub>
                                    <m:mi>&#937;</m:mi>
                                    <m:mi>&#947;</m:mi>
                                 </m:msub>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>&#952;</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mo>&#8706;</m:mo>
                                 <m:msub>
                                    <m:mi>&#952;</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                              </m:mrow>
                           </m:mfrac>
                           <m:msub>
                              <m:mi>&#937;</m:mi>
                              <m:mi>&#947;</m:mi>
                           </m:msub>
                           <m:msup>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>&#952;</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:msup>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>z</m:mi>
                           <m:mo>&#8722;</m:mo>
                           <m:mi>b</m:mi>
                           <m:msub>
                              <m:mn>1</m:mn>
                              <m:mi>n</m:mi>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo stretchy="false">]</m:mo>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaWcaaqaaiabgkGi2kabdYeamjabcIcaOGGabiab=H7aXjabcMcaPaqaaiabgkGi2IGaciab+H7aXnaaBaaaleaacqWGPbqAaeqaaaaakiabg2da9iabgkHiTmaalaaabaGaeGymaedabaGaeGOmaidaaiabdsfaujabdkhaYjabdggaHjabdogaJjabdwgaLjabcUfaBjabfM6axnaaBaaaleaacqGFZoWzaeqaaOGaeiikaGIae8hUdeNaeiykaKYaaWbaaSqabeaacqGHsislcqaIXaqmaaGcdaWcaaqaaiabgkGi2kabfM6axnaaBaaaleaacqGFZoWzaeqaaOGaeiikaGIae8hUdeNaeiykaKcabaGaeyOaIyRae4hUde3aaSbaaSqaaiabdMgaPbqabaaaaOGaeiyxa0Laey4kaSYaaSaaaeaacqaIXaqmaeaacqaIYaGmaaGaei4waSLaeiikaGccbeGae0NEaONaeyOeI0IaemOyaiMae0xmaeZaaSbaaSqaaiab95gaUbqabaGccqGGPaqkcqGGNaWjcqqHPoWvdaWgaaWcbaGae43SdCgabeaakiabcIcaOiab=H7aXjabcMcaPmaaCaaaleqabaGaeyOeI0IaeGymaedaaOWaaSaaaeaacqGHciITcqqHPoWvdaWgaaWcbaGae43SdCgabeaakiabcIcaOiab=H7aXjabcMcaPaqaaiabgkGi2kab+H7aXnaaBaaaleaacqWGPbqAaeqaaaaakiabfM6axnaaBaaaleaacqGFZoWzaeqaaOGaeiikaGIae8hUdeNaeiykaKYaaWbaaSqabeaacqGHsislcqaIXaqmaaGccqGGOaakcqqF6bGEcqGHsislcqWGIbGycqqFXaqmdaWgaaWcbaGae0NBa4gabeaakiabcMcaPiabc2faDjabcYcaSaaa@8F80@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>where &#937;<sub><it>&#947;</it></sub>(<b>&#952;</b>) = <b>K</b><sub><it>&#947; </it></sub>(<b>&#952;</b>) + <it>&#963;</it><sup>2</sup><b>I<sub>n</sub></b>, <it>i </it>= 1,..., <it>J</it>. &#160;&#160;&#160; (20)</p>
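               <p>As an illustration, (18) and (20) can be evaluated numerically for a single width parameter. The following is only a minimal sketch: the function names are hypothetical, the Gaussian kernel is assumed to have the form exp(-||x - x'||<sup>2</sup>/(2<it>r</it><sup>2</sup>)) (the exact parameterization in (6c) may differ), and <b>X</b> is assumed to hold only the columns of the currently selected genes.</p>

```python
import numpy as np


def gaussian_kernel(X, r):
    """Assumed GK form exp(-||x - x'||^2 / (2 r^2)); (6c) may differ by a constant."""
    sq = np.sum(X ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    return np.exp(-d2 / (2.0 * r ** 2)), d2


def log_lik_and_grad(X, z, b, sigma2, r):
    """Return L(r) as in (18) and dL/dr as in (20) for the assumed GK."""
    n = X.shape[0]
    K, d2 = gaussian_kernel(X, r)
    omega = K + sigma2 * np.eye(n)
    chol = np.linalg.cholesky(omega)          # omega is symmetric positive definite
    resid = z - b                             # z - b * 1_n
    # alpha = omega^{-1} (z - b 1_n), obtained from the Cholesky factor
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, resid))
    log_det = 2.0 * np.sum(np.log(np.diag(chol)))
    L = -0.5 * log_det - 0.5 * resid @ alpha - 0.5 * n * np.log(2.0 * np.pi)
    d_omega = K * d2 / r ** 3                 # derivative of the assumed GK w.r.t. r
    omega_inv = np.linalg.solve(chol.T, np.linalg.solve(chol, np.eye(n)))
    grad = -0.5 * np.trace(omega_inv @ d_omega) + 0.5 * alpha @ d_omega @ alpha
    return L, grad
```

               <p>A central finite difference of the returned log-likelihood is a convenient check of the returned gradient.</p>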
               <p>Theoretically, the proposed KIGP with the linear kernel performs very close to most other classical linear methods. As the width parameter of a Gaussian kernel increases above 1, within a reasonable range, the KIGP with such a GK performs fairly close to the KIGP with a linear kernel. Conversely, as the width decreases below 1, the behavior of the KIGP in the observation space becomes highly non-linear. When the degree of the polynomial kernel of a KIGP increases, the non-linearity of the KIGP also increases. When the degree is equal to 1, the only difference between the PK(1) and the linear kernel is a constant. In short, within a kernel class, different values of the kernel parameter represent different feature spaces. For certain kernel parameter values, the performance of the KIGP with a GK or a PK will be close to that of the KIGP with a linear kernel, or of a classical linear model in general. Therefore, the posterior distribution of the kernel parameter provides some clues as to what kind of feature space is more appropriate for the target problem with the given training samples.</p>
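               <p>The Metropolis-Hastings draw of the GK width parameter from (19), mentioned earlier in this subsection, could be organized as in the sketch below. This is not the authors' implementation: the random-walk proposal on the logarithm of the width, the step size and the name <it>mh_step_width</it> are assumptions made for illustration, and <it>log_post</it> stands for a user-supplied function returning <it>L</it>(<it>r</it>) plus the log prior.</p>

```python
import numpy as np


def mh_step_width(r, log_post, rng, step=0.2):
    """One random-walk Metropolis-Hastings update of a positive width r.

    log_post(r) should return L(r) + log P(r) up to a constant, as in (19).
    The proposal is a Gaussian random walk on log(r) (an assumed choice).
    """
    r_new = r * np.exp(step * rng.normal())
    # The log(r_new / r) term is the Jacobian of the log-scale proposal.
    log_ratio = log_post(r_new) - log_post(r) + np.log(r_new / r)
    if np.log(rng.uniform()) >= log_ratio:
        return r          # reject: keep the current value
    return r_new          # accept the proposal
```

               <p>Proposing on the log scale keeps the width positive without an explicit boundary check; other proposals are equally admissible.</p>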
            </sec>
            <sec>
               <st>
                  <p>Proposed Gibbs sampler</p>
               </st>
               <p>With the derivation above and a given kernel type, we propose our Gibbs sampling algorithm as follows:</p>
               <p>1. Start with proper initial values [<b>&#947;</b><sup>[0]</sup>, <it>b</it><sup>[0]</sup>, <b>t</b><sup>[0]</sup>, <b>z</b><sup>[0]</sup>, <it>&#963;</it><sup>2[0]</sup>, <it>&#952;</it><sup>[0]</sup>]; then set <it>i </it>= 1.</p>
               <p>2. Sample <b>z</b><sup>[<it>i</it>] </sup>from <b>z</b>|<b>t</b><sup>[<it>i</it>-1]</sup>, <it>b</it><sup>[<it>i</it>-1]</sup>, <it>&#963;</it><sup>2[<it>i</it>-1] </sup>via (16).</p>
               <p>3. Sample <b>t</b><sup>[<it>i</it>] </sup>from <b>t</b>|<b>&#947;</b><sup>[<it>i</it>-1]</sup>, <it>b</it><sup>[<it>i</it>-1]</sup>, <b>z</b><sup>[<it>i</it>]</sup>, <it>&#963;</it><sup>2[<it>i</it>-1]</sup>, <it>&#952;</it><sup>[<it>i</it>-1] </sup>via (15).</p>
               <p>4. Sample <it>b</it><sup>[<it>i</it>] </sup>from <it>b</it>|<b>z</b><sup>[<it>i</it>]</sup>, <b>t</b><sup>[<it>i</it>]</sup>, <it>&#963;</it><sup>2[<it>i</it>-1] </sup>via (17a).</p>
               <p>5. Sample <it>&#963;</it><sup>2[<it>i</it>] </sup>from <it>&#963;</it><sup>2</sup>|<b>z</b><sup>[<it>i</it>]</sup>, <b>t</b><sup>[<it>i</it>]</sup>, <it>b</it><sup>[<it>i</it>] </sup>via (17b).</p>
               <p>6. Sample <b>&#947;</b><sup>[<it>i</it>] </sup>from <b>&#947;</b>|<b>z</b><sup>[<it>i</it>]</sup>, <it>b</it><sup>[<it>i</it>]</sup>, <it>&#963;</it><sup>2[<it>i</it>]</sup>, <it>&#952;</it><sup>[<it>i</it>-1] </sup>via (14) component-wise.</p>
               <p>7. Sample <it>&#952;</it><sup>[<it>i</it>] </sup>from <it>&#952;</it>|<b>z</b><sup>[<it>i</it>]</sup>, <it>b</it><sup>[<it>i</it>]</sup>, <it>&#963;</it><sup>2[<it>i</it>]</sup>, <b>&#947;</b><sup>[<it>i</it>]</sup>.</p>
               <p>8. Set <it>i </it>= <it>i </it>+ 1 and return to step 2 until the required number of iterations is reached.</p>
               <p>9. Stop. &#160;&#160;&#160; (21)</p>
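               <p>The sweep order of procedure (21) can be sketched in Python as follows. The function <it>run_gibbs</it> and the dictionary of draw functions are hypothetical scaffolding: each draw function is a placeholder for the corresponding conditional in (14)-(17) or for the kernel-parameter update, none of which is implemented here.</p>

```python
def run_gibbs(init, samplers, n_iter):
    """Run the scan in (21); `samplers` maps a block name to a draw function.

    Each draw function takes the current state (a dict) and returns a new
    value for its block; the functions themselves are placeholders for the
    conditionals (14)-(17) and the kernel-parameter update.
    """
    order = ["z", "t", "b", "sigma2", "gamma", "theta"]   # steps 2 to 7
    state = dict(init)
    trace = []
    for _ in range(n_iter):
        for name in order:
            # Later blocks see the values already refreshed in this sweep.
            state[name] = samplers[name](state)
        trace.append(dict(state))
    return trace
```

               <p>Holding a block fixed, for example the kernel parameter after it has been fitted, amounts to supplying a draw function that returns the current value unchanged.</p>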
               <p>In the above procedure, the kernel parameter <it>&#952; </it>denotes the degree parameter "d" of a polynomial kernel or the width parameter "r" of a Gaussian kernel. In step 2, we follow the optimal exponential accept-reject algorithm suggested by Robert <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> to draw from a truncated Gaussian distribution. After a suitable burn-in period, we can obtain the posterior samples of [<b>z</b><sup>[<it>i</it>]</sup>, <b>t</b><sup>[<it>i</it>]</sup>, <it>b</it><sup>[<it>i</it>]</sup>, <it>&#963;</it><sup>2[<it>i</it>]</sup>, <b>&#947;</b><sup>[<it>i</it>]</sup>, <it>&#952;</it><sup>[<it>i</it>]</sup>] at the <it>i </it>th iteration with the procedure described in (21). The core calculation of the proposed Gibbs sampler is the inversion of the matrix <b>K</b><sub><it>&#947; </it></sub>+ <it>&#963;</it><sup>2</sup><b>I</b>. Since the kernel matrix <b>K</b><sub><it>&#947; </it></sub>is symmetric and non-negative definite, <b>K</b><sub><it>&#947; </it></sub>+ <it>&#963;</it><sup>2</sup><b>I </b>is symmetric and positive definite. Therefore, the inverse always exists and the Cholesky decomposition can be applied in the numerical computation. The total computational complexity of the proposed Gibbs sampler within each iteration is <it>O</it>(<it>pn</it><sup>3</sup>).</p>
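               <p>For reference, the exponential accept-reject scheme for a one-sided truncated standard normal can be sketched as below. This is a minimal illustration of the published algorithm rather than the authors' code; the function name is hypothetical, and only the case of a standard normal truncated to [<it>a</it>, &#8734;) is covered.</p>

```python
import numpy as np


def sample_left_truncated_normal(a, rng):
    """Draw from N(0, 1) restricted to [a, inf) using a shifted-exponential proposal."""
    alpha = 0.5 * (a + np.sqrt(a * a + 4.0))      # optimal exponential rate
    while True:
        x = a + rng.exponential(1.0 / alpha)      # proposal: a + Exp(rate alpha)
        accept_prob = np.exp(-0.5 * (x - alpha) ** 2)
        if accept_prob >= rng.uniform():
            return x
```

               <p>A draw from a Gaussian with mean <it>m</it> and standard deviation <it>s</it> truncated to [<it>c</it>, &#8734;) follows by sampling with <it>a</it> = (<it>c</it> - <it>m</it>)/<it>s</it> and returning <it>m</it> + <it>s</it> times the result; truncation from above can be handled by reflection. The scheme is most efficient when <it>a</it> is positive.</p>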
            </sec>
            <sec>
               <st>
                  <p>Overall algorithm</p>
               </st>
               <p>In Fig. <figr fid="F1">1</figr>, we summarize the general framework of the proposed KIGP method. The box bounded by the dotted lines represents the KIGP learning algorithm. A kernel type is assumed to be given a priori. The algorithm has a cascading structure and is composed of three consecutive phases: the "kernel parameter fitting phase", the "gene selection phase" and the "prediction phase". Although in the Bayesian sense one can include all the parameters in the proposed Gibbs sampler for all three phases, we suggest fixing the kernel parameter(s) after the "kernel parameter fitting phase" and fixing the gene-selection vector after the "gene selection phase" for practicality. Very often, we are only interested in the area around the peak of the posterior PDF (or probability mass function (PMF)) of a parameter, especially for the kernel parameter(s) and the gene-selection vector. This strategy leads to a much faster convergence of the proposed Gibbs sampler as long as the posterior PDF or PMF of the kernel parameter(s) is unimodal. In each of the three phases, a suitable number of initial iterations must be discarded as the burn-in period. Dynamic monitoring strategies can be used to track the convergence of an MCMC simulation (e.g. those in <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>).</p>
               <p>A practical issue needs to be addressed here. It is better to fix the variance parameter <it>&#963;</it><sup>2 </sup>at a proper constant during the "kernel parameter fitting phase" and the "gene selection phase", because this makes the proposed algorithm numerically more stable and speeds up its convergence. For all the simulations in this paper, as in a regular probit regression model, we set <it>&#963;</it><sup>2 </sup>equal to 1 in the first two phases (replacing the draw in step 5 of (21)) and only include it in the Gibbs sampler in the "prediction phase". More details of each phase are described as follows.</p>
            </sec>
            <sec>
               <st>
                  <p>Kernel Parameter Fitting Phase</p>
               </st>
               <p>In the kernel parameter fitting phase, our primary interest is to find the appropriate value(s) for the kernel parameter(s) of the given kernel type. In this study, we focus on two kernel types, the polynomial kernel and the Gaussian kernel. Given the training set <b>X </b>and <b>y</b>, we first include all model parameters (except <it>&#963;</it><sup>2</sup>), the gene selection vector and the kernel parameter in the simulation of the algorithm given by (21). After convergence, the samples obtained from (21) within each iteration are drawn from the joint posterior distribution of all the parameters. For a PK, since the degree parameter is discrete, we simply take the degree value with the highest posterior probability. For a GK, we calculate the histogram of the sample values of the width parameter with a suitable number of bins. Then we use a Gaussian smoother to smooth over the histogram bars (similar to a Gaussian kernel density estimation). Finally, we take the center of the bin with the highest smoothed count as the best fitted value of the width parameter.</p>
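               <p>The histogram-and-smoother rule for the GK width could be coded along the following lines. The number of bins and the smoother bandwidth are not specified in the text, so the defaults below (and the function name <it>fit_width</it>) are assumptions chosen only for illustration.</p>

```python
import numpy as np


def fit_width(samples, n_bins=50, smooth_sd_bins=2.0):
    """Return the bin center at the peak of a Gaussian-smoothed histogram."""
    counts, edges = np.histogram(samples, bins=n_bins)
    centers = 0.5 * (edges[:-1] + edges[1:])
    idx = np.arange(n_bins)
    # Gaussian weights between every pair of bins; rows are normalised to sum to 1.
    weights = np.exp(-0.5 * ((idx[:, None] - idx[None, :]) / smooth_sd_bins) ** 2)
    weights /= weights.sum(axis=1, keepdims=True)
    smoothed = weights @ counts
    return centers[np.argmax(smoothed)]
```

               <p>For a PK, the analogous step is simply to take the most frequent sampled degree.</p>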
            </sec>
            <sec>
               <st>
                  <p>Gene Selection Phase</p>
               </st>
               <p>After the "kernel parameter fitting phase", we fix the kernel parameter(s) at the fitted value(s) and then continue to run the proposed Gibbs sampler. In this subsection, we present an empirical approach to determining whether a gene is potentially significant based on the posterior samples and a given threshold.</p>
               <p>Efron <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> thoroughly discussed an empirical Bayes approach for estimating the null hypothesis based on a large-scale simultaneous t-test. In this paper, we essentially follow the key concept therein to assess whether or not a gene is of significant importance for the given classification problem. We first define a statistic named "Normalized Log-Frequency" (NLF) to measure the relative potential significance of a gene. Denoting by <it>F</it><sub><it>j </it></sub>the frequency with which the <it>j </it>th gene appears in the posterior samples, the NLF is defined as:</p>
               <p>
                  <m:math name="1471-2105-8-67-i20" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable columnalign="left">
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mi>N</m:mi>
                                       <m:mi>L</m:mi>
                                       <m:msub>
                                          <m:mi>F</m:mi>
                                          <m:mi>j</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:mi>L</m:mi>
                                             <m:msub>
                                                <m:mi>F</m:mi>
                                                <m:mi>j</m:mi>
                                             </m:msub>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>&#956;</m:mi>
                                                <m:mi>L</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>s</m:mi>
                                                <m:mi>L</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mfrac>
                                       <m:mo>,</m:mo>
                                       <m:mtext>&#160;where&#160;</m:mtext>
                                       <m:mi>L</m:mi>
                                       <m:msub>
                                          <m:mi>F</m:mi>
                                          <m:mi>j</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mi>log</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>F</m:mi>
                                          <m:mi>j</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>&#956;</m:mi>
                                          <m:mi>L</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mfrac>
                                          <m:mn>1</m:mn>
                                          <m:mi>p</m:mi>
                                       </m:mfrac>
                                       <m:mstyle displaystyle="true">
                                          <m:munderover>
                                             <m:mo>&#8721;</m:mo>
                                             <m:mrow>
                                                <m:mi>j</m:mi>
                                                <m:mo>=</m:mo>
                                                <m:mn>1</m:mn>
                                             </m:mrow>
                                             <m:mi>p</m:mi>
                                          </m:munderover>
                                          <m:mrow>
                                             <m:mi>L</m:mi>
                                             <m:msub>
                                                <m:mi>F</m:mi>
                                                <m:mi>j</m:mi>
                                             </m:msub>
                                             <m:mo>,</m:mo>
                                             <m:mtext>&#160;</m:mtext>
                                          </m:mrow>
                                       </m:mstyle>
                                       <m:msubsup>
                                          <m:mi>s</m:mi>
                                          <m:mi>L</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                       <m:mo>=</m:mo>
                                       <m:mfrac>
                                          <m:mn>1</m:mn>
                                          <m:mrow>
                                             <m:mi>p</m:mi>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mn>1</m:mn>
                                          </m:mrow>
                                       </m:mfrac>
                                       <m:mstyle displaystyle="true">
                                          <m:munderover>
                                             <m:mo>&#8721;</m:mo>
                                             <m:mrow>
                                                <m:mi>j</m:mi>
                                                <m:mo>=</m:mo>
                                                <m:mn>1</m:mn>
                                             </m:mrow>
                                             <m:mi>p</m:mi>
                                          </m:munderover>
                                          <m:mrow>
                                             <m:msup>
                                                <m:mrow>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mi>L</m:mi>
                                                   <m:msub>
                                                      <m:mi>F</m:mi>
                                                      <m:mi>j</m:mi>
                                                   </m:msub>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:msub>
                                                      <m:mi>&#956;</m:mi>
                                                      <m:mi>L</m:mi>
                                                   </m:msub>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                                <m:mn>2</m:mn>
                                             </m:msup>
                                             <m:mtext>&#160;&#160;for&#160;&#160;</m:mtext>
                                             <m:mi>j</m:mi>
                                             <m:mo>=</m:mo>
                                             <m:mn>1</m:mn>
                                             <m:mo>,</m:mo>
                                             <m:mn>2</m:mn>
                                             <m:mo>,</m:mo>
                                             <m:mn>...</m:mn>
                                             <m:mo>,</m:mo>
                                             <m:mi>p</m:mi>
                                          </m:mrow>
                                       </m:mstyle>
                                       <m:mo>.</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                           <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:mn>22</m:mn>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaafaqaaeGabaaabaGaemOta4KaemitaWKaemOray0aaSbaaSqaaiabdQgaQbqabaGccqGH9aqpdaWcaaqaaiabdYeamjabdAeagnaaBaaaleaacqWGQbGAaeqaaOGaeyOeI0ccciGae8hVd02aaSbaaSqaaiabdYeambqabaaakeaacqWGZbWCdaWgaaWcbaGaemitaWeabeaaaaGccqGGSaalcqqGGaaicqqG3bWDcqqGObaAcqqGLbqzcqqGYbGCcqqGLbqzcqqGGaaicqWGmbatcqWGgbGrdaWgaaWcbaGaemOAaOgabeaakiabg2da9iGbcYgaSjabc+gaVjabcEgaNjabcIcaOiabdAeagnaaBaaaleaacqWGQbGAaeqaaOGaeiykaKcabaGae8hVd02aaSbaaSqaaiabdYeambqabaGccqGH9aqpdaWcaaqaaiabigdaXaqaaiabdchaWbaadaaeWbqaaiabdYeamjabdAeagnaaBaaaleaacqWGQbGAaeqaaOGaeiilaWIaeeiiaacaleaacqWGQbGAcqGH9aqpcqaIXaqmaeaacqWGWbaCa0GaeyyeIuoakiabdohaZnaaDaaaleaacqWGmbataeaacqaIYaGmaaGccqGH9aqpdaWcaaqaaiabigdaXaqaaiabdchaWjabgkHiTiabigdaXaaadaaeWbqaaiabcIcaOiabdYeamjabdAeagnaaBaaaleaacqWGQbGAaeqaaOGaeyOeI0Iae8hVd02aaSbaaSqaaiabdYeambqabaGccqGGPaqkdaahaaWcbeqaaiabikdaYaaakiabbccaGiabbccaGiabbAgaMjabb+gaVjabbkhaYjabbccaGiabbccaGiabdQgaQjabg2da9iabigdaXiabcYcaSiabikdaYiabcYcaSiabc6caUiabc6caUiabc6caUiabcYcaSiabdchaWbWcbaGaemOAaOMaeyypa0JaeGymaedabaGaemiuaafaniabggHiLdGccqGGUaGlaaGaaCzcaiaaxMaadaqadaqaaiabikdaYiabikdaYaGaayjkaiaawMcaaaaa@98FD@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>In practice, if <it>F</it><sub><it>j </it></sub>is 0, we simply set it to 1/2 divided by the total number of iterations. Our use of the NLF as the key statistic is based on the fact that the logarithm of a gamma-distributed variable can be well approximated by a normal distribution, while a gamma distribution is empirically a proper distribution for the frequency with which any gene from a homogeneous group appears in the posterior samples.</p>
               <p>Suppose that the <it>p </it>NLF-values fall into two classes, "insignificant" and "significant", corresponding to whether or not <it>NLF</it><sub><it>j</it></sub>, for <it>j </it>= 1, 2,..., <it>p</it>, is generated according to the null hypothesis, with prior probabilities <it>Pb</it><sub>0 </sub>and <it>Pb</it><sub>1 </sub>= 1 - <it>Pb</it><sub>0</sub>, for the two classes respectively; and that <it>NLF</it><sub><it>j </it></sub>has the conditional density either <it>f</it><sub>0 </sub>(<it>NLF</it>) or <it>f</it><sub>1</sub>(<it>NLF</it>) depending on its class. That is,</p>
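               <p>A minimal sketch of the NLF computation in (22), including the zero-frequency rule, is given below. It assumes that <it>F</it><sub><it>j</it></sub> is the fraction of posterior draws in which gene <it>j</it> is selected; the function name and argument layout are hypothetical.</p>

```python
import numpy as np


def normalized_log_frequency(counts, n_iter):
    """Compute NLF_j as in (22) from per-gene selection counts over n_iter draws."""
    freq = np.asarray(counts, dtype=float) / n_iter
    freq[freq == 0] = 0.5 / n_iter        # zero-frequency rule from the text
    lf = np.log(freq)
    mu = lf.mean()
    s = lf.std(ddof=1)                    # 1 / (p - 1) normalisation
    return (lf - mu) / s
```

               <p>By construction the returned values have zero sample mean and unit sample variance, which is what makes the standard normal a natural reference density for them.</p>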
               <p><it>Pb</it><sub>0 </sub>= Pr{<it>Insignificant</it>}, Pr(<it>NLF</it>|<it>Insignificant</it>) = <it>f</it><sub>0</sub>(<it>NLF</it>)</p>
               <p><it>Pb</it><sub>1 </sub>= Pr{<it>Significant</it>}, Pr(<it>NLF</it>|<it>Significant</it>) = <it>f</it><sub>1</sub>(<it>NLF</it>) &#160;&#160;&#160; (23)</p>
               <p>The marginal distribution for <it>NLF</it><sub><it>j </it></sub>is thus</p>
               <p>Pr(<it>NLF</it>) = <it>f</it><sub>0</sub>(<it>NLF</it>)*<it>Pb</it><sub>0 </sub>+ <it>f</it><sub>1</sub>(<it>NLF</it>) * <it>Pb</it><sub>1 </sub>= <it>f </it>(<it>NLF</it>) &#160;&#160;&#160; (24)</p>
               <p>By Bayes' formula, the posterior probability of the "insignificant" class given the NLF is therefore</p>
               <p>Pr(<it>Insignificant</it>|<it>NLF</it>) = <it>f</it><sub>0</sub>(<it>NLF</it>) * <it>Pb</it><sub>0</sub>/<it>f</it>(<it>NLF</it>) &#160;&#160;&#160; (25)</p>
               <p>Following <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, we further define the local "false discovery rate (fdr)" by</p>
               <p><it>fdr</it>(<it>NLF</it>) = <it>f</it><sub>0</sub>(<it>NLF</it>)/<it>f</it>(<it>NLF</it>) &#160;&#160;&#160; (26)</p>
               <p>Since in a typical microarray study, <it>Pb</it><sub>0 </sub>generally is very close to 1 (say <it>Pb</it><sub>0 </sub>> 0.99), so <it>fdr</it>(<it>NLF</it>) is a fairly precise estimator for the posterior probability of the null hypothesis (insignificant class) given the statistic NLF. With <it>fdr</it>(<it>NLF</it>), we can decide whether or not a target gene is "significant" at some confidence level accordingly. For all the examples, we report all the genes with fdr smaller than 0.05.</p>
               <p>To calculate <it>fdr</it>(<it>NLF</it>), one needs to estimate <it>f</it>(<it>NLF</it>) and to choose <it>f</it><sub>0</sub>(<it>NLF</it>) properly. To estimate <it>f</it>(<it>NLF</it>), one can use the ensemble of NLF values, {<it>NLF</it><sub><it>j</it></sub>, <it>j </it>= 1, 2,..., <it>p</it>}. We divide the target range of NLF into <it>M </it>equal-length bins, with the center of each bin at <it>x</it><sub><it>i </it></sub>for <it>i </it>= 1,2,..., <it>M</it>. A heuristic choice of <it>M </it>is the maximum NLF value multiplied by 10, rounded up. We then calculate the histogram of the given NLF set over these bins and fit a Gaussian smoother to it. The smoothed output, divided by the product of the bin width and the number of genes (i.e. <it>p</it>), gives an estimate of <it>f</it>(<it>NLF</it>) at the center of each bin.</p>
               <p>The more critical part is the choice of the density of NLF under the null hypothesis, i.e. <it>f</it><sub>0</sub>(<it>NLF</it>). The basic assumption we impose here is that the statistic NLF follows a normal distribution under the null hypothesis. Since <it>Pb</it><sub>1 </sub>is much smaller than <it>Pb</it><sub>0 </sub>(say <it>Pb</it><sub>0 </sub>> 0.99) in most real microarray analysis problems, it is safe to choose the standard normal (zero mean, unit variance) as <it>f</it><sub>0</sub>(<it>NLF</it>) based on the definition (22). Throughout this paper, we always choose the standard normal as the density of NLF under the null hypothesis. (In case <it>Pb</it><sub>0 </sub>&lt; 0.99, more elaborate schemes are needed, and an easy approach can be found in <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>.) Once both <it>f</it>(<it>NLF</it>) and <it>f</it><sub>0</sub>(<it>NLF</it>) are obtained, the local fdr for each gene can be calculated by (26). Based on the local fdr, one can select the "significant" class of genes and fix the gene-selection vector at a given confidence level.</p>
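               <p>The local fdr procedure described above can be sketched in Python roughly as follows. This is a minimal illustration under stated assumptions, not the implementation used in this study: the function name local_fdr, the smoother bandwidth sigma_bins (in units of bins), the capping of the ratio at 1, and the interpolation from bin centers back to individual genes are all hypothetical choices introduced here.</p>
               <p>
```python
import math
import numpy as np


def local_fdr(nlf, sigma_bins=2.0):
    """Estimate fdr(NLF) = f0(NLF) / f(NLF) for every gene (illustrative sketch)."""
    nlf = np.asarray(nlf, dtype=float)
    p = nlf.size

    # Heuristic from the text: M = ceil(10 * max NLF) equal-length bins.
    m = max(int(math.ceil(10.0 * nlf.max())), 1)
    counts, edges = np.histogram(nlf, bins=m)
    width = edges[1] - edges[0]
    centers = 0.5 * (edges[:-1] + edges[1:])

    # Gaussian smoother on the bin counts. The bandwidth is an assumption;
    # each row of weights is renormalized so the edges are handled sensibly.
    idx = np.arange(m)
    weights = np.exp(-0.5 * ((idx[:, None] - idx[None, :]) / sigma_bins) ** 2)
    weights /= weights.sum(axis=1, keepdims=True)
    smoothed = weights @ counts

    # Dividing by (bin width * number of genes) turns counts into a density f.
    f = smoothed / (width * p)

    # Standard normal null density f0, as chosen in the text.
    f0 = np.exp(-0.5 * centers ** 2) / math.sqrt(2.0 * math.pi)

    # Ratio (26); capped at 1 because a ratio of estimates can exceed 1 (assumption).
    fdr_at_centers = np.minimum(f0 / np.maximum(f, 1e-300), 1.0)

    # Read the fdr off at each gene's NLF value by linear interpolation (assumption).
    return np.interp(nlf, centers, fdr_at_centers)
```
               </p>
               <p>With such a function, the genes reported as "significant" would be those whose returned value is smaller than 0.05, matching the threshold used for the examples in this paper.</p>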
            </sec>
            <sec>
               <st>
                  <p>Prediction Phase</p>
               </st>
               <p>After the "gene selection phase", both the kernel parameter(s) and the gene-selection vector have been fixed. We continue to run the proposed Gibbs sampler (21) and the computational complexity of the Gibbs sampler dramatically decreases to <it>O</it>(<it>n</it><sup>3</sup>). After a new proper burn-in period, we can draw samples of <b>z</b>, <it>b </it>and <it>&#963;</it><sup>2 </sup>within each iteration in the "prediction phase". Following (9), the posterior PDF for the output <m:math name="1471-2105-8-67-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>t</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG0baDgaacaaaa@2E2C@</m:annotation></m:semantics></m:math> given the testing data <m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math> in the <it>l</it>-th iteration is Gaussian:</p>
               <p><m:math name="1471-2105-8-67-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>t</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG0baDgaacaaaa@2E2C@</m:annotation></m:semantics></m:math><sup>[<it>l</it>]</sup>|<m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>, <b>X</b><sub><it>&#947;</it></sub>, <b>z</b><sup>[<it>l</it>]</sup>, <it>b</it><sup>[<it>l</it>]</sup>, <it>&#963;</it><sup>2[<it>l</it>] </sup>~ <it>N</it>(<it>f</it>(<m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>, <b>X</b><sub><it>&#947;</it></sub>, <b>z</b><sup>[<it>l</it>]</sup>, <it>b</it><sup>[<it>l</it>]</sup>, <it>&#963;</it><sup>2[<it>l</it>]</sup>), <it>V</it>(<m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>, <b>X</b><sub><it>&#947;</it></sub>, <b>z</b><sup>[<it>l</it>]</sup>, <it>b</it><sup>[<it>l</it>]</sup>, <it>&#963;</it><sup>2[<it>l</it>]</sup>)) = <it>N</it>(<it>f</it><sup>[<it>l</it>]</sup>, <it>V</it><sup>[<it>l</it>]</sup>)</p>
               <p>where <it>f</it><sup>[<it>l</it>] </sup>= (<b>z</b><sup>[<it>l</it>] </sup>- <it>b</it><sup>[<it>l</it>]</sup><b>1</b><sub><it>n</it></sub>)' (<b>K</b><sub><it>&#947; </it></sub>+ <it>&#963;</it><sup>2[<it>l</it>]</sup><b>I</b><sub><it>n</it></sub>)<sup>-1</sup><b>k</b><sub><it>&#947;</it></sub>, <it>V</it><sup>[<it>l</it>] </sup>= <it>K</it>(<m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>, <m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>) - <b>k</b><sub><it>&#947;</it></sub>'(<b>K</b><sub><it>&#947; </it></sub>+ <it>&#963;</it><sup>2[<it>l</it>]</sup><b>I</b><sub><it>n</it></sub>)<sup>-1</sup><b>k</b><sub><it>&#947;</it></sub>,</p>
               <p><b>K</b><sub><it>&#947;,ij </it></sub>= <it>K</it>(<b>x</b><sub><it>&#947;</it>,<b>i</b></sub>, <b>x</b><sub><it>&#947;</it>,<b>j</b></sub>), <b>k</b><sub><it>&#947;,i </it></sub>= <it>K</it>(<m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>&#947;</it></sub>, <b>x</b><sub><it>&#947;,i</it></sub>), for <it>i</it>, <it>j </it>= 1,..., <it>n </it>; <it>l </it>= 1,..., <it>L</it>. &#160;&#160;&#160; (27a)</p>
               <p>Then, the predictive probability for the output label <m:math name="1471-2105-8-67-i21" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>y</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG5bqEgaacaaaa@2E36@</m:annotation></m:semantics></m:math> given <m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math> can be estimated by using the Monte Carlo integration:</p>
               <p>
                  <m:math name="1471-2105-8-67-i22" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable columnalign="left">
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mi>P</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mover accent="true">
                                          <m:mi>y</m:mi>
                                          <m:mo>&#732;</m:mo>
                                       </m:mover>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>|</m:mo>
                                       <m:mi>X</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mi>y</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mover accent="true">
                                          <m:mi>x</m:mi>
                                          <m:mo>&#732;</m:mo>
                                       </m:mover>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>=</m:mo>
                                       <m:mfrac>
                                          <m:mn>1</m:mn>
                                          <m:mi>L</m:mi>
                                       </m:mfrac>
                                       <m:mstyle displaystyle="true">
                                          <m:munderover>
                                             <m:mo>&#8721;</m:mo>
                                             <m:mrow>
                                                <m:mi>l</m:mi>
                                                <m:mo>=</m:mo>
                                                <m:mn>1</m:mn>
                                             </m:mrow>
                                             <m:mi>L</m:mi>
                                          </m:munderover>
                                          <m:mi>&#934;</m:mi>
                                       </m:mstyle>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:msup>
                                                <m:mi>f</m:mi>
                                                <m:mrow>
                                                   <m:mo stretchy="false">[</m:mo>
                                                   <m:mi>l</m:mi>
                                                   <m:mo stretchy="false">]</m:mo>
                                                </m:mrow>
                                             </m:msup>
                                             <m:mo>+</m:mo>
                                             <m:msup>
                                                <m:mi>b</m:mi>
                                                <m:mrow>
                                                   <m:mo stretchy="false">[</m:mo>
                                                   <m:mi>l</m:mi>
                                                   <m:mo stretchy="false">]</m:mo>
                                                </m:mrow>
                                             </m:msup>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:msqrt>
                                                <m:mrow>
                                                   <m:msup>
                                                      <m:mi>V</m:mi>
                                                      <m:mrow>
                                                         <m:mo stretchy="false">[</m:mo>
                                                         <m:mi>l</m:mi>
                                                         <m:mo stretchy="false">]</m:mo>
                                                      </m:mrow>
                                                   </m:msup>
                                                </m:mrow>
                                             </m:msqrt>
                                          </m:mrow>
                                       </m:mfrac>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>,</m:mo>
                                       <m:mi>P</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mover accent="true">
                                          <m:mi>y</m:mi>
                                          <m:mo>&#732;</m:mo>
                                       </m:mover>
                                       <m:mo>=</m:mo>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>|</m:mo>
                                       <m:mi>X</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mi>y</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mover accent="true">
                                          <m:mi>x</m:mi>
                                          <m:mo>&#732;</m:mo>
                                       </m:mover>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mi>P</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mover accent="true">
                                          <m:mi>y</m:mi>
                                          <m:mo>&#732;</m:mo>
                                       </m:mover>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>|</m:mo>
                                       <m:mi>X</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mi>y</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mover accent="true">
                                          <m:mi>x</m:mi>
                                          <m:mo>&#732;</m:mo>
                                       </m:mover>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mtext>where&#160;&#160;&#160;&#160;&#160;</m:mtext>
                                       <m:mi>&#934;</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>x</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>=</m:mo>
                                       <m:mstyle displaystyle="true">
                                          <m:mrow>
                                             <m:msubsup>
                                                <m:mo>&#8747;</m:mo>
                                                <m:mrow>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:mi>&#8734;</m:mi>
                                                </m:mrow>
                                                <m:mi>x</m:mi>
                                             </m:msubsup>
                                             <m:mrow>
                                                <m:mfrac>
                                                   <m:mn>1</m:mn>
                                                   <m:mrow>
                                                      <m:msqrt>
                                                         <m:mrow>
                                                            <m:mn>2</m:mn>
                                                            <m:mi>&#960;</m:mi>
                                                         </m:mrow>
                                                      </m:msqrt>
                                                   </m:mrow>
                                                </m:mfrac>
                                                <m:msup>
                                                   <m:mi>e</m:mi>
                                                   <m:mrow>
                                                      <m:mo>&#8722;</m:mo>
                                                      <m:msup>
                                                         <m:mi>t</m:mi>
                                                         <m:mn>2</m:mn>
                                                      </m:msup>
                                                      <m:mo>/</m:mo>
                                                      <m:mn>2</m:mn>
                                                   </m:mrow>
                                                </m:msup>
                                                <m:mi>d</m:mi>
                                                <m:mi>t</m:mi>
                                                <m:mo>.</m:mo>
                                             </m:mrow>
                                          </m:mrow>
                                       </m:mstyle>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                           <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:mn>27</m:mn>
                                 <m:mtext>b</m:mtext>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaafaqaaeGabaaabaGaemiuaaLaeiikaGIafmyEaKNbaGaacqGH9aqpcqaIXaqmcqGG8baFieqacqWFybawcqGGSaalcqWF5bqEcqGGSaalcuWF4baEgaacaiabcMcaPiabg2da9maalaaabaGaeGymaedabaGaemitaWeaamaaqahabaGaeuOPdyealeaacqWGSbaBcqGH9aqpcqaIXaqmaeaacqWGmbata0GaeyyeIuoakiabcIcaOmaalaaabaGaemOzay2aaWbaaSqabeaacqGGBbWwcqWGSbaBcqGGDbqxaaGccqGHRaWkcqWGIbGydaahaaWcbeqaaiabcUfaBjabdYgaSjabc2faDbaaaOqaamaakaaabaGaemOvay1aaWbaaSqabeaacqGGBbWwcqWGSbaBcqGGDbqxaaaabeaaaaGccqGGPaqkcqGGSaalcqWGqbaucqGGOaakcuWG5bqEgaacaiabg2da9iabgkHiTiabigdaXiabcYha8jab=HfayjabcYcaSiab=Lha5jabcYcaSiqb=Hha4zaaiaGaeiykaKIaeyypa0JaeGymaeJaeyOeI0IaemiuaaLaeiikaGIafmyEaKNbaGaacqGH9aqpcqaIXaqmcqGG8baFcqWFybawcqGGSaalcqWF5bqEcqGGSaalcuWF4baEgaacaiabcMcaPiabcYcaSaqaaiabbEha3jabbIgaOjabbwgaLjabbkhaYjabbwgaLjabbccaGiabbccaGiabbccaGiabbccaGiabbccaGiabfA6agjabcIcaOiabdIha4jabcMcaPiabg2da9maapedabaWaaSaaaeaacqaIXaqmaeaadaGcaaqaaiabikdaYGGaciab+b8aWbWcbeaaaaGccqWGLbqzdaahaaWcbeqaaiabgkHiTiabdsha0naaCaaameqabaGaeGOmaidaaSGaei4la8IaeGOmaidaaOGaemizaqMaemiDaqNaeiOla4caleaacqGHsislcqGHEisPaeaacqWG4baEa0Gaey4kIipaaaGccaWLjaGaaCzcamaabmaabaGaeGOmaiJaeG4naCJaeeOyaigacaGLOaGaayzkaaaaaa@A4CD@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
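               <p>The Monte Carlo estimate in (27a) and (27b) can be sketched in Python as below. This is a hedged illustration rather than the code used in this study. It assumes that the Gibbs draws of z, b and the noise variance are already available as sequences, that the data have been restricted to the selected genes, and that a Gaussian kernel is used; the names rbf_kernel and predictive_probability, the kernel width and the small floor on the variance are hypothetical.</p>
               <p>
```python
import math
import numpy as np


def rbf_kernel(a, b, width=1.0):
    """Gaussian kernel between the rows of a and the rows of b (illustrative choice)."""
    sq = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-sq / (2.0 * width ** 2))


def predictive_probability(X_sel, x_new, z_draws, b_draws, s2_draws, kernel=rbf_kernel):
    """Monte Carlo estimate of P(y_new = 1) from L Gibbs draws, following (27a)-(27b)."""
    X_sel = np.asarray(X_sel, dtype=float)
    x_new = np.atleast_2d(np.asarray(x_new, dtype=float))
    n = X_sel.shape[0]

    K = kernel(X_sel, X_sel)            # n x n kernel matrix on the training samples
    k = kernel(X_sel, x_new)[:, 0]      # kernel vector between training and new sample
    k_new = kernel(x_new, x_new)[0, 0]  # kernel value of the new sample with itself

    total = 0.0
    for z, b, s2 in zip(z_draws, b_draws, s2_draws):
        # Solve (K + s2 * I) alpha = k; this is the O(n^3) step of each iteration.
        alpha = np.linalg.solve(K + s2 * np.eye(n), k)
        f = (np.asarray(z, dtype=float) - b) @ alpha   # predictive mean f[l]
        V = k_new - k @ alpha                          # predictive variance V[l]
        # Floor on V guards against round-off (an assumption, not from the text).
        u = (f + b) / math.sqrt(max(V, 1e-12))
        total += 0.5 * (1.0 + math.erf(u / math.sqrt(2.0)))  # standard normal CDF
    return total / len(z_draws)
```
               </p>
               <p>The probability of the other class then follows as one minus the returned value, as in (27b). Solving the linear system once per draw, instead of forming the matrix inverse explicitly, is a design choice made here for numerical stability.</p>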
            </sec>
            <sec>
               <st>
                  <p>Kernel type competition</p>
               </st>
               <p>Another important issue that needs to be addressed is how to select a kernel type properly. If an independent set of testing samples is available, one approach is to calculate a predictive fit measure on it, such as the misclassification rate (MR) or the average predictive probability (APP) of the true class label. If the number of available testing samples is sufficiently large, this approach is very reliable.</p>
               <p>Assuming that there are <it>M </it>testing samples {(<m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub>1</sub>, <m:math name="1471-2105-8-67-i23" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>y</m:mi><m:mo>&#8994;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG5bqEgaWeaaaa@2E41@</m:annotation></m:semantics></m:math><sub>1</sub>),...,(<m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>M</it></sub>, <m:math name="1471-2105-8-67-i23" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>y</m:mi><m:mo>&#8994;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG5bqEgaWeaaaa@2E41@</m:annotation></m:semantics></m:math><sub><it>M</it></sub>)}, where <m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>i </it></sub>denotes the microarray data and <m:math name="1471-2105-8-67-i23" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>y</m:mi><m:mo>&#8994;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG5bqEgaWeaaaa@2E41@</m:annotation></m:semantics></m:math><sub><it>i </it></sub>is its class label for <it>i </it>= 1,2,..., <it>M</it>, the MR for the testing set can be estimated by</p>
               <p>
                  <m:math name="1471-2105-8-67-i24" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable columnalign="left">
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mi>M</m:mi>
                                       <m:msub>
                                          <m:mi>R</m:mi>
                                          <m:mrow>
                                             <m:mi>t</m:mi>
                                             <m:mi>e</m:mi>
                                             <m:mi>s</m:mi>
                                             <m:mi>t</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mfrac>
                                          <m:mn>1</m:mn>
                                          <m:mi>M</m:mi>
                                       </m:mfrac>
                                       <m:mstyle displaystyle="true">
                                          <m:munderover>
                                             <m:mo>&#8721;</m:mo>
                                             <m:mrow>
                                                <m:mi>i</m:mi>
                                                <m:mo>=</m:mo>
                                                <m:mn>1</m:mn>
                                             </m:mrow>
                                             <m:mi>M</m:mi>
                                          </m:munderover>
                                          <m:mrow>
                                             <m:mi>M</m:mi>
                                             <m:msub>
                                                <m:mi>C</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mstyle>
                                       <m:mtext>&#160;&#160;where</m:mtext>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:mi>M</m:mi>
                                       <m:msub>
                                          <m:mi>C</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mrow>
                                          <m:mo>{</m:mo>
                                          <m:mrow>
                                             <m:mtable>
                                                <m:mtr>
                                                   <m:mtd>
                                                      <m:mn>1</m:mn>
                                                   </m:mtd>
                                                   <m:mtd>
                                                      <m:mrow>
                                                         <m:mi>i</m:mi>
                                                         <m:mi>f</m:mi>
                                                         <m:mtext>&#160;</m:mtext>
                                                         <m:msub>
                                                            <m:mover accent="true">
                                                               <m:mi>y</m:mi>
                                                               <m:mo>&#732;</m:mo>
                                                            </m:mover>
                                                            <m:mi>i</m:mi>
                                                         </m:msub>
                                                         <m:mo>&#8800;</m:mo>
                                                         <m:msub>
                                                            <m:mover accent="true">
                                                               <m:mi>y</m:mi>
                                                               <m:mo>&#8994;</m:mo>
                                                            </m:mover>
                                                            <m:mi>i</m:mi>
                                                         </m:msub>
                                                      </m:mrow>
                                                   </m:mtd>
                                                </m:mtr>
                                                <m:mtr>
                                                   <m:mtd>
                                                      <m:mn>0</m:mn>
                                                   </m:mtd>
                                                   <m:mtd>
                                                      <m:mrow>
                                                         <m:mi>i</m:mi>
                                                         <m:mi>f</m:mi>
                                                         <m:mtext>&#160;</m:mtext>
                                                         <m:msub>
                                                            <m:mover accent="true">
                                                               <m:mi>y</m:mi>
                                                               <m:mo>&#732;</m:mo>
                                                            </m:mover>
                                                            <m:mi>i</m:mi>
                                                         </m:msub>
                                                         <m:mo>=</m:mo>
                                                         <m:msub>
                                                            <m:mover accent="true">
                                                               <m:mi>y</m:mi>
                                                               <m:mo>&#8994;</m:mo>
                                                            </m:mover>
                                                            <m:mi>i</m:mi>
                                                         </m:msub>
                                                      </m:mrow>
                                                   </m:mtd>
                                                </m:mtr>
                                             </m:mtable>
                                             <m:mo>,</m:mo>
                                             <m:msub>
                                                <m:mover accent="true">
                                                   <m:mi>y</m:mi>
                                                   <m:mo>&#732;</m:mo>
                                                </m:mover>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>=</m:mo>
                                             <m:mrow>
                                                <m:mo>{</m:mo>
                                                <m:mrow>
                                                   <m:mtable>
                                                      <m:mtr>
                                                         <m:mtd>
                                                            <m:mn>1</m:mn>
                                                         </m:mtd>
                                                         <m:mtd>
                                                            <m:mrow>
                                                               <m:mi>i</m:mi>
                                                               <m:mi>f</m:mi>
                                                               <m:mtext>&#160;</m:mtext>
                                                               <m:mi>P</m:mi>
                                                               <m:mo stretchy="false">(</m:mo>
                                                               <m:msub>
                                                                  <m:mover accent="true">
                                                                     <m:mi>y</m:mi>
                                                                     <m:mo>&#732;</m:mo>
                                                                  </m:mover>
                                                                  <m:mi>i</m:mi>
                                                               </m:msub>
                                                               <m:mo>=</m:mo>
                                                               <m:mn>1</m:mn>
                                                               <m:mo>|</m:mo>
                                                               <m:mi>X</m:mi>
                                                               <m:mo>,</m:mo>
                                                               <m:mi>y</m:mi>
                                                               <m:mo>,</m:mo>
                                                               <m:msub>
                                                                  <m:mover accent="true">
                                                                     <m:mi>x</m:mi>
                                                                     <m:mo>&#732;</m:mo>
                                                                  </m:mover>
                                                                  <m:mi>i</m:mi>
                                                               </m:msub>
                                                               <m:mo>,</m:mo>
                                                               <m:mi>K</m:mi>
                                                               <m:mo stretchy="false">)</m:mo>
                                                               <m:mo>&#8805;</m:mo>
                                                               <m:mn>0.5</m:mn>
                                                            </m:mrow>
                                                         </m:mtd>
                                                      </m:mtr>
                                                      <m:mtr>
                                                         <m:mtd>
                                                            <m:mrow>
                                                               <m:mo>&#8722;</m:mo>
                                                               <m:mn>1</m:mn>
                                                            </m:mrow>
                                                         </m:mtd>
                                                         <m:mtd>
                                                            <m:mrow>
                                                               <m:mi>i</m:mi>
                                                               <m:mi>f</m:mi>
                                                               <m:mtext>&#160;</m:mtext>
                                                               <m:mi>P</m:mi>
                                                               <m:mo stretchy="false">(</m:mo>
                                                               <m:msub>
                                                                  <m:mover accent="true">
                                                                     <m:mi>y</m:mi>
                                                                     <m:mo>&#732;</m:mo>
                                                                  </m:mover>
                                                                  <m:mi>i</m:mi>
                                                               </m:msub>
                                                               <m:mo>=</m:mo>
                                                               <m:mn>1</m:mn>
                                                               <m:mo>|</m:mo>
                                                               <m:mi>X</m:mi>
                                                               <m:mo>,</m:mo>
                                                               <m:mi>y</m:mi>
                                                               <m:mo>,</m:mo>
                                                               <m:msub>
                                                                  <m:mover accent="true">
                                                                     <m:mi>x</m:mi>
                                                                     <m:mo>&#732;</m:mo>
                                                                  </m:mover>
                                                                  <m:mi>i</m:mi>
                                                               </m:msub>
                                                               <m:mo>,</m:mo>
                                                               <m:mi>K</m:mi>
                                                               <m:mo stretchy="false">)</m:mo>
                                                               <m:mo>&lt;</m:mo>
                                                               <m:mn>0.5</m:mn>
                                                            </m:mrow>
                                                         </m:mtd>
                                                      </m:mtr>
                                                   </m:mtable>
                                                </m:mrow>
                                             </m:mrow>
                                          </m:mrow>
                                       </m:mrow>
                                       <m:mtext>&#160;&#160;for&#160;&#160;</m:mtext>
                                       <m:mi>i</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>2</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>...</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mi>M</m:mi>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                           <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:mn>28</m:mn>
                                 <m:mtext>a</m:mtext>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaafaqaaeGabaaabaGaemyta0KaemOuai1aaSbaaSqaaiabdsha0jabdwgaLjabdohaZjabdsha0bqabaGccqGH9aqpdaWcaaqaaiabigdaXaqaaiabd2eanbaadaaeWbqaaiabd2eanjabdoeadnaaBaaaleaacqWGPbqAaeqaaOGaeiilaWcaleaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGnbqta0GaeyyeIuoakiabbccaGiabbccaGiabbEha3jabbIgaOjabbwgaLjabbkhaYjabbwgaLbqaaiabd2eanjabdoeadnaaBaaaleaacqWGPbqAaeqaaOGaeyypa0ZaaiqabeaafaqabeGacaaabaGaeGymaedabaacbiGae8xAaKMae8NzayMaeeiiaaIafmyEaKNbaGaadaWgaaWcbaGaemyAaKgabeaakiabgcMi5kqbdMha5zaataWaaSbaaSqaaiabdMgaPbqabaaakeaacqaIWaamaeaacqWFPbqAcqWFMbGzcqqGGaaicuWG5bqEgaacamaaBaaaleaacqWGPbqAaeqaaOGaeyypa0JafmyEaKNbambadaWgaaWcbaGaemyAaKgabeaaaaGccqGGSaalcuWG5bqEgaacamaaBaaaleaacqWGPbqAaeqaaOGaeyypa0ZaaiqabeaafaqabeGacaaabaGaeGymaedabaGae8xAaKMae8NzayMaeeiiaaIaemiuaaLaeiikaGIafmyEaKNbaGaadaWgaaWcbaGaemyAaKgabeaakiabg2da9iabigdaXiabcYha8Hqabiab+HfayjabcYcaSiab+Lha5jabcYcaSiqb+Hha4zaaiaWaaSbaaSqaaiabdMgaPbqabaGccqGGSaalieaacqqFlbWscqGGPaqkcqGG8baFcqGHLjYScqaIWaamcqGGUaGlcqaI1aqnaeaacqGHsislcqaIXaqmaeaacqWFPbqAcqWFMbGzcqqGGaaicqWGqbaucqGGOaakcuWG5bqEgaacamaaBaaaleaacqWGPbqAaeqaaOGaeyypa0JaeGymaeJaeiiFaWNae4hwaGLaeiilaWIae4xEaKNaeiilaWIaf4hEaGNbaGaadaWgaaWcbaGaemyAaKgabeaakiabcYcaSiab9TealjabcMcaPiabcYha8jabgYda8iabicdaWiabc6caUiabiwda1aaaaiaawUhaaaGaay5EaaGaeeiiaaIaeeiiaaIaeeOzayMaee4Ba8MaeeOCaiNaeeiiaaIaeeiiaaIaemyAaKMaeyypa0JaeGymaeJaeiilaWIaeGOmaiJaeiilaWIaeiOla4IaeiOla4IaeiOla4IaeiilaWIaemyta0eaaiaaxMaacaWLjaWaaeWaaeaacqaIYaGmcqaI4aaocqqGHbqyaiaawIcacaGLPaaaaaa@C2E4@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>The smaller the MR of a kernel type, the better its generalization performance should be. If the number of available testing samples is small, the APP of the true class label is a more consistent measure. Throughout this paper, APP always refers to the APP of the true class label, which is defined as:</p>
               <p>
                  <m:math name="1471-2105-8-67-i25" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>A</m:mi>
                           <m:mi>P</m:mi>
                           <m:msub>
                              <m:mi>P</m:mi>
                              <m:mrow>
                                 <m:mi>t</m:mi>
                                 <m:mi>e</m:mi>
                                 <m:mi>s</m:mi>
                                 <m:mi>t</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mn>1</m:mn>
                              <m:mi>M</m:mi>
                           </m:mfrac>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mi>i</m:mi>
                                    <m:mo>=</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                                 <m:mi>M</m:mi>
                              </m:munderover>
                              <m:mrow>
                                 <m:mi>P</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>y</m:mi>
                                       <m:mo>&#732;</m:mo>
                                    </m:mover>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo>=</m:mo>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>y</m:mi>
                                       <m:mo>&#8994;</m:mo>
                                    </m:mover>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo>|</m:mo>
                                 <m:mi>X</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:mi>y</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>x</m:mi>
                                       <m:mo>&#732;</m:mo>
                                    </m:mover>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo>,</m:mo>
                                 <m:mi>K</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>.</m:mo>
                              </m:mrow>
                           </m:mstyle>
                           <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:mn>28</m:mn>
                                 <m:mtext>b</m:mtext>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGbbqqcqWGqbaucqWGqbaudaWgaaWcbaGaemiDaqNaemyzauMaem4CamNaemiDaqhabeaakiabg2da9maalaaabaGaeGymaedabaGaemyta0eaamaaqahabaGaemiuaaLaeiikaGIafmyEaKNbaGaadaWgaaWcbaGaemyAaKgabeaakiabg2da9iqbdMha5zaataWaaSbaaSqaaiabdMgaPbqabaGccqGG8baFieqacqWFybawcqGGSaalcqWF5bqEcqGGSaalcuWF4baEgaacamaaBaaaleaacqWGPbqAaeqaaOGaeiilaWIaem4saSKaeiykaKIaeiOla4caleaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGnbqta0GaeyyeIuoakiaaxMaacaWLjaWaaeWaaeaacqaIYaGmcqaI4aaocqqGIbGyaiaawIcacaGLPaaaaaa@5BE4@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>In both (28a) and (28b), the probability <it>P</it>(<m:math name="1471-2105-8-67-i21" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>y</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG5bqEgaacaaaa@2E36@</m:annotation></m:semantics></m:math><sub><it>i</it></sub>|<b>X</b>, <b>y</b>, <m:math name="1471-2105-8-67-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacuWF4baEgaacaaaa@2E3A@</m:annotation></m:semantics></m:math><sub><it>i</it></sub>, K) is evaluated by (27a) and (27b). Obviously, a better model should have a higher APP. The APP usually provides a less biased predictive fit measure when the number of testing samples is limited.</p>
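               <p>As a minimal sketch of (28a) and (28b), assuming the predictive probabilities of the positive class from (27a) and (27b) have already been computed for the M testing samples, the two measures could be evaluated as follows. The function names and the numbers in the example are hypothetical and serve only as an illustration.</p>

```python
def misclassification_rate(p_pos, y_true):
    """MR as in (28a): predict +1 when P(y = 1 | ...) >= 0.5, otherwise -1."""
    y_pred = [1 if p >= 0.5 else -1 for p in p_pos]
    errors = sum(1 for yp, yt in zip(y_pred, y_true) if yp != yt)
    return errors / len(y_true)


def average_predictive_probability(p_pos, y_true):
    """APP as in (28b): mean probability assigned to the true class label."""
    # For a true label of -1, the probability of the true label is 1 - P(y = 1 | ...).
    p_true = [p if yt == 1 else 1.0 - p for p, yt in zip(p_pos, y_true)]
    return sum(p_true) / len(p_true)


# Hypothetical predictive probabilities P(y = 1 | ...) for M = 4 testing samples.
p_pos = [0.9, 0.6, 0.4, 0.2]
y_true = [1, -1, 1, -1]

mr = misclassification_rate(p_pos, y_true)
app = average_predictive_probability(p_pos, y_true)
print(mr, app)
```

               <p>For these illustrative values, two of the four samples fall on the wrong side of the 0.5 threshold, so the MR is 0.5, whereas the APP is approximately 0.625 because it also accounts for how confident each prediction is.</p>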
               <p>After running the simulations under each candidate kernel type, one can simply choose the kernel type with the smallest MR or the largest APP on the testing set. However, independent testing samples are not always available. To use the predictive fit approach in that case, one may resort to a rigorous cross-validation (CV) procedure. Sometimes, a "leave-one-out" cross-validation (LOOCV) is appropriate. That is, one treats one of the training samples as the testing sample, applies the proposed KIGP (including all three phases) to the remaining n-1 samples, and obtains the predictive measure for the held-out sample. Repeating this procedure for each training sample and averaging the predictive measures gives a consistent evaluation of the target kernel type.</p>
               <p>A less biased approach is to use multiple independent 3-fold CVs. In each round of CV, one first randomly partitions the training set into three sets with a balanced ratio of the class labels. Each of the three sets is then treated in turn as the testing set: one applies the KIGP (including all three phases) to the remaining two sets as the training set and obtains the predictive fit measure for this testing set. After running this procedure for all three sets, one has the predictive measure for all available samples in this round. One runs multiple rounds of independent 3-fold CV (with different random partitions), and the average of the predictive measure over the whole set delivers an approximately unbiased assessment of the given kernel type.</p>
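               <p>The repeated 3-fold CV procedure can be sketched as follows, under stated assumptions. The callable fit_predict is a hypothetical stand-in for running all three KIGP phases on a training set and returning the predictive probability of the positive class for each testing sample; the toy predictor and the data below are placeholders, not the KIGP itself.</p>

```python
import random


def stratified_folds(labels, n_folds=3, rng=None):
    """Randomly split sample indices into n_folds sets with balanced class ratios."""
    if rng is None:
        rng = random.Random()
    folds = [[] for _ in range(n_folds)]
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        # Deal the shuffled indices of this class round-robin across the folds.
        for pos, i in enumerate(idx):
            folds[pos % n_folds].append(i)
    return folds


def repeated_cv_app(X, y, fit_predict, n_rounds=10, n_folds=3, seed=0):
    """Average the APP of the true label over several independent stratified CVs."""
    rng = random.Random(seed)
    round_scores = []
    for _ in range(n_rounds):
        p_true = [None] * len(y)
        for test_idx in stratified_folds(y, n_folds, rng):
            held_out = set(test_idx)
            train_idx = [i for i in range(len(y)) if i not in held_out]
            # Hypothetical model call: train on the other folds, predict this one.
            p_pos = fit_predict([X[i] for i in train_idx],
                                [y[i] for i in train_idx],
                                [X[i] for i in test_idx])
            for i, p in zip(test_idx, p_pos):
                p_true[i] = p if y[i] == 1 else 1.0 - p
        # Every sample has been held out exactly once in this round.
        round_scores.append(sum(p_true) / len(p_true))
    return sum(round_scores) / len(round_scores)


def toy_fit_predict(X_train, y_train, X_test):
    """Toy placeholder (not KIGP): return the training fraction of +1 labels."""
    frac_pos = sum(1 for label in y_train if label == 1) / len(y_train)
    return [frac_pos for _ in X_test]


# Placeholder data: 12 samples, 6 per class.
X = [[float(i)] for i in range(12)]
y = [1] * 6 + [-1] * 6
score = repeated_cv_app(X, y, toy_fit_predict, n_rounds=5)
print(score)
```

               <p>Because the folds are stratified, each training set in this toy example contains equal numbers of both classes, so the placeholder predictor yields an APP of 0.5. Substituting a real classifier for fit_predict, and repeating the call once per candidate kernel type, would give the scores on which kernel selection is based.</p>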
               <p>The predictive fit approach based on multiple 3-fold CVs works very well. Throughout this study, we always use it to select the proper kernel type for a given problem when an independent testing set is not available. As is the nature of MCMC-based methods, however, the KIGP method is extremely computationally intensive, especially when the number of candidate kernel types is large. A more integrative implementation of kernel or model selection, for example one based on a reversible jump MCMC approach, would help to further streamline the current KIGP.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>KIGP kernel-imbedded Gaussian process</p>
         <p>AIC Akaike information criterion</p>
         <p>BIC Bayesian information criterion</p>
         <p>MCMC Markov chain Monte Carlo</p>
         <p>PAM prediction analysis of microarrays</p>
         <p>SVM support vector machine</p>
         <p>RFE recursive feature elimination</p>
         <p>LSSVM least squares support vector machine</p>
         <p>GP Gaussian process</p>
         <p>LK linear kernel</p>
         <p>PK polynomial kernel</p>
         <p>PK(d) polynomial kernel with degree "d"</p>
         <p>GK Gaussian kernel</p>
         <p>GK(r) Gaussian kernel with width "r"</p>
         <p>NLF normalized log-frequency</p>
         <p>fdr false discovery rate</p>
         <p>MR misclassification rate</p>
         <p>APP average predictive probability</p>
         <p>LOOCV leave-one-out cross-validation</p>
         <p>3-fold CV 3-fold cross-validation</p>
         <p>PDF probability density function</p>
         <p>AML acute myeloid leukemia</p>
         <p>ALL acute lymphoblastic leukemia</p>
         <p>SRBCT small round blue-cell tumor</p>
         <p>EWS Ewing family of tumors</p>
         <p>RMS rhabdomyosarcoma</p>
         <p>NB neuroblastoma</p>
         <p>NHL non-Hodgkin lymphoma</p>
         <p>ANN artificial neural network</p>
         <p>RJMCMC reversible jump Markov chain Monte Carlo</p>
         <p>IG inverted gamma</p>
         <p>PMF probability mass function</p>
         <p>RF random forests</p>
         <p>DLDA diagonal linear discriminant analysis</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>LWKC conceived and supervised this study. XZ performed the computational work and analyzed the data. Both authors contributed to the design and development of the methodology and the computational/statistical strategy. Both authors contributed to the writing of the manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Appendix: Inference for <it>P</it>(t|z, <it>b</it>, K<sub><it>&#947;</it></sub>)</p>
         </st>
         <p>For convenience, we drop <it>b </it>and <b>K</b><sub><b>&#947; </b></sub>from the notation in the following derivations. Under a KIGP model, we have</p>
         <p><b>t </b>~ <it>N </it>(<b>0</b>, <b>K</b><sub><b>&#947;</b></sub>), <b>z</b>|<b>t </b>~ <it>N</it>(<b>t </b>+ <it>b</it><b>1<sub>n</sub></b>, <b>&#937;</b>), where <b>K</b><sub><b>&#947;</b></sub>, <b>1</b><sub><b>n </b></sub>and <b>&#937; </b>are defined in Eq. (12). &#160;&#160;&#160; (A1)</p>
         <p>The joint distribution of <b>z </b>and <b>t </b>is still Gaussian, which can be formulated as:</p>
         <p>
            <m:math name="1471-2105-8-67-i26" xmlns:m="http://www.w3.org/1998/Math/MathML">
               <m:semantics>
                  <m:mtable columnalign="left">
                     <m:mtr>
                        <m:mtd>
                           <m:mi>P</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>z</m:mi>
                           </m:mstyle>
                           <m:mo>,</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>t</m:mi>
                           </m:mstyle>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mi>P</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>z</m:mi>
                           </m:mstyle>
                           <m:mo>|</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>t</m:mi>
                           </m:mstyle>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mi>P</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>t</m:mi>
                           </m:mstyle>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>&#8733;</m:mo>
                           <m:mi>exp</m:mi>
                           <m:mo>&#8289;</m:mo>
                           <m:mo>{</m:mo>
                           <m:mo>&#8722;</m:mo>
                           <m:mfrac>
                              <m:mn>1</m:mn>
                              <m:mn>2</m:mn>
                           </m:mfrac>
                           <m:mo stretchy="false">[</m:mo>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>z</m:mi>
                           </m:mstyle>
                           <m:mo>&#8722;</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>t</m:mi>
                           </m:mstyle>
                           <m:mo>&#8722;</m:mo>
                           <m:mi>b</m:mi>
                           <m:msub>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mn>1</m:mn>
                              </m:mstyle>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mi>n</m:mi>
                              </m:mstyle>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>'</m:mo>
                           <m:msup>
                              <m:mi>&#937;</m:mi>
                              <m:mrow>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:msup>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>z</m:mi>
                           </m:mstyle>
                           <m:mo>&#8722;</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>t</m:mi>
                           </m:mstyle>
                           <m:mo>&#8722;</m:mo>
                           <m:mi>b</m:mi>
                           <m:msub>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mn>1</m:mn>
                              </m:mstyle>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mi>n</m:mi>
                              </m:mstyle>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>+</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>t</m:mi>
                           </m:mstyle>
                           <m:mo>'</m:mo>
                           <m:msubsup>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mi>K</m:mi>
                              </m:mstyle>
                              <m:mi>&#947;</m:mi>
                              <m:mrow>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:msubsup>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>t</m:mi>
                           </m:mstyle>
                           <m:mo stretchy="false">]</m:mo>
                           <m:mo>}</m:mo>
                        </m:mtd>
                     </m:mtr>
                     <m:mtr>
                        <m:mtd>
                           <m:mo>&#8733;</m:mo>
                           <m:mi>exp</m:mi>
                           <m:mo>&#8289;</m:mo>
                           <m:mo>{</m:mo>
                           <m:mo>&#8722;</m:mo>
                           <m:mfrac>
                              <m:mn>1</m:mn>
                              <m:mn>2</m:mn>
                           </m:mfrac>
                           <m:mo stretchy="false">[</m:mo>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>t</m:mi>
                           </m:mstyle>
                           <m:mo>&#8722;</m:mo>
                           <m:msub>
                              <m:mi>&#956;</m:mi>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mi>t</m:mi>
                              </m:mstyle>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>'</m:mo>
                           <m:mo stretchy="false">(</m:mo>
                           <m:msubsup>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mi>K</m:mi>
                              </m:mstyle>
                              <m:mi>&#947;</m:mi>
                              <m:mrow>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:msubsup>
                           <m:mo>+</m:mo>
                           <m:msup>
                              <m:mi>&#937;</m:mi>
                              <m:mrow>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:msup>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>t</m:mi>
                           </m:mstyle>
                           <m:mo>&#8722;</m:mo>
                           <m:msub>
                              <m:mi>&#956;</m:mi>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mi>t</m:mi>
                              </m:mstyle>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>+</m:mo>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>z</m:mi>
                           </m:mstyle>
                           <m:mo>&#8722;</m:mo>
                           <m:mi>b</m:mi>
                           <m:msub>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mn>1</m:mn>
                              </m:mstyle>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mi>n</m:mi>
                              </m:mstyle>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>'</m:mo>
                           <m:msup>
                              <m:mi>&#937;</m:mi>
                              <m:mrow>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:msup>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mstyle mathvariant="bold" mathsize="normal">
                              <m:mi>z</m:mi>
                           </m:mstyle>
                           <m:mo>&#8722;</m:mo>
                           <m:mi>b</m:mi>
                           <m:msub>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mn>1</m:mn>
                              </m:mstyle>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mi>n</m:mi>
                              </m:mstyle>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>&#8722;</m:mo>
                           <m:msubsup>
                              <m:mi>&#956;</m:mi>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mi>t</m:mi>
                              </m:mstyle>
                              <m:mo>'</m:mo>
                           </m:msubsup>
                           <m:mo stretchy="false">(</m:mo>
                           <m:msubsup>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mi>K</m:mi>
                              </m:mstyle>
                              <m:mi>&#947;</m:mi>
                              <m:mrow>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:msubsup>
                           <m:mo>+</m:mo>
                           <m:msup>
                              <m:mi>&#937;</m:mi>
                              <m:mrow>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:msup>
                           <m:mo stretchy="false">)</m:mo>
                           <m:msub>
                              <m:mi>&#956;</m:mi>
                              <m:mstyle mathvariant="bold" mathsize="normal">
                                 <m:mi>t</m:mi>
                              </m:mstyle>
                           </m:msub>
                           <m:mo stretchy="false">]</m:mo>
                           <m:mo>}</m:mo>
                        </m:mtd>
                     </m:mtr>
                  </m:mtable>
                  <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakqaabeqaaiabdcfaqjabcIcaOiabhQha6jabcYcaSiabhsha0jabcMcaPiabg2da9iabdcfaqjabcIcaOiabhQha6jabcYha8jabhsha0jabcMcaPiabdcfaqjabcIcaOiabhsha0jabcMcaPiabg2Hi1kGbcwgaLjabcIha4jabcchaWjabcUha7jabgkHiTmaalaaabaGaeGymaedabaGaeGOmaidaaiabcUfaBjabcIcaOiabhQha6jabgkHiTiabhsha0jabgkHiTiabdkgaIjabhgdaXmaaBaaaleaacqWHUbGBaeqaaOGaeiykaKIaei4jaCccceGae8xQdC1aaWbaaSqabeaacqGHsislcqaIXaqmaaGccqGGOaakcqWH6bGEcqGHsislcqWH0baDcqGHsislcqWGIbGycqWHXaqmdaWgaaWcbaGaeCOBa4gabeaakiabcMcaPiabgUcaRiabhsha0jabcEcaNiabhUealnaaDaaaleaaiiGacqGFZoWzaeaacqGHsislcqaIXaqmaaGccqWH0baDcqGGPaqkaeaacqGHDisTcyGGLbqzcqGG4baEcqGGWbaCcqGG7bWEcqGHsisldaWcaaqaaiabigdaXaqaaiabikdaYaaacqGGBbWwcqGGOaakcqWH0baDcqGHsislcqWF8oqBdaWgaaWcbaGaeCiDaqhabeaakiabcMcaPiabcEcaNiabcIcaOiabhUealnaaDaaaleaacqGFZoWzaeaacqGHsislcqaIXaqmaaGccqGHRaWkcqWFPoWvdaahaaWcbeqaaiabgkHiTiabigdaXaaakiabcMcaPiabcIcaOiabhsha0jabgkHiTiab=X7aTnaaBaaaleaacqWH0baDaeqaaOGaeiykaKIaey4kaSIaeiikaGIaeCOEaONaeyOeI0IaemOyaiMaeCymaeZaaSbaaSqaaiabh6gaUbqabaGccqGGPaqkcqGGNaWjcqWFPoWvdaahaaWcbeqaaiabgkHiTiabigdaXaaakiabcIcaOiabhQha6jabgkHiTiabdkgaIjabhgdaXmaaBaaaleaacqWHUbGBaeqaaOGaeiykaKIaeyOeI0Iae8hVd02aa0baaSqaaiabhsha0bqaaiabcEcaNaaakiabcIcaOiabhUealnaaDaaaleaacqGFZoWzaeaacqGHsislcqaIXaqmaaGccqGHRaWkcqWFPoWvdaahaaWcbeqaaiabgkHiTiabigdaXaaakiabcMcaPiab=X7aTnaaBaaaleaacqWH0baDaeqaaOGaeiyxa0LaeiyFa0haaaa@C090@</m:annotation>
               </m:semantics>
            </m:math>
         </p>
         <p>where <b>&#956;</b><sub><b>t </b></sub>= (<b>&#937;</b><sup>-1 </sup>+ <m:math name="1471-2105-8-67-i27" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>K</m:mi><m:mi>&#947;</m:mi><m:mrow><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieqacqWFlbWsdaqhaaWcbaacciGae43SdCgabaGaeyOeI0IaeGymaedaaaaa@3188@</m:annotation></m:semantics></m:math>)<sup>-1</sup><b>&#937;</b><sup>-1</sup>(<b>z </b>- <it>b</it><b>1<sub>n</sub></b>). &#160;&#160;&#160; (A2)</p>
         <p>In principle, if <b>z </b>and <b>t </b>are jointly Gaussian, then both the marginal distribution of <b>z </b>and the conditional distribution of <b>t </b>given <b>z </b>are also Gaussian. Using the following matrix identity from <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>,</p>
         <p>(<b>A </b>+ <b>C</b>)<sup>-1 </sup>= <b>A</b><sup>-1 </sup>- <b>A</b><sup>-1</sup>(<b>A</b><sup>-1 </sup>+ <b>C</b><sup>-1</sup>)<sup>-1</sup><b>A</b><sup>-1</sup>, &#160;&#160;&#160; (A3)</p>
         <p>we obtain</p>
         <p>
            <m:math name="1471-2105-8-67-i28" xmlns:m="http://www.w3.org/1998/Math/MathML">
               <m:semantics>
                  <m:mrow>
                     <m:mtable>
                        <m:mtr>
                           <m:mtd>
                              <m:mrow>
                                 <m:mi>P</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mstyle mathvariant="bold" mathsize="normal">
                                    <m:mi>z</m:mi>
                                 </m:mstyle>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>=</m:mo>
                                 <m:mstyle displaystyle="true">
                                    <m:mrow>
                                       <m:munder>
                                          <m:mo>&#8747;</m:mo>
                                          <m:mstyle mathvariant="bold" mathsize="normal">
                                             <m:mi>t</m:mi>
                                          </m:mstyle>
                                       </m:munder>
                                       <m:mrow>
                                          <m:mi>P</m:mi>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mstyle mathvariant="bold" mathsize="normal">
                                             <m:mi>z</m:mi>
                                          </m:mstyle>
                                          <m:mo>,</m:mo>
                                          <m:mstyle mathvariant="bold" mathsize="normal">
                                             <m:mi>t</m:mi>
                                          </m:mstyle>
                                          <m:mo stretchy="false">)</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:mstyle>
                                 <m:mi>d</m:mi>
                                 <m:mstyle mathvariant="bold" mathsize="normal">
                                    <m:mi>t</m:mi>
                                 </m:mstyle>
                                 <m:mo>&#8733;</m:mo>
                                 <m:mi>exp</m:mi>
                                 <m:mo>&#8289;</m:mo>
                                 <m:mo>{</m:mo>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mfrac>
                                    <m:mn>1</m:mn>
                                    <m:mn>2</m:mn>
                                 </m:mfrac>
                                 <m:mo stretchy="false">[</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mstyle mathvariant="bold" mathsize="normal">
                                    <m:mi>z</m:mi>
                                 </m:mstyle>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mi>b</m:mi>
                                 <m:msub>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mn>1</m:mn>
                                    </m:mstyle>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mi>n</m:mi>
                                    </m:mstyle>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>'</m:mo>
                                 <m:msup>
                                    <m:mi>&#937;</m:mi>
                                    <m:mrow>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                 </m:msup>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mstyle mathvariant="bold" mathsize="normal">
                                    <m:mi>z</m:mi>
                                 </m:mstyle>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mi>b</m:mi>
                                 <m:msub>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mn>1</m:mn>
                                    </m:mstyle>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mi>n</m:mi>
                                    </m:mstyle>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msubsup>
                                    <m:mi>&#956;</m:mi>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mi>t</m:mi>
                                    </m:mstyle>
                                    <m:mo>'</m:mo>
                                 </m:msubsup>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msubsup>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mi>K</m:mi>
                                    </m:mstyle>
                                    <m:mi>&#947;</m:mi>
                                    <m:mrow>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                 </m:msubsup>
                                 <m:mo>+</m:mo>
                                 <m:msup>
                                    <m:mi>&#937;</m:mi>
                                    <m:mrow>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                 </m:msup>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:msub>
                                    <m:mi>&#956;</m:mi>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mi>t</m:mi>
                                    </m:mstyle>
                                 </m:msub>
                                 <m:mo stretchy="false">]</m:mo>
                                 <m:mo>}</m:mo>
                              </m:mrow>
                           </m:mtd>
                        </m:mtr>
                        <m:mtr>
                           <m:mtd>
                              <m:mrow>
                                 <m:mo>&#8733;</m:mo>
                                 <m:mi>exp</m:mi>
                                 <m:mo>&#8289;</m:mo>
                                 <m:mo>{</m:mo>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mfrac>
                                    <m:mn>1</m:mn>
                                    <m:mn>2</m:mn>
                                 </m:mfrac>
                                 <m:mo stretchy="false">[</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mstyle mathvariant="bold" mathsize="normal">
                                    <m:mi>z</m:mi>
                                 </m:mstyle>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mi>b</m:mi>
                                 <m:msub>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mn>1</m:mn>
                                    </m:mstyle>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mi>n</m:mi>
                                    </m:mstyle>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>'</m:mo>
                                 <m:msup>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mstyle mathvariant="bold" mathsize="normal">
                                             <m:mi>K</m:mi>
                                          </m:mstyle>
                                          <m:mi>&#947;</m:mi>
                                       </m:msub>
                                       <m:mo>+</m:mo>
                                       <m:mi>&#937;</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                 </m:msup>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mstyle mathvariant="bold" mathsize="normal">
                                    <m:mi>z</m:mi>
                                 </m:mstyle>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mi>b</m:mi>
                                 <m:msub>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mn>1</m:mn>
                                    </m:mstyle>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mi>n</m:mi>
                                    </m:mstyle>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo stretchy="false">]</m:mo>
                                 <m:mo>}</m:mo>
                              </m:mrow>
                           </m:mtd>
                        </m:mtr>
                     </m:mtable>
                     <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                     <m:mrow>
                        <m:mo>(</m:mo>
                        <m:mrow>
                           <m:mtext>A</m:mtext>
                           <m:mn>4</m:mn>
                        </m:mrow>
                        <m:mo>)</m:mo>
                     </m:mrow>
                  </m:mrow>
                  <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaafaqabeGabaaabaGaemiuaaLaeiikaGIaeCOEaONaeiykaKIaeyypa0Zaa8quaeaacqWGqbaucqGGOaakcqWH6bGEcqGGSaalcqWH0baDcqGGPaqkaSqaaiabhsha0bqab0Gaey4kIipakiabdsgaKjabhsha0jabg2Hi1kGbcwgaLjabcIha4jabcchaWjabcUha7jabgkHiTmaalaaabaGaeGymaedabaGaeGOmaidaaiabcUfaBjabcIcaOiabhQha6jabgkHiTiabdkgaIjabhgdaXmaaBaaaleaacqWHUbGBaeqaaOGaeiykaKIaei4jaCccceGae8xQdC1aaWbaaSqabeaacqGHsislcqaIXaqmaaGccqGGOaakcqWH6bGEcqGHsislcqWGIbGycqWHXaqmdaWgaaWcbaGaeCOBa4gabeaakiabcMcaPiabgkHiTiab=X7aTnaaDaaaleaacqWH0baDaeaacqGGNaWjaaGccqGGOaakcqWHlbWsdaqhaaWcbaacciGae43SdCgabaGaeyOeI0IaeGymaedaaOGaey4kaSIae8xQdC1aaWbaaSqabeaacqGHsislcqaIXaqmaaGccqGGPaqkcqWF8oqBdaWgaaWcbaGaeCiDaqhabeaakiabc2faDjabc2ha9bqaaiabg2Hi1kGbcwgaLjabcIha4jabcchaWjabcUha7jabgkHiTmaalaaabaGaeGymaedabaGaeGOmaidaaiabcUfaBjabcIcaOiabhQha6jabgkHiTiabdkgaIjabhgdaXmaaBaaaleaacqWHUbGBaeqaaOGaeiykaKIaei4jaCIaeiikaGIaeC4saS0aaSbaaSqaaiab+n7aNbqabaGccqGHRaWkcqWFPoWvcqGGPaqkdaahaaWcbeqaaiabgkHiTiabigdaXaaakiabcIcaOiabhQha6jabgkHiTiabdkgaIjabhgdaXmaaBaaaleaacqWHUbGBaeqaaOGaeiykaKIaeiyxa0LaeiyFa0haaiaaxMaacaWLjaWaaeWaaeaacqqGbbqqcqaI0aanaiaawIcacaGLPaaaaaa@A246@</m:annotation>
               </m:semantics>
            </m:math>
         </p>
         <p>and</p>
         <p>
            <m:math name="1471-2105-8-67-i29" xmlns:m="http://www.w3.org/1998/Math/MathML">
               <m:semantics>
                  <m:mrow>
                     <m:mtable columnalign="left">
                        <m:mtr columnalign="left">
                           <m:mtd columnalign="left">
                              <m:mrow>
                                 <m:mi>P</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mstyle mathvariant="bold" mathsize="normal">
                                    <m:mi>t</m:mi>
                                 </m:mstyle>
                                 <m:mo>|</m:mo>
                                 <m:mstyle mathvariant="bold" mathsize="normal">
                                    <m:mi>z</m:mi>
                                 </m:mstyle>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>&#8733;</m:mo>
                                 <m:mi>exp</m:mi>
                                 <m:mo>&#8289;</m:mo>
                                 <m:mo>{</m:mo>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mfrac>
                                    <m:mn>1</m:mn>
                                    <m:mn>2</m:mn>
                                 </m:mfrac>
                                 <m:mo stretchy="false">[</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mstyle mathvariant="bold" mathsize="normal">
                                    <m:mi>t</m:mi>
                                 </m:mstyle>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#956;</m:mi>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mi>t</m:mi>
                                    </m:mstyle>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>'</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msubsup>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mi>K</m:mi>
                                    </m:mstyle>
                                    <m:mi>&#947;</m:mi>
                                    <m:mrow>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                 </m:msubsup>
                                 <m:mo>+</m:mo>
                                 <m:msup>
                                    <m:mi>&#937;</m:mi>
                                    <m:mrow>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                 </m:msup>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mstyle mathvariant="bold" mathsize="normal">
                                    <m:mi>t</m:mi>
                                 </m:mstyle>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#956;</m:mi>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mi>t</m:mi>
                                    </m:mstyle>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo stretchy="false">]</m:mo>
                                 <m:mo>}</m:mo>
                              </m:mrow>
                           </m:mtd>
                        </m:mtr>
                        <m:mtr columnalign="left">
                           <m:mtd columnalign="left">
                              <m:mrow>
                                 <m:mtext/>
                                 <m:mo>&#8733;</m:mo>
                                 <m:mi>exp</m:mi>
                                 <m:mo>&#8289;</m:mo>
                                 <m:mo>{</m:mo>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mfrac>
                                    <m:mn>1</m:mn>
                                    <m:mn>2</m:mn>
                                 </m:mfrac>
                                 <m:mo stretchy="false">[</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mstyle mathvariant="bold" mathsize="normal">
                                    <m:mi>t</m:mi>
                                 </m:mstyle>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#956;</m:mi>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mi>t</m:mi>
                                    </m:mstyle>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>'</m:mo>
                                 <m:msup>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>&#937;</m:mi>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mi>&#937;</m:mi>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>&#937;</m:mi>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mstyle mathvariant="bold" mathsize="normal">
                                                   <m:mi>K</m:mi>
                                                </m:mstyle>
                                                <m:mi>&#947;</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mn>1</m:mn>
                                          </m:mrow>
                                       </m:msup>
                                       <m:mi>&#937;</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                 </m:msup>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mstyle mathvariant="bold" mathsize="normal">
                                    <m:mi>t</m:mi>
                                 </m:mstyle>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#956;</m:mi>
                                    <m:mstyle mathvariant="bold" mathsize="normal">
                                       <m:mi>t</m:mi>
                                    </m:mstyle>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo stretchy="false">]</m:mo>
                                 <m:mo>}</m:mo>
                              </m:mrow>
                           </m:mtd>
                        </m:mtr>
                     </m:mtable>
                  </m:mrow>
                  <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeGadea7baaRcaaKjuaabaqaceaaaeaacqWGqbaucqGGOaakcqWH0baDcqGG8baFcqWH6bGEcqGGPaqkcqGHDisTcyGGLbqzcqGG4baEcqGGWbaCcqGG7bWEcqGHsisldaWcaaqaaiabigdaXaqaaiabikdaYaaacqGGBbWwcqGGOaakcqWH0baDcqGHsisliiqacqWF8oqBdaWgaaWcbaGaeCiDaqhabeaakiabcMcaPiabcEcaNiabcIcaOiabhUealnaaDaaaleaaiiGacqGFZoWzaeaacqGHsislcqaIXaqmaaGccqGHRaWkcqWFPoWvdaahaaWcbeqaaiabgkHiTiabigdaXaaakiabcMcaPiabcIcaOiabhsha0jabgkHiTiab=X7aTnaaBaaaleaacqWH0baDaeqaaOGaeiykaKIaeiyxa0LaeiyFa0NaemiCaahabaGaaCzcaiabg2Hi1kGbcwgaLjabcIha4jabcchaWjabcUha7jabgkHiTmaalaaabaGaeGymaedabaGaeGOmaidaaiabcUfaBjabcIcaOiabhsha0jabgkHiTiab=X7aTnaaBaaaleaacqWH0baDaeqaaOGaeiykaKIaei4jaCIaeiikaGIae8xQdCLaeyOeI0Iae8xQdCLaeiikaGIae8xQdCLaey4kaSIaeC4saS0aaSbaaSqaaiab+n7aNbqabaGccqGGPaqkdaahaaWcbeqaaiabgkHiTiabigdaXaaakiab=L6axjabcMcaPmaaCaaaleqabaGaeyOeI0IaeGymaedaaOGaeiikaGIaeCiDaqNaeyOeI0Iae8hVd02aaSbaaSqaaiabhsha0bqabaGccqGGPaqkcqGGDbqxcqGG9bqFaaaaaa@9174@</m:annotation>
               </m:semantics>
            </m:math>
         </p>
         <p>where <b>&#956;</b><sub><b>t </b></sub>= (<b>I</b><sub><b>n </b></sub>- <b>&#937;</b>(<b>&#937; </b>+ <b>K</b><sub><it>&#947;</it></sub>)<sup>-1</sup>)(<b>z </b>- <it>b</it><b>1<sub>n</sub></b>). &#160;&#160;&#160; (A5)</p>
         <p>Or, strictly,</p>
         <p><b>z </b>~ <it>N</it>(<it>b</it><b>1<sub>n</sub></b>, <b>K</b><sub><it>&#947; </it></sub>+ <b>&#937;</b>)</p>
         <p>and</p>
         <p><b>t</b>|<b>z </b>~ <it>N</it>((<b>I</b><sub><b>n </b></sub>- <b>&#937;</b>(<b>&#937; </b>+ <b>K</b><sub><it>&#947;</it></sub>)<sup>-1</sup>)(<b>z </b>- <it>b</it><b>1<sub>n</sub></b>), <b>&#937; </b>- <b>&#937;</b>(<b>&#937; </b>+ <b>K</b><sub><it>&#947;</it></sub>)<sup>-1</sup><b>&#937;</b>) &#160;&#160;&#160; (A6)</p>
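         <p>NOTE: By (A3), the matrix <b>&#937; </b>- <b>&#937;</b>(<b>&#937; </b>+ <b>K</b><sub><it>&#947;</it></sub>)<sup>-1</sup><b>&#937; </b>equals (<b>K</b><sub><it>&#947;</it></sub><sup>-1 </sup>+ <b>&#937;</b><sup>-1</sup>)<sup>-1 </sup>and is therefore non-negative definite.</p>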
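         <p>The identities in this appendix can be checked numerically. The following is a minimal sketch in Python with NumPy, not the implementation used in this study. It assumes a Gaussian (RBF) kernel as a stand-in for <b>K</b><sub><it>&#947;</it></sub>, a diagonal <b>&#937;</b>, and randomly generated data; the function names rbf_kernel and posterior_of_t are hypothetical and chosen only for illustration. It verifies (A3), compares the two forms of the conditional mean in (A2) and (A5), and confirms that the conditional covariance in (A6) has no negative eigenvalues.</p>

```python
import numpy as np


def rbf_kernel(X, gamma=0.5):
    # Hypothetical stand-in for K_gamma: a Gaussian (RBF) kernel matrix.
    sq = np.sum(X**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.exp(-gamma * d2)


def posterior_of_t(z, b, K, Omega):
    # Mean and covariance of t given z, following (A5) and (A6).
    n = len(z)
    A = Omega @ np.linalg.inv(Omega + K)  # Omega (Omega + K)^-1
    mean = (np.eye(n) - A) @ (z - b * np.ones(n))
    cov = Omega - A @ Omega
    return mean, cov


rng = np.random.default_rng(0)
n = 6
X = rng.normal(size=(n, 3))                     # assumed toy inputs
K = rbf_kernel(X) + 1e-6 * np.eye(n)            # small jitter keeps K invertible
Omega = np.diag(rng.uniform(0.5, 2.0, size=n))  # assumed diagonal Omega
z = rng.normal(size=n)
b = 0.3
inv = np.linalg.inv

# (A3) with A = Omega and C = K.
lhs = inv(Omega + K)
rhs = inv(Omega) - inv(Omega) @ inv(inv(Omega) + inv(K)) @ inv(Omega)
identity_ok = np.allclose(lhs, rhs)

mean, cov = posterior_of_t(z, b, K, Omega)

# The (A2) form of the mean should agree with the (A5) form.
mean_a2 = inv(inv(Omega) + inv(K)) @ inv(Omega) @ (z - b * np.ones(n))
mean_ok = np.allclose(mean, mean_a2)

# The (A6) covariance should equal (K^-1 + Omega^-1)^-1.
cov_ok = np.allclose(cov, inv(inv(K) + inv(Omega)))

# Smallest eigenvalue of the symmetrised covariance.
min_eig = np.linalg.eigvalsh((cov + cov.T) / 2).min()

print(identity_ok, mean_ok, cov_ok, min_eig)
```

         <p>The explicit inverses are used here only to mirror the notation of (A2) to (A6); for larger <it>n</it>, solving the corresponding linear systems (for example with a Cholesky factorization of <b>&#937; </b>+ <b>K</b><sub><it>&#947;</it></sub>) would usually be numerically preferable.</p>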
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was partially supported by the Loyola University Medical Center Research Development Funds, and the SUN Microsystems Academic Equipment Grant for Bioinformatics awarded to LWKC. We would like to thank Dr. Will Gersch at the Department of Information and Computer Sciences, University of Hawaii, for his helpful comments and suggestions.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Molecular classification of cancer: class discovery and class prediction by gene expression monitoring</p>
            </title>
            <aug>
               <au>
                  <snm>Golub</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Slonim</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Tamayo</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Huard</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gaasenbeek</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mesirov</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Coller</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Loh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Downing</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Caligiuri</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bloomfield</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1999</pubdate>
            <volume>286</volume>
            <fpage>531</fpage>
            <lpage>537</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.286.5439.531</pubid>
                  <pubid idtype="pmpid" link="fulltext">10521349</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Genome-Wide Views of Cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Golub</snm>
                  <fnm>TR</fnm>
               </au>
            </aug>
            <source>N Engl J Med</source>
            <pubdate>2001</pubdate>
            <volume>344</volume>
            <fpage>601</fpage>
            <lpage>602</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1056/NEJM200102223440809</pubid>
                  <pubid idtype="pmpid" link="fulltext">11207357</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>DNA Microarrays in Clinical Oncology</p>
            </title>
            <aug>
               <au>
                  <snm>Ramaswamy</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Golub</snm>
                  <fnm>TR</fnm>
               </au>
            </aug>
            <source>Journal of Clinical Oncology</source>
            <pubdate>2002</pubdate>
            <volume>20</volume>
            <fpage>1932</fpage>
            <lpage>1941</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11919254</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>"Cancer Genomics and Molecular Pattern Recognition" in Expression profiling of human tumors: diagnostic and research applications</p>
            </title>
            <aug>
               <au>
                  <snm>Tamayo</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ramaswamy</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <publisher>Humana Press</publisher>
            <editor>Marc Ladanyi, William Gerald</editor>
            <pubdate>2003</pubdate>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Some comments on C<sub><it>p</it></sub></p>
            </title>
            <aug>
               <au>
                  <snm>Mallows</snm>
                  <fnm>CL</fnm>
               </au>
            </aug>
            <source>Technometrics</source>
            <pubdate>1973</pubdate>
            <volume>15</volume>
            <fpage>661</fpage>
            <lpage>676</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/1267380</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Information theory and an extension of the maximum likelihood principle</p>
            </title>
            <aug>
               <au>
                  <snm>Akaike</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>2nd International Symposium on Information Theory</source>
            <editor>Petrov BN, Csaki F</editor>
            <pubdate>1973</pubdate>
            <fpage>267</fpage>
            <lpage>281</lpage>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Estimating the dimension of a model</p>
            </title>
            <aug>
               <au>
                  <snm>Schwarz</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Ann Statist</source>
            <pubdate>1978</pubdate>
            <volume>6</volume>
            <fpage>461</fpage>
            <lpage>464</lpage>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Calibration and empirical Bayes variable selection</p>
            </title>
            <aug>
               <au>
                  <snm>George</snm>
                  <fnm>EI</fnm>
               </au>
               <au>
                  <snm>Foster</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Biometrika</source>
            <pubdate>2000</pubdate>
            <volume>87</volume>
            <fpage>731</fpage>
            <lpage>747</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/biomet/87.4.731</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Efficient Empirical Bayes Variable Selection and Estimation in Linear Models</p>
            </title>
            <aug>
               <au>
                  <snm>Yuan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>J Amer Statist Assoc</source>
            <pubdate>2005</pubdate>
            <volume>100</volume>
            <fpage>1215</fpage>
            <lpage>1225</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1198/016214505000000367</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Gene selection: a Bayesian variable selection approach</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Sha</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Dougherty</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>Vannucci</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mallick</snm>
                  <fnm>BK</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>1</issue>
            <fpage>90</fpage>
            <lpage>97</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/19.1.90</pubid>
                  <pubid idtype="pmpid" link="fulltext">12499298</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Gene Prediction Using Multinomial Probit Regression with Bayesian Gene Selection</p>
            </title>
            <aug>
               <au>
                  <snm>Zhou</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Dougherty</snm>
                  <fnm>ER</fnm>
               </au>
            </aug>
            <source>EURASIP Journal on Applied Signal Processing</source>
            <pubdate>2004</pubdate>
            <volume>1</volume>
            <fpage>115</fpage>
            <lpage>124</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1155/S1110865704309157</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Cancer classification and prediction using logistic regression with Bayesian gene selection</p>
            </title>
            <aug>
               <au>
                  <snm>Zhou</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>STC</fnm>
               </au>
            </aug>
            <source>Journal of Biomedical Informatics</source>
            <pubdate>2004</pubdate>
            <volume>37</volume>
            <fpage>249</fpage>
            <lpage>259</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jbi.2004.07.009</pubid>
                  <pubid idtype="pmpid" link="fulltext">15465478</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Gene Selection using Logistic Regressions based on AIC, BIC and MDL Criteria</p>
            </title>
            <aug>
               <au>
                  <snm>Zhou</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Dougherty</snm>
                  <fnm>ER</fnm>
               </au>
            </aug>
            <source>New Mathematics and Natural Computation</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <fpage>129</fpage>
            <lpage>145</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1142/S179300570500007X</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>A Bayesian approach to non-linear probit gene selection and classification</p>
            </title>
            <aug>
               <au>
                  <snm>Zhou</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Dougherty</snm>
                  <fnm>ER</fnm>
               </au>
            </aug>
            <source>Journal of the Franklin Institute</source>
            <pubdate>2004</pubdate>
            <volume>341</volume>
            <fpage>137</fpage>
            <lpage>156</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.jfranklin.2003.12.010</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Diagnosis of multiple cancer types by shrunken centroids of gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hastie</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Narasimhan</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Chu</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>6567</fpage>
            <lpage>6572</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">124443</pubid>
                  <pubid idtype="pmpid" link="fulltext">12011421</pubid>
                  <pubid idtype="doi">10.1073/pnas.082099299</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>BagBoosting for tumor classification with gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Dettling</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>18</issue>
            <fpage>3583</fpage>
            <lpage>3593</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth447</pubid>
                  <pubid idtype="pmpid" link="fulltext">15466910</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Statistical Properties and Adaptive Tuning of Support Vector Machines</p>
            </title>
            <aug>
               <au>
                  <snm>Lin</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Wahba</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Machine Learning</source>
            <pubdate>2002</pubdate>
            <volume>48</volume>
            <fpage>115</fpage>
            <lpage>136</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1023/A:1013951620650</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Support Vector Machines and The Bayes Rule in Classification</p>
            </title>
            <aug>
               <au>
                  <snm>Lin</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Data Mining and Knowledge Discovery</source>
            <pubdate>2002</pubdate>
            <volume>6</volume>
            <fpage>259</fpage>
            <lpage>275</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1023/A:1015469627679</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Gene selection for cancer classification using support vector machines</p>
            </title>
            <aug>
               <au>
                  <snm>Guyon</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Weston</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Barnhill</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Vapnik</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Machine Learning</source>
            <pubdate>2002</pubdate>
            <volume>46</volume>
            <fpage>389</fpage>
            <lpage>422</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1023/A:1012487302797</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Kernel logistic regression and the import vector machine</p>
            </title>
            <aug>
               <au>
                  <snm>Zhu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hastie</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Journal of Computational and Graphical Statistics</source>
            <pubdate>2005</pubdate>
            <volume>14</volume>
            <issue>1</issue>
            <fpage>185</fpage>
            <lpage>205</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1198/106186005X25619</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Classification of Gene Microarrays by Penalized Logistic Regression</p>
            </title>
            <aug>
               <au>
                  <snm>Zhu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hastie</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Biostatistics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>3</issue>
            <fpage>427</fpage>
            <lpage>443</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/biostatistics/kxg046</pubid>
                  <pubid idtype="pmpid" link="fulltext">15208204</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The Evidence Framework Applied to Classification Networks</p>
            </title>
            <aug>
               <au>
                  <snm>MacKay</snm>
                  <fnm>DJC</fnm>
               </au>
            </aug>
            <source>Neural Computation</source>
            <pubdate>1992</pubdate>
            <volume>4</volume>
            <issue>5</issue>
            <fpage>720</fpage>
            <lpage>736</lpage>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The Evidence Framework Applied to Support Vector Machines</p>
            </title>
            <aug>
               <au>
                  <snm>Kwok</snm>
                  <fnm>JT</fnm>
               </au>
            </aug>
            <source>IEEE Trans on Neural Networks</source>
            <pubdate>2000</pubdate>
            <volume>11</volume>
            <fpage>1162</fpage>
            <lpage>1173</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1109/72.870047</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Bayesian framework for least-squares support vector machine classifiers, Gaussian processes, and kernel fisher discriminant analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Van Gestel</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Suykens</snm>
                  <fnm>JAK</fnm>
               </au>
               <au>
                  <snm>Lanckriet</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lambrechts</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>De Moor</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Vandewalle</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Neural Computation</source>
            <pubdate>2002</pubdate>
            <volume>14</volume>
            <issue>5</issue>
            <fpage>1115</fpage>
            <lpage>1147</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1162/089976602753633411</pubid>
                  <pubid idtype="pmpid" link="fulltext">11972910</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Bayesian Learning for Neural Networks</p>
            </title>
            <aug>
               <au>
                  <snm>Neal</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <publisher>Springer-Verlag, New York</publisher>
            <pubdate>1996</pubdate>
         </bibl>
         <bibl id="B26">
            <title>
               <p>An introduction to Support Vector Machines</p>
            </title>
            <aug>
               <au>
                  <snm>Cristianini</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Shawe-Taylor</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <publisher>Cambridge University Press</publisher>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Classification and diagnostic prediction of cancer using gene expression profiling and artificial neural networks</p>
            </title>
            <aug>
               <au>
                  <snm>Khan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ringner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Saal</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Ladanyi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Westermann</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Berthold</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Schwab</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Antonescu</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nature Medicine</source>
            <pubdate>2001</pubdate>
            <volume>7</volume>
            <issue>6</issue>
            <fpage>673</fpage>
            <lpage>679</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1282521</pubid>
                  <pubid idtype="pmpid" link="fulltext">11385503</pubid>
                  <pubid idtype="doi">10.1038/89044</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Gene expression profiles in hereditary breast cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Hedenfalk</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Duggan</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Radmacher</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bittner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Meltzer</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gusterson</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Esteller</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Raffeld</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>The New England Journal of Medicine</source>
            <pubdate>2001</pubdate>
            <volume>344</volume>
            <fpage>539</fpage>
            <lpage>548</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1056/NEJM200102223440801</pubid>
                  <pubid idtype="pmpid" link="fulltext">11207349</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Large-Scale simultaneous hypothesis testing: the choice of a null hypothesis</p>
            </title>
            <aug>
               <au>
                  <snm>Efron</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>J Amer Statist Assoc</source>
            <pubdate>2004</pubdate>
            <volume>99</volume>
            <fpage>96</fpage>
            <lpage>104</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1198/016214504000000089</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays</p>
            </title>
            <aug>
               <au>
                  <snm>Alon</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Barkai</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Notterman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gish</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Mack</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>6745</fpage>
            <lpage>6750</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1073/pnas.96.12.6745</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Bayesian data analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Gelman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Carlin</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Stern</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <publisher>Chapman &amp; Hall/CRC</publisher>
            <edition>2</edition>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Simulation of truncated normal variables</p>
            </title>
            <aug>
               <au>
                  <snm>Robert</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Statistics and Computing</source>
            <pubdate>1995</pubdate>
            <volume>5</volume>
            <fpage>121</fpage>
            <lpage>125</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/BF00143942</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Matrix Methods for Engineers and Scientists</p>
            </title>
            <aug>
               <au>
                  <snm>Barnett</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <publisher>McGraw-Hill</publisher>
            <pubdate>1979</pubdate>
         </bibl>
      </refgrp>
   </bm>
</art>
