<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1755-8794-1-8</ui>
   <ji>1755-8794</ji>
   <fm>
      <dochead>Technical advance</dochead>
      <bibl>
         <title>
            <p>Predictive gene lists for breast cancer prognosis: A topographic visualisation study</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Sivaraksa</snm>
               <fnm>Mingmanas</fnm>
               <insr iid="I1"/>
               <email>sivarakm@aston.ac.uk</email>
            </au>
            <au id="A2">
               <snm>Lowe</snm>
               <fnm>David</fnm>
               <insr iid="I1"/>
               <email>d.lowe@aston.ac.uk</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Neural Computing Research Group, Aston University, Birmingham, UK</p>
            </ins>
         </insg>
         <source>BMC Medical Genomics</source>
         <issn>1755-8794</issn>
         <pubdate>2008</pubdate>
         <volume>1</volume>
         <issue>1</issue>
         <fpage>8</fpage>
         <url>http://www.biomedcentral.com/1755-8794/1/8</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18419801</pubid>
               <pubid idtype="doi">10.1186/1755-8794-1-8</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>26</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>17</day>
               <month>4</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>17</day>
               <month>4</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Sivaraksa and Lowe; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The controversy surrounding the non-uniqueness of predictive gene lists (PGL) of small selected subsets of genes from very large potential candidates as available in DNA microarray experiments is now widely acknowledged <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Many of these studies have focused on constructing discriminative semi-parametric models and as such are also subject to the issue of random correlations of sparse model selection in high dimensional spaces. In this work we outline a different approach based around an unsupervised patient-specific nonlinear topographic projection in predictive gene lists.</p>
            </sec>
            <sec>
               <st>
                  <p>Methods</p>
               </st>
               <p>We construct nonlinear topographic projection maps based on inter-patient gene-list relative dissimilarities. The Neuroscale, the Stochastic Neighbor Embedding(SNE) and the Locally Linear Embedding(LLE) techniques have been used to construct two-dimensional projective visualisation plots of 70 dimensional PGLs per patient, classifiers are also constructed to identify the prognosis indicator of each patient using the resulting projections from those visualisation techniques and investigate whether <it>a-posteriori </it>two prognosis groups are separable on the evidence of the gene lists.</p>
               <p>A literature-proposed predictive gene list for breast cancer is benchmarked against a separate gene list using the above methods. Generalisation ability is investigated by using the mapping capability of Neuroscale to visualise the follow-up study, but based on the projections derived from the original dataset.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>The results indicate that small subsets of patient-specific PGLs have insufficient prognostic dissimilarity to permit a distinction between two prognosis patients. Uncertainty and diversity across multiple gene expressions prevents unambiguous or even confident patient grouping. Comparative projections across different PGLs provide similar results.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The random correlation effect to an arbitrary outcome induced by small subset selection from very high dimensional interrelated gene expression profiles leads to an outcome with associated uncertainty. This continuum and uncertainty precludes any attempts at constructing discriminative classifiers.</p>
               <p>However a patient's gene expression profile could possibly be used in treatment planning, based on knowledge of other patients' responses.</p>
               <p>We conclude that many of the patients involved in such medical studies are <it>intrinsically unclassifiable </it>on the basis of provided PGL evidence. This additional category of 'unclassifiable' should be accommodated within medical decision support systems if serious errors and unnecessary adjuvant therapy are to be avoided.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Metastasis is crucial in determining the life expectancy of breast cancer patients. Numerous studies have focused on searching for methods to predict the predilection of cancer patients to metastasize. Traditional methods fail to correctly predict the outcome of patients who reach metastasis leading to unnecessary clinical adjuvant therapy, such as chemotherapy. Gene profiling based, for example, on DNA microarray technology, has the potential to be a more reliable method allowing better prediction of patient cancer outcome. However, using lists of many thousands of genes is uninformative and not useful in providing insight for the specialist, nor for discovering the role of specific genes. Feature selection methods have been applied to filter the number of genes which are correlated with outcome to produce a much smaller (typically of the order of a few tens) and more informative 'predictive gene list' (PGL), ideally consisting of the key genes which control the behaviour of the cancer.</p>
         <p>Despite the obvious benefits of producing an informative and small PGL, the difficulty is how to perform the feature selection in the absence of good quality functional models of the individual gene pathways. The alternative is to use data-driven data-mining approaches and seek correlations between response and outcome to rank potential genes. To rank the genes requires an appropriate metric. Although feature selection and feature extraction are often unsupervised methods, in the literature in this domain it is more common to be based on a supervised approach based on a specific choice of a nonparametric model linking the gene expressions to the outcomes. For example, using a classification model to infer likely outcome conditioned on expression values allows saliency of individual genes to be obtained, relevant to the classified outcome, e.g. good or poor prognosis patients. This saliency is of course dependent on the chosen model, the pre-specified outcomes, and the specific data used to construct the nonparametric classifier model. It is usually assumed that the data used to construct the classifier is <it>representative </it>of the problem so that results obtained are not highly sensitive on the specific choice of data. However, a different choice of model, or different choice of outcome would modify the saliency even if the chosen model was correct. In addition, in problems of such large input dimensionality (5000 or more on microarray chips), and relative sparsity of patient examples (a few hundred is typical), it is statistically plausible to select small subsets which are <it>randomly </it>correlated with <it>any </it>given desired outcome, <it>irrespective </it>of any biological functionality of the gene expression itself. This aspect has already been discussed in <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp> for example. Therefore the question arises as to whether a specific PGL can be obtained based on clinical datasets, given these concerns over reliability of pattern processing techniques.</p>
         <p>Almost all nonlinear studies so far have examined supervised approaches to patient discrimination. A major problem with dealing with such high dimensional data is the lack of reliable approaches to investigate and compare patient-specific gene expression profiles separate to the construction of supervised models. We wish to explore an alternative analysis approach, based on <it>unsupervised</it>, nonlinear, topographic (structure-preserving) projection and visualisation methods.</p>
         <p>This paper explores several recent nonlinear visualisation models applied to the data introspection of the van't Veer breast cancer study <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. The approach can be used to <it>a-posteriori </it>explore whether there exists likely discriminability between patient groups of good and poor prognosis for example. For comparison with the preferred PGL selected by the van't Veer study, we also select a PGL based on cross-patient consistency rather than correlation with outcome and examine its performance also by these data introspection methods.</p>
         <sec>
            <st>
               <p>Reviews</p>
            </st>
            <p>We briefly overview some relevant recent works which have explored different classification, discrimination and clustering techniques to represent the separation between two groups of prognosis signature patients. The studies of van't Veer's group <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp> have suggested that a PGL of 70 specifically selected genes has proven accurate in out-of-sample patient prognosis of metastasis.</p>
            <p>However, other studies have concluded that the likelihood there exists a 'best' small-size predictive gene list which can be used to reliably improve the ability of patient-specific prognosis using automated pattern processing techniques is unlikely. In very recent work <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, analysing supervised machine learning approaches across several public domain data sets, it was found that many gene sets are capable of predicting molecular phenotypes accurately. Hence it is not surprising that expression profiles identified using different training datasets selected from a larger cohort, should show little agreement. It was also demonstrated that predicting relapse directly from microarray data using supervised machine learning approaches was not viable.</p>
            <p>In other work <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, it was shown that the specific example of the van't Veer PGL selection of 70 genes was no more effective at prognosis than the Nottingham Prognostic Indicator (NPI) or a suitably trained artificial neural network using traditional non-genomic biomarkers. This is not surprising from a systems biology perspective, where we would regard cancer as the result of complex interactions between genetic, biological and environmental influences.</p>
            <p>In <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, they also found that the top 70 most correlated genes in the van't Veer study can vary significantly depending on the specific training set of patients used. Different randomly selected 70 gene PGL's were selected and shown to have similar prediction ability. They suggested that there is no unique set of genes that can be assumed to be the best or the only set of genes for prognosis accuracy of breast cancer. A follow-up study <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> also suggested a similar conclusion, that we can not create a definitive classifier from a small subset of genes based on the small patient datasets available. Generally, large patient sample sizes are needed to produce viable and robust prediction outcomes of cancer prognosis.</p>
            <sec>
               <st>
                  <p>Projective Visualisation</p>
               </st>
               <p>Projective data visualisation is an approach for introspection of large dimensional datasets by extracting useful information and representing it in a more meaningful way that can be more easily interpreted prior to deciding upon subsequent analysis such as constructing classifiers <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The approach is very useful for interpreting data by simply observing two or three dimensional projective maps of the original dataspace, where relative positioning of data points reflects some form of structural similarities in the original dataspace. This allows the easy recognition of anomalous data points, outliers, implicit clustering and relative dissimilarity.</p>
               <p>In microarray data, the combination of large dimensionality, noise, and sparse patient samples makes it almost impossible to explore and extract useful information contained in the data. Dimensionality reduction techniques are required for visualising microarray data. The dendrogram is one of the traditional approaches to perform microarray data clustering. However it usually produces a suboptimal local clustering solution and is not effective as a spatial visualisation tool to reflect relative dissimilarities. Many other algorithms for reduced dimensionality representation have previously been used to visualise microarray data. For instance, the Self Organising Map (SOM) has been used to investigate yeast <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> and human cancers <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> and <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, the latter in combination with the <it>k</it>-means algorithm. Analogously, Principal Component Analysis (PCA) has been used to investigate yeast <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> and to identify tissue-specific expression of human genes <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. However, both SOM and PCA have significant drawbacks. PCA is a variance-preserving <it>linear projection</it>, and this limitation does not lead to a topographic representation <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. On the other hand, the SOM lacks a sound theoretical underpinning (for example, there is no cost function to optimise, and training parameters must be chosen arbitrarily).</p>
               <p>We therefore seek principled approaches to unsupervised data introspection which are nonlinear (since microarray data distributions are unlikely to be distributed on a linear manifold in high dimensional spaces). In this paper we will explore the Neuroscale model <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>, Local Linear Embeddings (LLE) <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> and Stochastic Neighbor Embeddings (SNE) <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>The van't Veer Data set</p>
            </st>
            <p>We re-visit the well-known study of van't Veer et.al. <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> in which we focus on 78 sporadic lymph-node negative patients. Of these 78 patients, 34 developed distant metastases within 5 years and 44 remained free of cancer in that period. These are regarded as poor and good-prognosis groups respectively. The interest is whether the information in a gene expression profile alone could be used to perform a patient-specific prognosis separation between those two groups of patients. We will primarily use structure-preserving projective visualisation techniques to investigate this possibility. In the van't Veer study, from an initial set of 24481 human genes synthesised by inkjet microarray technology, about 5000 genes were found to be significantly expressed. They ranked genes by the magnitude of the correlation coefficient and eventually reduced the number of genes to 70, the number of genes which maximised a specific classification model. The centroid-based classifier they constructed could allocate 83% of the patients into the correct prognosis groups with 5 poor prognosis and 8 good-prognosis patients misclassified into the opposite categories.</p>
         </sec>
         <sec>
            <st>
               <p>An alternative PGL</p>
            </st>
            <p>To illustrate the lack of uniqueness of capability of the van't Veer gene list, which we denote List A in this paper, we compare results on a different gene list, denoted List B, selected on the basis of cross-patient consistency rather than maximising classification accuracy on a specific classification model. Let <b>x</b><sup><it>i </it></sup>denote the gene expression vector for patient <it>i </it>of the van't Veer PGL. <b>x</b><sub><it>G</it></sub>, where <it>G </it>= {1, 2,..., 44} represents a set of expression values across all good prognosis patients, and <b>x</b><sub><it>P</it></sub>, where <it>P </it>= {45, 46,..., 78} represents the set of all poor prognosis patients.</p>
            <p>The variance of individual gene expression values across each patient group is estimated by</p>
            <p>
               <display-formula>
                  <m:math name="1755-8794-1-8-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msubsup>
                              <m:mi>&#963;</m:mi>
                              <m:mi>L</m:mi>
                              <m:mn>2</m:mn>
                           </m:msubsup>
                           <m:mo>=</m:mo>
                           <m:msub>
                              <m:mrow>
                                 <m:mo>&#9001;</m:mo>
                                 <m:msup>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>x</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>x</m:mi>
                                             <m:mo>&#175;</m:mo>
                                          </m:mover>
                                          <m:mi>L</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                    <m:mn>2</m:mn>
                                 </m:msup>
                                 <m:mo>&#9002;</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mo>&#8712;</m:mo>
                                 <m:mi>L</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeq4Wdm3aa0baaSqaaiabdYeambqaaiabikdaYaaakiabg2da9iabgMYiHlabcIcaOGqabiab=Hha4naaBaaaleaacqWGPbqAaeqaaOGaeyOeI0Iaf8hEaGNbaebadaWgaaWcbaGaemitaWeabeaakiabcMcaPmaaCaaaleqabaGaeGOmaidaaOGaeyOkJe=aaSbaaSqaaiabdMgaPjabgIGiolabdYeambqabaGccqGGSaalaaa@438B@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>L </it>= {<it>G, P</it>} and the average is taken across all patients. Assume <inline-formula><m:math name="1755-8794-1-8-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>R</m:mi><m:mi>j</m:mi><m:mi>L</m:mi></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOuai1aa0baaSqaaiabdQgaQbqaaiabdYeambaaaaa@2FAD@</m:annotation></m:semantics></m:math></inline-formula> is the rank order of the variance of gene <it>j </it>for each patient group. The unique top <it>T </it>ranked genes from each group are extracted,</p>
            <p>
               <display-formula>
                  <m:math name="1755-8794-1-8-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable columnalign="left">
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>L</m:mi>
                                          <m:mi>G</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mo>{</m:mo>
                                       <m:mi>j</m:mi>
                                       <m:mo>|</m:mo>
                                       <m:msubsup>
                                          <m:mi>R</m:mi>
                                          <m:mi>j</m:mi>
                                          <m:mi>G</m:mi>
                                       </m:msubsup>
                                       <m:mo>&#8804;</m:mo>
                                       <m:mi>T</m:mi>
                                       <m:mo>}</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr columnalign="left">
                                 <m:mtd columnalign="left">
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>L</m:mi>
                                          <m:mi>P</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mo>{</m:mo>
                                       <m:mi>j</m:mi>
                                       <m:mo>|</m:mo>
                                       <m:msubsup>
                                          <m:mi>R</m:mi>
                                          <m:mi>j</m:mi>
                                          <m:mi>P</m:mi>
                                       </m:msubsup>
                                       <m:mo>&#8804;</m:mo>
                                       <m:mi>T</m:mi>
                                       <m:mo>}</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeaabiqaaaqaaiabdYeamnaaBaaaleaacqWGhbWraeqaaOGaeyypa0Jaei4EaSNaemOAaOMaeiiFaWNaemOuai1aa0baaSqaaiabdQgaQbqaaiabdEeahbaatuuDJXwAK1uy0HMmaeHbfv3ySLgzG0uy0HgiuD3BaGabaOGae8hzIqOaemivaqLaeiyFa0habaGaemitaW0aaSbaaSqaaiabdcfaqbqabaGccqGH9aqpcqGG7bWEcqWGQbGAcqGG8baFcqWGsbGudaqhaaWcbaGaemOAaOgabaGaemiuaafaaOGae8hzIqOaemivaqLaeiyFa0haaaaa@55C7@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>The number of <it>T </it>genes is chosen so that List B has a total number of genes equal to 70, the same as List A. Specifically, in this case the 35 lowest non-overlapped variance genes from each patient group were extracted.</p>
            <p>
               <display-formula>List B = {<it>L</it><sub><it>G </it></sub>&#8746; <it>L</it><sub><it>P</it></sub>} - {<it>L</it><sub><it>G </it></sub>&#8745; <it>L</it><sub><it>P</it></sub>}.</display-formula>
            </p>
            <p>This selection criterion emphasises <it>consistency </it>of gene expression across patients, rather than explicitly seeking discrimination (see table <tblr tid="T25">25</tblr> for list of genes). Examining the details of the two 70-gene subsets, we observe that there are only <it>five </it>genes in common between the van't Veer study and this alternative gene list. If List A has superior prognostic value, its projective visualisation and discrimination properties should be better than those of List B, since List A was chosen explicitly to maximise discrimination.</p>
            <tbl id="T25">
               <title>
                  <p>Table 25</p>
               </title>
               <caption>
                  <p>The alternative gene list</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="center">
                        <p>AA553619.RC</p>
                     </c>
                     <c ca="center">
                        <p>AB023216</p>
                     </c>
                     <c ca="center">
                        <p>AB032954</p>
                     </c>
                     <c ca="center">
                        <p>AF065241</p>
                     </c>
                     <c ca="center">
                        <p>AL050065</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Contig11065.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig14706.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig14882.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig15031.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig31839.RC</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Contig34302.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig35229.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig37063.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig37262</p>
                     </c>
                     <c ca="center">
                        <p>Contig39090.RC</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Contig42162.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig46.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig46223.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig49818.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig51800</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Contig55189.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig55377.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig56457.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig753.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig760.RC</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Contig8930.RC</p>
                     </c>
                     <c ca="center">
                        <p>Contig8950.RC</p>
                     </c>
                     <c ca="center">
                        <p>NM.000272</p>
                     </c>
                     <c ca="center">
                        <p>NM.000286</p>
                     </c>
                     <c ca="center">
                        <p>NM.000320</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>NM.000419</p>
                     </c>
                     <c ca="center">
                        <p>NM.000540</p>
                     </c>
                     <c ca="center">
                        <p>NM.000849</p>
                     </c>
                     <c ca="center">
                        <p>NM.001879</p>
                     </c>
                     <c ca="center">
                        <p>NM.002624</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>NM.003686</p>
                     </c>
                     <c ca="center">
                        <p>NM.003778</p>
                     </c>
                     <c ca="center">
                        <p>NM.003858</p>
                     </c>
                     <c ca="center">
                        <p>NM.004087</p>
                     </c>
                     <c ca="center">
                        <p>NM.004273</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>NM.004336</p>
                     </c>
                     <c ca="center">
                        <p>NM.004456</p>
                     </c>
                     <c ca="center">
                        <p>NM.004701</p>
                     </c>
                     <c ca="center">
                        <p>NM.004791</p>
                     </c>
                     <c ca="center">
                        <p>NM.005008</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>NM.005087</p>
                     </c>
                     <c ca="center">
                        <p>NM.005744</p>
                     </c>
                     <c ca="center">
                        <p>NM.006260</p>
                     </c>
                     <c ca="center">
                        <p>NM.006547</p>
                     </c>
                     <c ca="center">
                        <p>NM.007359</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>NM.012177</p>
                     </c>
                     <c ca="center">
                        <p>NM.012261</p>
                     </c>
                     <c ca="center">
                        <p>NM.012310</p>
                     </c>
                     <c ca="center">
                        <p>NM.012406</p>
                     </c>
                     <c ca="center">
                        <p>NM.014093</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>NM.014264</p>
                     </c>
                     <c ca="center">
                        <p>NM.014321</p>
                     </c>
                     <c ca="center">
                        <p>NM.014404</p>
                     </c>
                     <c ca="center">
                        <p>NM.014547</p>
                     </c>
                     <c ca="center">
                        <p>NM.014675</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>NM.014968</p>
                     </c>
                     <c ca="center">
                        <p>NM.015434</p>
                     </c>
                     <c ca="center">
                        <p>NM.017926</p>
                     </c>
                     <c ca="center">
                        <p>NM.018089</p>
                     </c>
                     <c ca="center">
                        <p>NM.018098</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>NM.018313</p>
                     </c>
                     <c ca="center">
                        <p>NM.018488</p>
                     </c>
                     <c ca="center">
                        <p>NM.020123</p>
                     </c>
                     <c ca="center">
                        <p>NM.020386</p>
                     </c>
                     <c ca="center">
                        <p>NM.021033</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>The validation data set of van de Vijver <abbrgrp><abbr bid="B4">4</abbr></abbrgrp></p>
            </st>
            <p>A follow-up study by the same group was performed which followed the progression of another set of patients to verify the original study. This follow-up data set contains 295 patients with 106 poor-prognosis and 189 good-prognosis patients. The poor-prognosis patients are categorised into 3 sub-categories.</p>
            <p>Patients with metastasis but who did not die as a direct result of the metastasis, patients who died without developing metastasis and the patients who developed metastasis and eventually died. Within these 295 patients, 61 of them are present in the previous study. Those patients will be removed in this paper to ensure we can examine generalisation on a separate set of 234 different patients, 159 of whom are categorised as good-prognosis.</p>
         </sec>
         <sec>
            <st>
               <p>Topographic Visualisation</p>
            </st>
            <p>Topographic mappings are mechanisms that map the data in a high dimensional space into a low dimensional space in such a way that preserves the structure of the data. This structure of the data usually means the relative distances between points in the high dimensional space, where a suitable distance function is used to reflect the prior knowledge of the domain. In other words, the points that are close together in a high dimensional space should also stay together in a lower dimensional projection space, and data that lie far apart in a high dimensional feature space should remain significantly separated in the lower dimensional projection space.</p>
            <p>Both the large quantity of genes and multiple samples of microarray data make it difficult to represent the entire data set and to identify the interesting genes easily. To make understanding easier, a representation is needed to compress all the information in a lower dimensional space. Moreover, each individual patient can be visualised as to whether that patient has expression values closer to the those patients in different prognosis groups. Topographic projection maps do not assume the existence of clusters, boundaries or classes and so are not subject to the same criticisms as supervised approaches for data mining.</p>
            <p>In this paper, three reliable topographic techniques are used for embedding the van't Veer data set: NeuroScale <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, Locally Linear Embedding (LLE)and Stochastic Neighbor Embedding (SNE) <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>NeuroScale</p>
            </st>
            <p>In a Neuroscale <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> topographic map the distribution and trained relative positions of the points in the projection space are determined to reflect the relative dissimilarity between data measurements (gene expression values) in the high-dimensional space, and hence generalises the established Sammon map concept. <it>N </it>measurement vectors <b>x</b><sub><it>i </it></sub>in &#8477;<sup><it>p </it></sup>are transformed using a Radial Basis Function (RBF) <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp> network to a corresponding set of feature (visualisation) vectors <b>y</b><sub><it>i </it></sub>in &#8477;<sup><it>q</it></sup>. An RBF comprises a single hidden layer of <it>h </it>neurons which represents a set of basis functions, each of which has a centre located at some point in the input space. Generally, <it>q </it>&#8810; <it>p </it>as dimension reduction is desired, and typically <it>q </it>= 2 for visualisation. The RBF is a semi-parametric kernel model such that <b>y</b><sub><it>i </it></sub>= <b>W &#934; </b>(<b>r</b><sub><b>i</b></sub>), where the set of weights <it>W </it>can be optimised from a training data set. Thin plate spline functions <b>&#934; </b>(<b>r</b>) = <b>r</b><sup>2 </sup>log(<b>r</b>), where <inline-formula><m:math name="1755-8794-1-8-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>r</m:mi><m:mi>i</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaacbeGae8NCai3aa0baaSqaaiabdMgaPbqaaiabikdaYaaaaaa@2FC2@</m:annotation></m:semantics></m:math></inline-formula> = ||<b>x</b><sub><it>i </it></sub>- <b>c</b>|| will be used in this experiment. In the Radial Basis Function network trained traditionally for regression problems, the desired output of the network or the target values are already identified through a supervised learning problem. However, in order to use an RBF in the topographic reduction problem, which does not have predefined targets but only considers the distance-preserving nature of the target values, the traditional supervised Radial Basis Function network needs to be modified by using <it>relative supervision </it>where it is only the relative distances between pattern pairs which are important <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. The quality of the projection is measured by the <it>Sammon stress metric </it>(n.b. we are using a reduced form here, neglecting a denominator often employed):</p>
            <p>
               <display-formula id="M1">
                  <m:math name="1755-8794-1-8-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>E</m:mi>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mi>i</m:mi>
                                 <m:mi>N</m:mi>
                              </m:munderover>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munderover>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mi>j</m:mi>
                                       <m:mi>N</m:mi>
                                    </m:munderover>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msubsup>
                                                <m:mi>d</m:mi>
                                                <m:mrow>
                                                   <m:mi>i</m:mi>
                                                   <m:mi>j</m:mi>
                                                </m:mrow>
                                                <m:mo>&#8727;</m:mo>
                                             </m:msubsup>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>d</m:mi>
                                                <m:mrow>
                                                   <m:mi>i</m:mi>
                                                   <m:mi>j</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msup>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mstyle>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyrauKaeyypa0ZaaabCaeaadaaeWbqaaiabcIcaOiabdsgaKnaaDaaaleaacqWGPbqAcqWGQbGAaeaacqGHxiIkaaGccqGHsislcqWGKbazdaWgaaWcbaGaemyAaKMaemOAaOgabeaakiabcMcaPmaaCaaaleqabaGaeGOmaidaaaqaaiabdQgaQbqaaiabd6eaobqdcqGHris5aaWcbaGaemyAaKgabaGaemOta4eaniabggHiLdGccqGGSaalaaa@45CD@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>d</it><sub><it>ij </it></sub>= ||<b>y</b><sub><it>i </it></sub>- <b>y</b><sub><it>j</it></sub>|| and <inline-formula><m:math name="1755-8794-1-8-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>d</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemizaq2aa0baaSqaaiabdMgaPjabdQgaQbqaaiabgEHiQaaaaaa@30FA@</m:annotation></m:semantics></m:math></inline-formula> = ||<b>x</b><sub><it>i </it></sub>- <b>x</b><sub><it>j</it></sub>|| represent the inter-point distances in projection space and data space respectively. The aim of the training process is to set the parameters of the RBF weight matrix to minimise the stress metric and hence capture the the functional relationship between the original data distribution and the projected images. The NeuroScale model is used to visualise and interrogate our data set, considered as subsets of an array of 78 &#215; 70 patterns. Once the functional mapping has been obtained using NeuroScale, the model can be reused without reconstructing the projection over the extended data by just passing the new data points, <b>x</b><sub><it>new </it></sub>through the transformation function. <b>y</b><sub><it>new </it></sub>= <it>f </it>(<b>x</b><sub><it>new</it></sub>, <b>W</b>).</p>
            <p>The number and location of the centres needs to be determined. However choices of centres are robust to the outcome <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> since an implicit smoothing regularisation is used as part of the optimisation process. As a normal practice, the number of centres is chosen to be the same as the number of training data points, so that each data point can be used as a centre of the RBF functions. In this experiment the Netlab toolbox <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> is used to help construct the NeuroScale projections.</p>
         </sec>
         <sec>
            <st>
               <p>LLE</p>
            </st>
            <p>Locally Linear Embedding (LLE) <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> aims to preserve the local neighbourhood area around a point so that nearby points in high dimensional space remain nearby and similarly co-located with respect to one another in the low dimensional space by preserving the neighbouring distance linearly. Provided there is sufficient data, we expect each data point and its neighbours to lie on or close to a locally linear patch of the manifold. In the simplest formulation of LLE, the method will preserve weights for each data point to the surrounding <it>K </it>nearest neighbours per data point, as measured by Euclidean distance from a point of interest. The optimal weights for each data point to the surrounding <it>K </it>nearest neighbours are given by:</p>
            <p>
               <display-formula id="M2">
                  <m:math name="1755-8794-1-8-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>W</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mi>j</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:msub>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mi>k</m:mi>
                                    </m:msub>
                                    <m:mrow>
                                       <m:msubsup>
                                          <m:mi>C</m:mi>
                                          <m:mrow>
                                             <m:mi>j</m:mi>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mn>1</m:mn>
                                          </m:mrow>
                                       </m:msubsup>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:msub>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>l</m:mi>
                                          <m:mi>m</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mrow>
                                       <m:msubsup>
                                          <m:mi>C</m:mi>
                                          <m:mrow>
                                             <m:mi>l</m:mi>
                                             <m:mi>m</m:mi>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mn>1</m:mn>
                                          </m:mrow>
                                       </m:msubsup>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4vaC1aaSbaaSqaaiabdMgaPjabdQgaQbqabaGccqGH9aqpjuaGdaWcaaqaamaaqababaGaem4qam0aa0baaeaacqWGQbGAcqWGRbWAaeaacqWGPbqAcqGHsislcqaIXaqmaaaabaGaem4AaSgabeGaeyyeIuoaaeaadaaeqaqaaiabdoeadnaaDaaabaGaemiBaWMaemyBa0gabaGaemyAaKMaeyOeI0IaeGymaedaaaqaaiabdYgaSjabd2gaTbqabiabggHiLdaaaOGaeiOla4caaa@48E5@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <inline-formula><m:math name="1755-8794-1-8-i8" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>C</m:mi><m:mrow><m:mi>j</m:mi><m:mi>k</m:mi></m:mrow><m:mi>i</m:mi></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4qam0aa0baaSqaaiabdQgaQjabdUgaRbqaaiabdMgaPbaaaaa@3128@</m:annotation></m:semantics></m:math></inline-formula> is a covariance matrix within the neighbourhood of <b>x</b><sub><it>i </it></sub>and <it>&#951;</it><sub><it>j </it></sub>is the neighbour of the data point <it>x</it><sub><it>i</it></sub>.</p>
            <p>
               <display-formula id="M3">
                  <m:math name="1755-8794-1-8-i9" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msubsup>
                              <m:mi>C</m:mi>
                              <m:mrow>
                                 <m:mi>j</m:mi>
                                 <m:mi>k</m:mi>
                              </m:mrow>
                              <m:mi>i</m:mi>
                           </m:msubsup>
                           <m:mo>=</m:mo>
                           <m:msup>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msub>
                                    <m:mi>x</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#951;</m:mi>
                                    <m:mi>j</m:mi>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mi>T</m:mi>
                           </m:msup>
                           <m:mo>&#8901;</m:mo>
                           <m:mo stretchy="false">(</m:mo>
                           <m:msub>
                              <m:mi>x</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>&#8722;</m:mo>
                           <m:msub>
                              <m:mi>&#951;</m:mi>
                              <m:mi>k</m:mi>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4qam0aa0baaSqaaiabdQgaQjabdUgaRbqaaiabdMgaPbaakiabg2da9iabcIcaOGqabiab=Hha4naaBaaaleaacqWGPbqAaeqaaOGaeyOeI0Iaeq4TdG2aaSbaaSqaaiabdQgaQbqabaGccqGGPaqkdaahaaWcbeqaaiabdsfaubaakiabgwSixlabcIcaOiab=Hha4naaBaaaleaacqWGPbqAaeqaaOGaeyOeI0Iaeq4TdG2aaSbaaSqaaiabdUgaRbqabaGccqGGPaqkcqGGUaGlaaa@48F0@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Each high dimensional point <b>x</b><sub><it>i </it></sub>is mapped to a low dimensional point <b>y</b><sub><it>i </it></sub>in low dimensional space, representing global internal coordinates on the manifold. This is done by choosing <it>d</it>-dimensional coordinates <b>y</b><sub><it>i </it></sub>to minimise the cost function in low dimensional space:</p>
            <p>
               <display-formula id="M4">
                  <m:math name="1755-8794-1-8-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>&#934;</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>Y</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mi>i</m:mi>
                                 <m:mi>N</m:mi>
                              </m:munderover>
                              <m:mrow>
                                 <m:mo>|</m:mo>
                                 <m:msub>
                                    <m:mi>y</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mstyle displaystyle="true">
                                    <m:munderover>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mi>j</m:mi>
                                       <m:mi>K</m:mi>
                                    </m:munderover>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>W</m:mi>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mi>j</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:msub>
                                          <m:mi>y</m:mi>
                                          <m:mi>j</m:mi>
                                       </m:msub>
                                       <m:msup>
                                          <m:mo>|</m:mo>
                                          <m:mn>2</m:mn>
                                       </m:msup>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mstyle>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeuOPdyKaeiikaGIaemywaKLaeiykaKIaeyypa0ZaaabCaeaacqGG8baFieqacqWF5bqEdaWgaaWcbaGaemyAaKgabeaakiabgkHiTmaaqahabaGaem4vaC1aaSbaaSqaaiabdMgaPjabdQgaQbqabaGccqWF5bqEdaWgaaWcbaGaemOAaOgabeaakiabcYha8naaCaaaleqabaGaeGOmaidaaaqaaiabdQgaQbqaaiabdUealbqdcqGHris5aaWcbaGaemyAaKgabaGaemOta4eaniabggHiLdGccqGGSaalaaa@4B3C@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>W</it><sub><it>ij </it></sub>is fixed from the high dimensional space. This algorithm has only one free parameter: the number of neighbours per data point, <it>K</it>. The higher the value of <it>K</it>, the more similar to the NeuroScale method this method will be. This <it>K </it>practically, is very hard to find to suit the given data set.</p>
            <p>Furthermore, it is hard to find an appropriate value of <it>K </it>which performs well across different choices of data sets. It is typically much smaller than the number of data points. In the experiments here we will show results using <it>K </it>= 5, 20. Furthermore, LLE has a further disadvantage over NeuroScale in that the NeuroScale model can produce a transformation function which can be used for generalisation. Any new patient gene vector can be projected down by using this existing function without recomputing a full new projection, which can be very computational expensive. In addition, the result using LLE is sensitive to the choice of neighbours, <it>K </it>while NeuroScale gives quite consistent results <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Stochastic Neighbor Embedding</p>
            </st>
            <p>Stochastic Neighbor Embedding <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> also uses a pairwise similarity but measures similarity using a probabilistic distance approach to preserve the neighbourhood identity. A Gaussian distribution is centred on each object point in the high dimensional space and a probability density is defined over all the potential neighbours of that point. This approach permits a 1-to-many mapping of high dimensional points to projection space.</p>
            <p>The high dimensional related probability for each point, <it>i</it>, and each potential neighbour, <it>j</it>, is computed using the asymmetric probability, <it>p</it><sub><it>ij</it></sub>, that <it>i </it>would pick <it>j </it>as its neighbour.</p>
            <p>
               <display-formula id="M5">
                  <m:math name="1755-8794-1-8-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>p</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mi>j</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>exp</m:mi>
                                 <m:mo>&#8289;</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msubsup>
                                    <m:mi>d</m:mi>
                                    <m:mrow>
                                       <m:mi>i</m:mi>
                                       <m:mi>j</m:mi>
                                    </m:mrow>
                                    <m:mn>2</m:mn>
                                 </m:msubsup>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:msub>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>k</m:mi>
                                          <m:mo>&#8800;</m:mo>
                                          <m:mi>i</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mrow>
                                       <m:mi>exp</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msubsup>
                                          <m:mi>d</m:mi>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiCaa3aaSbaaSqaaiabdMgaPjabdQgaQbqabaGccqGH9aqpjuaGdaWcaaqaaiGbcwgaLjabcIha4jabcchaWjabcIcaOiabgkHiTiabdsgaKnaaDaaabaGaemyAaKMaemOAaOgabaGaeGOmaidaaiabcMcaPaqaamaaqababaGagiyzauMaeiiEaGNaeiiCaaNaeiikaGIaeyOeI0Iaemizaq2aa0baaeaacqWGPbqAcqWGRbWAaeaacqaIYaGmaaGaeiykaKcabaGaem4AaSMaeyiyIKRaemyAaKgabeGaeyyeIuoaaaGccqGGUaGlaaa@513A@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>The dissimilarities, <it>d</it><sub><it>ij </it></sub>can be based on standard Euclidean distances and scaled by a smoothing factor <it>&#963;</it><sub><it>i </it></sub>which is empirically determined:</p>
            <p>
               <display-formula id="M6">
                  <m:math name="1755-8794-1-8-i12" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>d</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mi>j</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mo>|</m:mo>
                                 <m:mo>|</m:mo>
                                 <m:msub>
                                    <m:mi>x</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>x</m:mi>
                                    <m:mi>j</m:mi>
                                 </m:msub>
                                 <m:mo>|</m:mo>
                                 <m:mo>|</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:msubsup>
                                    <m:mi>&#963;</m:mi>
                                    <m:mi>i</m:mi>
                                    <m:mn>2</m:mn>
                                 </m:msubsup>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemizaq2aaSbaaSqaaiabdMgaPjabdQgaQbqabaGccqGH9aqpjuaGdaWcaaqaaiabcYha8jabcYha8Hqabiab=Hha4naaBaaabaGaemyAaKgabeaacqGHsislcqWF4baEdaWgaaqaaiabdQgaQbqabaGaeiiFaWNaeiiFaWhabaGaeq4Wdm3aa0baaeaacqWGPbqAaeaacqaIYaGmaaaaaiabc6caUaaa@43F7@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>The low dimensional images <b>y</b><sub><it>i </it></sub>of the points are used to define a probabilistic density in the mapping space, as:</p>
            <p>
               <display-formula id="M7">
                  <m:math name="1755-8794-1-8-i13" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>q</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mi>j</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>exp</m:mi>
                                 <m:mo>&#8289;</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mo>|</m:mo>
                                 <m:mo>|</m:mo>
                                 <m:msub>
                                    <m:mi>y</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>y</m:mi>
                                    <m:mi>j</m:mi>
                                 </m:msub>
                                 <m:mo>|</m:mo>
                                 <m:msup>
                                    <m:mo>|</m:mo>
                                    <m:mn>2</m:mn>
                                 </m:msup>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:msub>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>k</m:mi>
                                          <m:mo>&#8800;</m:mo>
                                          <m:mi>i</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mrow>
                                       <m:mi>exp</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mo>|</m:mo>
                                       <m:mo>|</m:mo>
                                       <m:msub>
                                          <m:mi>y</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msub>
                                          <m:mi>y</m:mi>
                                          <m:mi>k</m:mi>
                                       </m:msub>
                                       <m:mo>|</m:mo>
                                       <m:msup>
                                          <m:mo>|</m:mo>
                                          <m:mn>2</m:mn>
                                       </m:msup>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyCae3aaSbaaSqaaiabdMgaPjabdQgaQbqabaGccqGH9aqpjuaGdaWcaaqaaiGbcwgaLjabcIha4jabcchaWjabcIcaOiabgkHiTiabcYha8jabcYha8Hqabiab=Lha5naaBaaabaGaemyAaKgabeaacqGHsislcqWF5bqEdaWgaaqaaiabdQgaQbqabaGaeiiFaWNaeiiFaW3aaWbaaeqabaGaeGOmaidaaiabcMcaPaqaamaaqababaGagiyzauMaeiiEaGNaeiiCaaNaeiikaGIaeyOeI0IaeiiFaWNaeiiFaWNae8xEaK3aaSbaaeaacqWGPbqAaeqaaiabgkHiTiab=Lha5naaBaaabaGaem4AaSgabeaacqGG8baFcqGG8baFdaahaaqabeaacqaIYaGmaaGaeiykaKcabaGaem4AaSMaeyiyIKRaemyAaKgabeGaeyyeIuoaaaGccqGGUaGlaaa@62DE@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>The aim of this SNE method is to match the above two distributions as close as possible. The Kullback-Leibler divergence, which is a measure of dissimilarity between two probabilities is used here as a cost function. This can be achieved by manipulating the coordinates <it>y</it><sub><it>i </it></sub>to minimise the cost:</p>
            <p>
               <display-formula id="M8">
                  <m:math name="1755-8794-1-8-i14" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>C</m:mi>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mi>i</m:mi>
                                 <m:mi>N</m:mi>
                              </m:munderover>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munderover>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mi>j</m:mi>
                                       <m:mi>N</m:mi>
                                    </m:munderover>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>p</m:mi>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mi>j</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mi>log</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>p</m:mi>
                                                <m:mrow>
                                                   <m:mi>i</m:mi>
                                                   <m:mi>j</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>q</m:mi>
                                                <m:mrow>
                                                   <m:mi>i</m:mi>
                                                   <m:mi>j</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mfrac>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mstyle>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mi>i</m:mi>
                                 <m:mi>N</m:mi>
                              </m:munderover>
                              <m:mrow>
                                 <m:mi>K</m:mi>
                                 <m:mi>L</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msub>
                                    <m:mi>P</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo>|</m:mo>
                                 <m:mo>|</m:mo>
                                 <m:msub>
                                    <m:mi>Q</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>.</m:mo>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4qamKaeyypa0ZaaabCaeaadaaeWbqaaiabdchaWnaaBaaaleaacqWGPbqAcqWGQbGAaeqaaOGagiiBaWMaei4Ba8Maei4zaCwcfa4aaSaaaeaacqWGWbaCdaWgaaqaaiabdMgaPjabdQgaQbqabaaabaGaemyCae3aaSbaaeaacqWGPbqAcqWGQbGAaeqaaaaaaSqaaiabdQgaQbqaaiabd6eaobqdcqGHris5aaWcbaGaemyAaKgabaGaemOta4eaniabggHiLdGccqGH9aqpdaaeWbqaaiabdUealjabdYeamjabcIcaOiabdcfaqnaaBaaaleaacqWGPbqAaeqaaOGaeiiFaWNaeiiFaWNaemyuae1aaSbaaSqaaiabdMgaPbqabaGccqGGPaqkcqGGUaGlaSqaaiabdMgaPbqaaiabd6eaobqdcqGHris5aaaa@5C75@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>The SNE model can be extended for multiple projections of a single object by using a mixture of densities, which produces a probabilistic density in the mapping space:</p>
            <p>
               <display-formula id="M9">
                  <m:math name="1755-8794-1-8-i15" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>q</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mi>j</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munder>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mi>b</m:mi>
                              </m:munder>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#960;</m:mi>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>i</m:mi>
                                          <m:mi>b</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:msub>
                              </m:mrow>
                           </m:mstyle>
                           <m:mstyle displaystyle="true">
                              <m:munder>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mi>c</m:mi>
                              </m:munder>
                              <m:mrow>
                                 <m:mfrac>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>&#960;</m:mi>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>j</m:mi>
                                                <m:mi>c</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mi>exp</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mo>|</m:mo>
                                       <m:mo>|</m:mo>
                                       <m:msub>
                                          <m:mi>y</m:mi>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>i</m:mi>
                                                <m:mi>b</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msub>
                                          <m:mi>y</m:mi>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>j</m:mi>
                                                <m:mi>c</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>|</m:mo>
                                       <m:msup>
                                          <m:mo>|</m:mo>
                                          <m:mn>2</m:mn>
                                       </m:msup>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mstyle displaystyle="true">
                                          <m:msub>
                                             <m:mo>&#8721;</m:mo>
                                             <m:mi>k</m:mi>
                                          </m:msub>
                                          <m:mrow>
                                             <m:mstyle displaystyle="true">
                                                <m:msub>
                                                   <m:mo>&#8721;</m:mo>
                                                   <m:mi>d</m:mi>
                                                </m:msub>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>&#960;</m:mi>
                                                      <m:mrow>
                                                         <m:msub>
                                                            <m:mi>k</m:mi>
                                                            <m:mi>d</m:mi>
                                                         </m:msub>
                                                      </m:mrow>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:mstyle>
                                             <m:mi>exp</m:mi>
                                             <m:mo>&#8289;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo>|</m:mo>
                                             <m:mo>|</m:mo>
                                             <m:msub>
                                                <m:mi>y</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>i</m:mi>
                                                      <m:mi>b</m:mi>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>y</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>k</m:mi>
                                                      <m:mi>d</m:mi>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo>|</m:mo>
                                             <m:msup>
                                                <m:mo>|</m:mo>
                                                <m:mn>2</m:mn>
                                             </m:msup>
                                          </m:mrow>
                                       </m:mstyle>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mfrac>
                              </m:mrow>
                           </m:mstyle>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyCae3aaSbaaSqaaiabdMgaPjabdQgaQbqabaGccqGH9aqpdaaeqbqaaiabec8aWnaaBaaaleaacqWGPbqAdaWgaaadbaGaemOyaigabeaaaSqabaaabaGaemOyaigabeqdcqGHris5aOWaaabuaKqbagaadaWcaaqaaiabec8aWnaaBaaabaGaemOAaO2aaSbaaeaacqWGJbWyaeqaaaqabaGagiyzauMaeiiEaGNaeiiCaaNaeiikaGIaeyOeI0IaeiiFaWNaeiiFaWNaemyEaK3aaSbaaeaacqWGPbqAdaWgaaqaaiabdkgaIbqabaaabeaacqGHsislcqWG5bqEdaWgaaqaaiabdQgaQnaaBaaabaGaem4yamgabeaaaeqaaiabcYha8jabcYha8naaCaaabeqaaiabikdaYaaacqGGPaqkaeaadaaeqaqaamaaqababaGaeqiWda3aaSbaaeaacqWGRbWAdaWgaaqaaiabdsgaKbqabaaabeaaaeaacqWGKbazaeqacqGHris5aiGbcwgaLjabcIha4jabcchaWjabcIcaOiabgkHiTiabcYha8jabcYha8jabdMha5naaBaaabaGaemyAaK2aaSbaaeaacqWGIbGyaeqaaaqabaGaeyOeI0IaemyEaK3aaSbaaeaacqWGRbWAdaWgaaqaaiabdsgaKbqabaaabeaacqGG8baFcqGG8baFdaahaaqabeaacqaIYaGmaaaabaGaem4AaSgabeGaeyyeIuoacqGGPaqkaaaaleaacqWGJbWyaeqaniabggHiLdGccqGGUaGlaaa@7DA0@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>The number of clusters in the mixture also needs to be determined empirically. Each data point <it>x</it><sub><it>i </it></sub>can be projected to many locations of the mixtures <inline-formula><m:math name="1755-8794-1-8-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>y</m:mi><m:mrow><m:msub><m:mi>i</m:mi><m:mi>b</m:mi></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyEaK3aaSbaaSqaaiabdMgaPnaaBaaameaacqWGIbGyaeqaaaWcbeaaaaa@305C@</m:annotation></m:semantics></m:math></inline-formula> or <inline-formula><m:math name="1755-8794-1-8-i17" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>y</m:mi><m:mrow><m:msub><m:mi>j</m:mi><m:mi>c</m:mi></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyEaK3aaSbaaSqaaiabdQgaQnaaBaaameaacqWGJbWyaeqaaaWcbeaaaaa@3060@</m:annotation></m:semantics></m:math></inline-formula>. However, for the experiments in this paper, only one projection per data will be used. The main advantage of SNE is its probabilistic approach but the results of the SNE are strongly dependent on the chosen <it>&#963;</it>. If the chosen <it>&#963; </it>is too large, the projecting data is likely to collapse to a single point. The suggested <it>&#963; </it>is <it>&#963; </it>= log(<it>K</it>), where <it>K </it>is the number of neighbours used to define a local cluster. To be consistent with the LLE method, which used <it>K </it>= 5, 20, we therefore choose <it>&#963; </it>= log(5), log(20) in the experiments in this paper as representative examples.</p>
         </sec>
         <sec>
            <st>
               <p>Classifier</p>
            </st>
            <p>Purely for comparison to previous works in this area, we additionally superimpose the results from a discriminative classifier on the projective maps. Classifiers are created to determine the performance of the discrimination patients into good or poor prognosis from each topographic projection images by trying to compare the performance using different techniques and gene lists. The classifiers are built on the two dimensional input of the visualisation space using a separate RBF nonlinear classifier. The output, instead of crisply divided the results into good or poor prognosis patients produces an analogue value indicating the likelihood of good prognosis typically varying from 0 to 1. The classifier uses 2 coordinate input values and produces 2 output values indicating good and poor prognosis likelihood respectively and is trained using the original 78 patients only as a training set. Specifically, the desired target value is <b>T </b>= {<it>T</it><sub>1</sub>, <it>T</it><sub>2</sub>} where <b>T </b>&#8712; {[1, 0], [0, 1]} represents good and poor prognosis patients respectively. The two dimensional output of the RBF network is: <b>y</b><sub><it>i </it></sub>= <b>W &#934; </b>(<b>x</b><sub><it>i</it></sub>) where the basis functions constituting <b>&#934; </b>are selected using cross validation.</p>
            <p>The outputs of the RBF network are then transformed using the softmax function, giving a vector prognosis indicator for each patient <inline-formula><m:math name="1755-8794-1-8-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>P</m:mi><m:mo>=</m:mo><m:mfrac><m:mrow><m:mi>exp</m:mi><m:mo>&#8289;</m:mo><m:mo stretchy="false">(</m:mo><m:mi>y</m:mi><m:mo stretchy="false">)</m:mo></m:mrow><m:mrow><m:mstyle displaystyle="true"><m:msub><m:mo>&#8721;</m:mo><m:mi>j</m:mi></m:msub><m:mrow><m:mi>exp</m:mi><m:mo>&#8289;</m:mo><m:mo stretchy="false">(</m:mo><m:msub><m:mi>y</m:mi><m:mi>j</m:mi></m:msub><m:mo stretchy="false">)</m:mo></m:mrow></m:mstyle></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiuaaLaeyypa0tcfa4aaSaaaeaacyGGLbqzcqGG4baEcqGGWbaCcqGGOaakieqacqWF5bqEcqGGPaqkaeaadaaeqaqaaiGbcwgaLjabcIha4jabcchaWjabcIcaOiabdMha5naaBaaabaGaemOAaOgabeaacqGGPaqkaeaacqWGQbGAaeqacqGHris5aaaaaaa@4210@</m:annotation></m:semantics></m:math></inline-formula>. One of the two scales outputs which represents the good prognosis class is used as an indicator and contours of the indicator values are superimposed on the projection map space to show the likelihood of the good prognosis indicators.</p>
            <p>Patients with predicted prognosis values in the range 0.3 &#8594; 0.7 are considered as ambiguously classified. Removing these 'low-confidence' patients from measures of classification performance figures are likely to improve classification rates since the low confidence patients fall in the overlap between the good and poor prognosis groups as will be seen in the latter results.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>In this section we apply three different projective embeddings which are described in Methods, applied to the two PGLs of 70 genes per patient: List A, the van't Veer choice of genes, and List B, an alternative choice of genes chosen for consistency, as described in Methods.</p>
         <sec>
            <st>
               <p>NeuroScale Projection</p>
            </st>
            <p>Figure <figr fid="F1">1</figr> is the result from a 2-dimensional NeuroScale projection using List A and Figure <figr fid="F2">2</figr> is the result using List B. The group of poor-prognosis and good-prognosis patients are labelled differently, with black diamonds and grey circles respectively. The results similarly show some separation between the two groups of patients with a few patients wrongly mapped into the opposite class. This projection is using each patient data point as a separate centre in the RBF model internal to NeuroScale. However, to emphasize, no class information is used to construct the projection map. The different symbols are simply to allow easier identification of the two patient groups. This projection appears to support the previous result of van't Veer et. al. that List A appears to have some discriminatory capability, although it is evident from these figures that any discrimination is on a graded and overlapping scale rather than providing separable distributions.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>The NeuroScale results using List A</p>
               </caption>
               <text>
                  <p><b>The NeuroScale results using List A</b>. NeuroScale map of gene List A. Note the approximate separation of the centroid between poor (diamonds) and good (circles) prognosis groups. Specific individual patients are highlighted with arrows.</p>
               </text>
               <graphic file="1755-8794-1-8-1"/>
            </fig>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>The NeuroScale results using List B</p>
               </caption>
               <text>
                  <p><b>The NeuroScale results using List B</b>. NeuroScale map of gene List B. Note the approximate separation of the centroid between poor (diamonds) nd good (circles) prognosis groups. Specific individual patients are highlighted with arrows.</p>
               </text>
               <graphic file="1755-8794-1-8-2"/>
            </fig>
            <p>Figures <figr fid="F1">1</figr> and <figr fid="F2">2</figr>, also show the classification model contour lines superimposed on the projection map of NeuroScale. The prognosis indicators vary on the level of overlap between two prognosis groups. The areas where there is large overlap between the two patient groups reflects ambiguity of any likely class membership. Therefore, we regard these patients as low confidence samples as far as determining class information and we regard them as 'unclassifiable'.</p>
            <p>Table <tblr tid="T1">1</tblr> shows the classification result of the NeuroScale projection using List A with 0.5 prognosis indicator as a threshold boundary between good and poor prognosis signatures of all patients. The overall classification rate is 83.33%, while if only high confidence patients are used for consideration the classification result increases to 100% accuracy, the result shown in Table <tblr tid="T2">2</tblr>. However, there are only 30 patients that fall in the high confidence regions, less than one half of all patients. Similarly the results using List B, shown in Tables <tblr tid="T3">3</tblr> and <tblr tid="T4">4</tblr> gave the same classification rate as using the list A but with 4 fewer high confidence patients. The results of the projection maps and the classifications support the previous propositions in the literature regarding the lack of uniqueness of a single PGL.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>The misclassification matrix from the NeuroScale projection using List A. The classification is performed using the original 78 patients with 0.5 prognosis indicator as a threshold boundary. The classification rate is 83.33%.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Predict Good</p>
                     </c>
                     <c ca="center">
                        <p>Predict Poor</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Good</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Poor</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>38</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>The misclassification matrix from the NeuroScale projection using List A with only high confidence patients. The classification is performed using only 30 high confidence patients whose indicators are either above 0.7 or below 0.3. The classification rate is 100%.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Predict Good</p>
                     </c>
                     <c ca="center">
                        <p>Predict Poor</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Good</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Poor</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>The misclassification matrix from the NeuroScale projection using List B. The classification is performed using the original 78 patients with 0.5 prognosis indicator as a threshold boundary. The classification rate is 83.33%.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Predict Good</p>
                     </c>
                     <c ca="center">
                        <p>Predict Poor</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Good</p>
                     </c>
                     <c ca="center">
                        <p>28</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Poor</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>37</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>The misclassification matrix from the NeuroScale projection using List B with only high confidence patients. The classification is performed using only 26 high confidence patients whose indicators are either above 0.7 or below 0.3. The classification rate is 100%.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Predict Good</p>
                     </c>
                     <c ca="center">
                        <p>Predict Poor</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Good</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Poor</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Locally Linear Embedding</p>
            </st>
            <p>The Locally Linear Embedding results using two different gene sets are projected down and shown in Figures <figr fid="F3">3</figr> and <figr fid="F4">4</figr> using <it>K </it>= 5 and Figures <figr fid="F5">5</figr> and <figr fid="F6">6</figr> using <it>K </it>= 20 together with the classification contour lines of the good prognosis indicator. <it>K </it>is the number of neighbours used to construct the mapping which has to be chosen empirically. With <it>K </it>= 5, within a good prognosis cluster of both gene lists, there are four obvious poor prognosis patients. Three of them are common across both List projections. These four patients remain in the wrong place even after the number of neighbours increases. Between 13 and 16 poor prognosis patients are likely to be misclassified as good prognosis patients in the 'boundary layer'.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>The LLE results with <it>K </it>= 5 using List A</p>
               </caption>
               <text>
                  <p><b>The LLE results with <it>K </it>= 5 using List A</b>. Projection result of the LLE method using <it>K </it>= 5 List A, the van't Veer list: the comparison list. Selected specific patients identified by arrows.</p>
               </text>
               <graphic file="1755-8794-1-8-3"/>
            </fig>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>The LLE results with <it>K </it>= 5 using List B</p>
               </caption>
               <text>
                  <p><b>The LLE results with <it>K </it>= 5 using List B</b>. Projection result of the LLE method using <it>K </it>= 5, List B: the comparison list. Selected specific patients identified by arrows.</p>
               </text>
               <graphic file="1755-8794-1-8-4"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>The LLE results with <it>K </it>= 20 using List A</p>
               </caption>
               <text>
                  <p><b>The LLE results with <it>K </it>= 20 using List A</b>. Projection result of the LLE method using <it>K </it>= 20, List A. Selected specific patients identified by arrows.</p>
               </text>
               <graphic file="1755-8794-1-8-5"/>
            </fig>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>The LLE results with <it>K </it>= 20 using List B</p>
               </caption>
               <text>
                  <p><b>The LLE results with <it>K </it>= 20 using List B</b>. Projection result of the LLE method using <it>K </it>= 20, List B. Selected specific patients identified by arrows.</p>
               </text>
               <graphic file="1755-8794-1-8-6"/>
            </fig>
            <p>The best representation seems to be <it>K </it>= 20 with List A giving slightly better separation with fewer patients misclassified, with 7 good prognosis and 4 poor prognosis patients likely to be misclassified, from inspection of the figures without the classification results. However, some regions can be classified better when the classifier are trained on the particular data set. For example, few good prognosis patients in Figure <figr fid="F5">5</figr> are on the right of the projection while most of the good prognosis patients are supposed to be on the left side. Those few patients create the region where patients are likely to be good prognosis even though this could be the result of these few outliers.</p>
            <p>Both List projections have a separability of the modes of the two groups even though some patients appear in the wrong relative positions for their prognosis groups. Nevertheless, the difficulty for LLE is choosing the appropriate value for <it>K</it>. The result shows better separation of the training data with <it>K </it>= 20. For Figure <figr fid="F5">5</figr>, poor prognosis patients <it>P</it>45, <it>P</it>55, <it>P</it>54 are isolated from the other patients. However, having these three patients correctly classified could result in poor generalisation across new data. Other than this, the LLE projections reflect some similarities to the NeuroScale projections.</p>
            <p>Similar to the NeuroScale classification results, the classification results of LLE are provided in misclassification matrices showing results for all the patients and also using only high confidence patients, with different choices for the number of neighbours, <it>K</it>. Tables <tblr tid="T5">5</tblr> and <tblr tid="T6">6</tblr> show the classification results of LLE with <it>K </it>= 5 using List A with classification results on all patients and high confidence patients respectively. Similarly for list B, the classification results for <it>K </it>= 5 are shown in Tables <tblr tid="T7">7</tblr> and <tblr tid="T8">8</tblr>. Visually, List B gives a more distinct projection than List A with more clusters of good prognosis patients separated without overlap of many poor prognosis patients. The classification results confirm this. When only high confidence patients are retained, no patients are misclassified using either gene list although the number of high confidence patients using List B is more than using List A by 8 patients.</p>
            <tbl id="T5">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>The misclassification matrix from the LLE projection using List A with <it>K </it>= 5. The classification is performed using the original 78 patients with 0.5 prognosis indicator as a threshold boundary. The classification rate is 79.49%.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Predict Good</p>
                     </c>
                     <c ca="center">
                        <p>Predict Poor</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Good</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Poor</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <tbl id="T6">
               <title>
                  <p>Table 6</p>
               </title>
               <caption>
                  <p>The misclassification matrix from the LLE projection using List A with <it>K </it>= 5 using only high confidence patients. The classification is performed using only 22 high confidence patients whose indicators are either above 0.7 or below 0.3. The classification rate is 100%.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Predict Good</p>
                     </c>
                     <c ca="center">
                        <p>Predict Poor</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Good</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Poor</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <tbl id="T7">
               <title>
                  <p>Table 7</p>
               </title>
               <caption>
                  <p>The misclassification matrix from the LLE projection using List B with <it>K </it>= 5. The classification is performed using the original 78 patients with 0.5 prognosis indicator as a threshold boundary. The classification rate is 87.18%.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Predict Good</p>
                     </c>
                     <c ca="center">
                        <p>Predict Poor</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Good</p>
                     </c>
                     <c ca="center">
                        <p>31</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Poor</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>37</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <tbl id="T8">
               <title>
                  <p>Table 8</p>
               </title>
               <caption>
                  <p>The misclassification matrix from the LLE projection using List A with <it>K </it>= 5 using only high confidence patients. The classification is performed using 29 high confidence patients whose indicators are either above 0.7 or below 0.3. Again a perfect classification rate is achieve.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Predict Good</p>
                     </c>
                     <c ca="center">
                        <p>Predict Poor</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Good</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Poor</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>21</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>For <it>K </it>= 20, the classification results are shown in Tables <tblr tid="T9">9</tblr> and <tblr tid="T10">10</tblr> for list A and Tables <tblr tid="T11">11</tblr> and <tblr tid="T12">12</tblr> for list B. Contrary to the <it>K </it>= 5 case, the results using List A with <it>K </it>= 20 gives better classification performance. The classification rate is quite high with 93.58% accuracy with larger numbers of high confidence patients compared to the other methods, but this could result from the overfitting of this particular model. As can be seen in Figure <figr fid="F5">5</figr>, the gap of the contours between 0.4 to 0.7 is quite narrow. Choosing the exact boundary that determines the prognosis signature of each patient is therefore critical. As a result, if patient values contain uncertain information or noisy data, the resulting classification outcome of such patients is likely to be effectively random. Therefore the data of such uncertain patients should not be taken into account in representing performance results. We investigate generalisation of these results later in the paper.</p>
            <tbl id="T9">
               <title>
                  <p>Table 9</p>
               </title>
               <caption>
                  <p>The misclassification matrix from the LLE projection using List A with <it>K </it>= 20. The classification is performed using the original 78 patients with 0.5 prognosis indicator as a threshold boundary. The classification rate is 93.58%.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Predict Good</p>
                     </c>
                     <c ca="center">
                        <p>Predict Poor</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Good</p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Poor</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>40</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <tbl id="T10">
               <title>
                  <p>Table 10</p>
               </title>
               <caption>
                  <p>The misclassification matrix from the LLE projection using List A with <it>K </it>= 20 using only high confidence patients. The classification is performed using 41 high confidence patients whose indicators are either above 0.7 or below 0.3. The classification rate is 100%.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Predict Good</p>
                     </c>
                     <c ca="center">
                        <p>Piredict Poor</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Good</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Poor</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <tbl id="T11">
               <title>
                  <p>Table 11</p>
               </title>
               <caption>
                  <p>The misclassification matrix from the LLE projection using List B with <it>K </it>= 20. The classification is performed using the original 78 patients with 0.5 prognosis indicator as a threshold boundary. The classification rate is 84.62%.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Predict Good</p>
                     </c>
                     <c ca="center">
                        <p>Predict Poor</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Good</p>
                     </c>
                     <c ca="center">
                        <p>28</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Poor</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>39</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <tbl id="T12">
               <title>
                  <p>Table 12</p>
               </title>
               <caption>
                  <p>The misclassification matrix from the LLE projection using List B with <it>K </it>= 20 using only high confidence patients. The classification is performed using 28 high confidence patients whose indicators are either above 0.7 or below 0.3. The classification rate is 100%.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Predict Good</p>
                     </c>
                     <c ca="center">
                        <p>Predict Poor</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Good</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Poor</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Stochastic Neighbor Embedding</p>
            </st>
            <p>The projection maps of Stochastic Neighbor Embedding reflect different results of qualitative projections for <it>&#963; </it>= log(5) shown in Figures <figr fid="F7">7</figr> and <figr fid="F8">8</figr>, and <it>&#963; </it>= log(20) shown in Figures <figr fid="F9">9</figr> and <figr fid="F10">10</figr>. From these figures it can be seen that the relative distributions of the patient projections are quite different for differing choices of the value of <it>&#963; </it>and this value is quite hard to determine. With <it>&#963; </it>= log(20), patients from both gene groups are mostly overlapped. The separation is not as good as in the previous two models.</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>The SNE results with <it>&#963; </it>= <it>log</it>(5) using List A</p>
               </caption>
               <text>
                  <p><b>The SNE results with <it>&#963; </it>= <it>log</it>(5) using List A</b>. The Stochastic Neighbor Embedding projections of List A with <it>&#963; </it>= log(5).</p>
               </text>
               <graphic file="1755-8794-1-8-7"/>
            </fig>
            <fig id="F8">
               <title>
                  <p>Figure 8</p>
               </title>
               <caption>
                  <p>The SNE results with <it>&#963; </it>= <it>log</it>(5) using List B</p>
               </caption>
               <text>
                  <p><b>The SNE results with <it>&#963; </it>= <it>log</it>(5) using List B</b>. The Stochastic Neighbor Embedding projections of List B with <it>&#963; </it>= log(5).</p>
               </text>
               <graphic file="1755-8794-1-8-8"/>
            </fig>
            <fig id="F9">
               <title>
                  <p>Figure 9</p>
               </title>
               <caption>
                  <p>The SNE results with <it>&#963; </it>= <it>log</it>(20) using List A</p>
               </caption>
               <text>
                  <p><b>The SNE results with <it>&#963; </it>= <it>log</it>(20) using List A</b>. The Stochastic Neighbor Embedding projections of List A with <it>&#963; </it>= log(20).</p>
               </text>
               <graphic file="1755-8794-1-8-9"/>
            </fig>
            <fig id="F10">
               <title>
                  <p>Figure 10</p>
               </title>
               <caption>
                  <p>The SNE results with <it>&#963; </it>= <it>log</it>(20) using List B</p>
               </caption>
               <text>
                  <p><b>The SNE results with <it>&#963; </it>= <it>log</it>(20) using List B</b>. The Stochastic Neighbor Embedding projections of List B with <it>&#963; </it>= log(20).</p>
               </text>
               <graphic file="1755-8794-1-8-10"/>
            </fig>
            <p>Figures <figr fid="F7">7</figr> and <figr fid="F8">8</figr>, show the classification contour lines superimposed on the SNE projection maps using the two different gene Lists with <it>&#963; </it>= log(5), and Figures <figr fid="F9">9</figr> and <figr fid="F10">10</figr> with <it>&#963; </it>= log(20).</p>
            <p>Tables <tblr tid="T13">13</tblr> and <tblr tid="T14">14</tblr> show the classification results of the SNE with <it>&#963; </it>= log(5) using List A, with classification results shown on all patients and also those high confidence selected patients respectively. For List B, the classification results are shown in Tables <tblr tid="T15">15</tblr> and <tblr tid="T16">16</tblr>. With <it>&#963; </it>= log(5), List B gives better overall performance but when only high confidence patients are being measured, no patients are misclassified with almost the same number of high confidence patients using both gene lists. Again, this supports the proposition that equivalent performance can be obtained on dissimilar gene lists.</p>
            <tbl id="T13">
               <title>
                  <p>Table 13</p>
               </title>
               <caption>
                  <p>The misclassification matrix from the SNE projection using List A with <it>&#963; </it>= log(5). The classification is performed using the original 78 patients with 0.5 prognosis indicator as a threshold boundary. The classification rate is 79.49%.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Predict Good</p>
                     </c>
                     <c ca="center">
                        <p>Predict Poor</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Good</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Poor</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <tbl id="T14">
               <title>
                  <p>Table 14</p>
               </title>
               <caption>
                  <p>The misclassification matrix from the SNE projection using List A with <it>&#963; </it>= log(5) using only high confidence patients. The classification is performed using 16 high confidence patients whose indicators are either above 0.7 or below 0.3. The classification rate is 100%.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Predict Good</p>
                     </c>
                     <c ca="center">
                        <p>Predict Poor</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Actual Good</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                    