<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-7-74</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Spectral embedding finds meaningful (relevant) structure in image and microarray data</p>
         </title>
         <aug>
            <au id="A1" ca="yes" ce="yes">
               <snm>Higgs</snm>
               <mi>W</mi>
               <fnm>Brandon</fnm>
               <insr iid="I1"/>
               <email>bhiggs@gmu.edu</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Weller</snm>
               <fnm>Jennifer</fnm>
               <insr iid="I1"/>
               <email>jweller@gmu.edu</email>
            </au>
            <au id="A3" ce="yes">
               <snm>Solka</snm>
               <mi>L</mi>
               <fnm>Jeffrey</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>jlsolka@gmail.com</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>School of Computational Sciences, George Mason University, Manassas, VA 20110, USA</p>
            </ins>
            <ins id="I2">
               <p>Naval Surface Warfare Center, Code B10, Dahlgren, VA 22448-5000, USA</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>74</fpage>
         <url>http://www.biomedcentral.com/1471-2105/7/74</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16483359</pubid>
               <pubid idtype="doi">10.1186/1471-2105-7-74</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>07</day>
               <month>9</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>16</day>
               <month>2</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>16</day>
               <month>2</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Higgs et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p/>
               <p>Accurate methods for extraction of meaningful patterns in high dimensional data have become increasingly important with the recent generation of data types containing measurements across thousands of variables. Principal components analysis (PCA) is a linear dimensionality reduction (DR) method that is unsupervised in that it relies only on the data; projections are calculated in Euclidean or a similar linear space and do not use tuning parameters for optimizing the fit to the data. However, relationships within sets of nonlinear data types, such as biological networks or images, are frequently mis-rendered into a low dimensional space by linear methods. Nonlinear methods, in contrast, attempt to model important aspects of the underlying data structure, often requiring parameter(s) fitting to the data type of interest. In many cases, the optimal parameter values vary when different classification algorithms are applied on the same rendered subspace, making the results of such methods highly dependent upon the type of classifier implemented.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We present the results of applying the spectral method of Lafon, a nonlinear DR method based on the weighted graph Laplacian, that minimizes the requirements for such parameter optimization for two biological data types. We demonstrate that it is successful in determining implicit ordering of brain slice image data and in classifying separate species in microarray data, as compared to two conventional linear methods and three nonlinear methods (one of which is an alternative spectral method). This spectral implementation is shown to provide more meaningful information, by preserving important relationships, than the methods of DR presented for comparison.</p>
               <p>Tuning parameter fitting is simple and is a general, rather than data type or experiment specific approach, for the two datasets analyzed here. Tuning parameter optimization is minimized in the DR step to each subsequent classification method, enabling the possibility of valid cross-experiment comparisons.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Results from the spectral method presented here exhibit the desirable properties of preserving meaningful nonlinear relationships in lower dimensional space and requiring minimal parameter fitting, providing a useful algorithm for purposes of visualization and classification across diverse datasets, a common challenge in systems biology.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Dimensionality reduction (DR) is the process of rendering high dimensional complex data in a low dimensional space. Provided the process is calculated accurately, this low dimensional representation is preferred for use in inference and summarization for multiple reasons, among which are ease of visualization in a reduced variable space and clarity (i.e. ready interpretation) of clustering or classification. Other benefits include the insights into the data structure that can be obtained from the projected axes and the obvious denoising effect attained in some types of DR. Reduction strategies often rely on linear approaches defined by a method that represents <it>x</it><sub>1</sub>, ..., <it>x</it><sub><it>n </it></sub>&#8712; &#8477;<sup><it>q </it></sup>as <m:math name="1471-2105-7-74-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>x</m:mi><m:mo>^</m:mo></m:mover><m:mn>1</m:mn></m:msub><m:mo>,</m:mo><m:mo>&#8230;</m:mo><m:mo>,</m:mo><m:msub><m:mover accent="true"><m:mi>x</m:mi><m:mo>^</m:mo></m:mover><m:mi>n</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG4baEgaqcamaaBaaaleaacqaIXaqmaeqaaOGaeiilaWIaeSOjGSKaeiilaWIafmiEaGNbaKaadaWgaaWcbaGaemOBa4gabeaaaaa@3557@</m:annotation></m:semantics></m:math> in such a way that each <m:math name="1471-2105-7-74-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>x</m:mi><m:mo>^</m:mo></m:mover><m:mi>i</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWG4baEgaqcamaaBaaaleaacqWGPbqAaeqaaaaa@2FBC@</m:annotation></m:semantics></m:math> is obtained by projecting <it>x</it><sub><it>i </it></sub>into a common linear subspace of &#8477;<sup><it>q</it></sup>. Commonly used methods on data types relevant to bioinformatics include principal components analysis (PCA) <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and classical multidimensional scaling (MDS) <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, which calculate linear projections of the data; clearly such projections are unsuitable for nonlinear or curved surfaces.</p>
         <p>These methods generally are based on minimization of a global cost function, wherein large distances can drive the embedding, minimizing the effect of local distance structures <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Where local data structures are not best summarized linearly (yet important to the interpretation of the experimental results), nonlinear methods that are kernel-based (e.g. kernel PCA) <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> and graph theoretic like spectral embedding <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp> can be more appropriate. These methods attempt to model the underlying manifold by fitting a kernel parameter to optimize performance (e.g. as assessed by some performance accuracy metric) <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Unfortunately it is usually necessary to re-fit one or more tuning parameter(s) to each different data type or experiment set, making it difficult to propose a more generalized method across multiple data types. It is also difficult to avoid over-fitting the model to the data in this scenario. In addition, when attempting to determine class structure in the low dimensional space calculated from these nonlinear approaches, different classifiers may require separate spatial representations in order to appropriately partition the classes (e.g. quadratic discriminant analysis (QDA) compared to linear discriminant analysis (LDA)). Such parameter(s) modifications are optimized with a specific range of values that can be different for each classifier.</p>
         <p>Two examples of high dimensional data types that fall into this nonlinear domain include DNA microarrays and image data. Microarrays contain the simultaneous measurement for thousands of mRNA transcripts <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>, which can be viewed as <it>n </it>arrays with <it>q </it>dimensions (where <it>n</it>&lt;&lt;<it>q</it>). Many of the biological processes (feedback loops, oscillators, and repressilators) represented by measurements generated with microarrays are nonlinear, providing a great challenge in expressing associations between biological entities in a linear domain. Nilsson <it>et al</it>. demonstrated the importance of this concept in their comparison of MDS with a nonlinear algorithm, isometric feature mapping (ISOMAP) <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. ISOMAP uses nonlinear distances as estimated in the ambient space along with a linear MDS to a Euclidean target projection space <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. This nonlinear method was shown to render more robust partitioning of disease class structure on the low dimensional manifold, when class membership predictions were evaluated against those obtained from linear projections from MDS.</p>
         <p>Images are another data type that can be represented in <it>q </it>dimensions as well, where each image <it>n </it>is a vector. This data type can exhibit comparable complexities to the microarray example, particularly when imaging tissues and organs. The Euclidean distance between two similar images is seldom the optimal comparison criterion. Simple variations on the main image features, such as those caused by registration issues (rotation and shifting), can alter the pixel alignment, thereby modifying the definition of distance between the original image and the rotated one, and distorting the apparent relationship. The ideal method for DR should be capable of extracting meaningful patterns in multiple data types (such as these mentioned), should not be confined to a linear domain, and should exhibit tuning parameter-fitting independence to minimize parameter optimization specific to each example and classification method.</p>
         <p>Given this goal, we examined the performance of a spectral method presented by Lafon <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp> and have shown that it is successful in extracting meaningful structure in these two disparate data types, both having high dimensionality paired with low replication, with a method for calculating the tuning parameter that does not have to be varied across classifiers to achieve correct results. Previous work by Lafon has demonstrated how ordered structure from both helix and trefoils curves in &#8477;<sup>3 </sup>can be accurately preserved in the embedded space (&#8477;<sup>2</sup>) with a spectral method <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. We extend this work to address biological examples of higher dimensionality, where accuracy in embedded results is evaluated using a known ordering and classification structure. In a more global sense, we demonstrate that the spectral method is able to preserve the implicit ordering within biological image data and can accurately classify different taxonomic species within microarray data. These results are compared to two linear approaches (PCA with either correlation or covariance distance metrics), one nonlinear counterpart to classical MDS (i.e. nonmetric MDS), and two similar nonlinear approaches (kernel PCA with a Gaussian radial basis function kernel; weighted graph Laplacian as presented by Ng <it>et al</it>. <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>), for the latter two of which numerous variations are often promoted in the mathematical/statistical literature for their successful application to a number of nonlinear data types <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. We demonstrate that for our datasets, the spectral approach presented here is not dependent upon tuning parameter(s) optimization to allow success across any of three separate classifiers chosen. This is a considerable advantage to an investigator who needs to make cross-experiment or multi-data type comparisons that benefit from a tuning parameter-independent nonlinear DR approach.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Image dataset</p>
            </st>
            <p>The image dataset was used to test the ability of each of the projection methods to predict the correct image ordering, based on the size increment of the brain. Since the largest source of variability separating each image in the series is the increase in feature surface area, as a result of the head size, only the first eigenfunction for each method was used in the comparison. This calculation reduces the dimensionality from &#8477;<sup>16,384 </sup>to &#8477;<sup>1</sup>. To assess the accuracy of each method, a non-parametric measure of association (Spearman's rho coefficient) was used, by which the scores from the primary eigenfunction were ranked and correlated against the correct ordering. A straight-line fit is indicative of perfect image ordering.</p>
            <p>Both the kernel PCA and the spectral method from Ng <it>et al</it>. require fitting for the smoothing epsilon term to optimize performance for the dataset. The results for this parameter optimization are provided in Figure <figr fid="F1">1</figr>. The maximum rho coefficient possible indicates the appropriate value for this epsilon term for each method. Based upon the observation that neither line reaches a maximum value of 1, it is apparent that neither method is capable of determining the correct ordering of all of the images.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Parameter optimization plot for image example</p>
               </caption>
               <text>
                  <p><b>Parameter optimization plot for image example</b>. Regression coefficients for image ordering determined by different epsilon values for kernel PCA and the spectral method from Ng <it>et al</it>. Epsilon values were increased to 300,000 to assess image ordering accuracy (data not shown), but truncated for the plot to better visualize the global maxima. The dashed black horizontal line indicates a rho statistic value of 1, though neither method reaches this threshold. Large fluctuations in the rho statistic are observed for both methods at minimal values of epsilon. For kernel PCA a non-optimal solution is determined in the variable region, while for the spectral method from Ng <it>et al</it>., a maximum is determined in this region. The variability in the rho values at these minimal values can be associated with the optimal convergence of remote and local distances in the weight matrices (Figure 6) of kernel PCA and the spectral method from Ng <it>et al </it>(<it>L</it>). Small values of the epsilon parameter provide minimal convergence of the <it>L </it>matrix distribution tails (very small distances and very large distance), which is optimal for the spectral method from Ng <it>et al </it>for this example. For kernel PCA, larger values of the epsilon parameter provide convergence of large distances and greater convergence of small distances in the Gaussian radial basis function kernel matrix, which is determined to be optimal for this example.</p>
               </text>
               <graphic file="1471-2105-7-74-1"/>
            </fig>
            <p>The remaining methods: PCA-correlation, PCA-covariance, nonmetric MDS, and the spectral method from Lafon <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp> do not require parameter fitting that is dependent upon performance (as is necessary for the previous two nonlinear methods), so the images can be directly rendered into a low dimensional representation.</p>
            <p>Table <tblr tid="T1">1</tblr> and Figure <figr fid="F2">2</figr> illustrate the results for all of the projection methods. PCA-correlation has the lowest coefficient (rho = 0.902). The plot (Figure <figr fid="F2">2a</figr>) demonstrates that the global size change is minimal after image 20 and the subtler differences in shape are not picked up by this method. This method thus fails to predict the correct progression between the images after this point. Both PCA-covariance and nonmetric MDS (Figures <figr fid="F2">2b</figr> and <figr fid="F2">2c</figr>, respectively) show exactly the same ability to solve for the correct ordering of images (rho = 0.966). Since these two methods give the same results, it is implied that the ordering of dissimilarity values (as fit with classical MDS) and the <it>rank </it>of the ordering of dissimilarity values (as fit with nonmetric MDS) are identical, which implies that there is no benefit in using nonmetric MDS to recover the image ordering. These two methods fail to retain the ordering after image 22, although the deviations at this point are not as drastic as those observed for the PCA-correlation results. The two other nonlinear methods (Figures <figr fid="F2">2d</figr> and <figr fid="F2">2e</figr>), each of which requires parameter optimization, predict the image ordering more correctly than do the two linear methods, indicating that this dataset is not best summarized with linear methods. Where there are meaningful local relationships, or nonlinearities, that the linear methods fail to preserve in a low dimensional mapping, nonlinear methods will be a more appropriate analysis choice. However, neither kernel PCA nor the spectral method from Ng <it>et al</it>. accurately preserves the correct image ordering over the entire series (rho = 0.989 and rho = 0.980, respectively). In this study, only the spectral method from Lafon was able to correctly solve the implicit ordering of the complete set of images (Figure <figr fid="F2">2f</figr>). This spectral method shares the properties of nonlinearity with kernel PCA and the spectral method from Ng <it>et al</it>. (which is initially anchored on the transformation of the Euclidean distance to some form of Gaussian kernel), however, it does not require parameter fitting of the epsilon term in order to produce optimal performance. Instead, the minimum non-zero squared distance is calculated for the smoothing term (for this example, <it>&#949; </it>= min<sub><it>i</it>&#8800;<it>j </it></sub>||<it>x</it><sub><it>i </it></sub>- <it>x</it><sub><it>j</it></sub>||<sup>2 </sup>= 140,245), meaning that this method is dependent only upon the distribution of squared Euclidean distances. The image ordering as produced by both PCA-correlation and the spectral method from Lafon is provided in Figure <figr fid="F3">3</figr>.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Spearman rho values used for evaluation of CATSCAN image ordering</p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Method</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>rho</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>PCA (cor)</p>
                     </c>
                     <c ca="center">
                        <p>0.902</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>PCA (cov)</p>
                     </c>
                     <c ca="center">
                        <p>0.966</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Nonmetric MDS</p>
                     </c>
                     <c ca="center">
                        <p>0.966</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Kernel PCA (<it>&#949; </it>= 1,040)</p>
                     </c>
                     <c ca="center">
                        <p>0.989</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Spectral-Ng <it>et al</it>. (<it>&#949; </it>= 70)</p>
                     </c>
                     <c ca="center">
                        <p>0.980</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Spectral-Lafon</p>
                     </c>
                     <c ca="center">
                        <p>1.000</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Two linear methods and four nonlinear methods are considered. For both kernel PCA and the spectral method from Ng <it>et al</it>., the methods were optimized at the epsilon (&#949;) values provided.</p>
               </tblfn>
            </tbl>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Scatter plots of image ordering for six projection methods</p>
               </caption>
               <text>
                  <p><b>Scatter plots of image ordering for six projection methods</b>. <b>(a) </b>Regression of ranked eigenfunction values calculated by PCA-correlation vs. actual ordering. <b>(b) </b>Regression of ranked eigenfunction values calculated by PCA-covariance vs. actual ordering. Note that the scores have been reverse sorted for consistency with the other plots. <b>(c) </b>Regression of ranked eigenfunction values calculated by nonmetric MDS vs. actual ordering. <b>(d) </b>Regression of ranked eigenfunction values calculated by kernel PCA (<it>&#949; </it>= 1,040) vs. actual ordering. <b>(e) </b>Regression of ranked eigenfunction values calculated by the spectral method from Ng <it>et al</it>. (<it>&#949; </it>= 70) vs. actual ordering. <b>(f) </b>Regression of ranked eigenfunction values calculated by the spectral method from Lafon [3,4] vs. actual ordering.</p>
               </text>
               <graphic file="1471-2105-7-74-2"/>
            </fig>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Images ordered by the primary eigenfunction for PCA-correlation and the spectral method from Lafon [3,4]</p>
               </caption>
               <text>
                  <p><b>Images ordered by the primary eigenfunction for PCA-correlation and the spectral method from Lafon [3,4]</b>. <b>(a) </b>The ordering is not correctly preserved with PCA-correlation (in this series, moving from left to right and top to bottom). <b>(b) </b>The order is correctly preserved with the spectral method from Lafon [3,4] (in this series, moving from left to right and top to bottom).</p>
               </text>
               <graphic file="1471-2105-7-74-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Microarray dataset</p>
            </st>
            <p>The microarray dataset was used to evaluate the ability of the projection methods to accurately classify samples derived from three taxonomically separate species into their respective groups, without conducting any preliminary feature selection (a procedure usually conducted in order to better partition the groups). In each case, the results of a method were calculated such that dimensionality was reduced from &#8477;<sup>12,625 </sup>to &#8477;<sup>2</sup>. Thereafter three classifiers using leave-one-out cross-validation (LOO-CV) were calculated on this projected space, both to assess the classification accuracy for each method and to compare the differences in value of the optimal parameters for kernel PCA and the spectral method from Ng <it>et al</it>. A nonlinear classifier, <it>k</it>-nearest neighbors (KNN), was calculated, setting <it>k </it>= 2 and <it>k </it>= 3; two settings of <it>k </it>were used since each DR method renders the groupings differently, thereby favouring two nearest neighbors for some methods and three nearest neighbors for others. The average error rates were computed across 1,000 trials, to account for the variation arising from ties broken at random in the assignment of the nearest winning class which results in slightly different classification results for subsequent trials. In the event that the mean error rate is calculated to be greater than zero, even when one or more of the trials provide a smaller error rate, a range is denoted in the results (e.g. 0%&#8211;4.66%) to indicate that the occurrence of a lower error rate is possible. Another commonly utilized classifier, QDA, was calculated, as well as the linear classifier counterpart, LDA, to allow comparison of the dependence of the results of different classification methods on the tuning parameter fitting in the first DR step.</p>
            <p>As was done with the image data, optimal parameters were determined for kernel PCA and the spectral method from Ng <it>et al</it>. These values were evaluated with each classification algorithm separately. In determining the most appropriate epsilon value for optimal classification accuracy, the optimal value for the spectral method from Ng <it>et al</it>. varies according to the classifier used (Figure <figr fid="F4">4a</figr>). There is a small window at a value of <it>&#949; </it>~20,000 in which both QDA (blue line) and LDA (red line) reach respective minimum total classification error rates of 2.22% and 15.56%. However, the KNN classifiers for <it>k </it>= 2 (black line) and <it>k </it>= 3 (green line) require slightly higher epsilon values (21,000 indicated by a dashed vertical line) to reach their respective minimum total classification error rates (2.22%&#8211;13.93% and 8.88%&#8211;13.79%, respectively). It can be argued that increasing the trials of the KNN classifier might better adjust this minimum point in the two KNN curves, where it coincides with the QDA and LDA window for minimum error. However, without calculating error rates with three separate classifiers, but rather independently determining the value for a given classifier, this window would be unknown, in which case different optimal parameter values for each classifier would be suggested. For example, using LDA as a classifier to determine the minimum error rate, a value of <it>&#949; </it>= 20,000 can be chosen as the optimal parameter for the spectral method from Ng <it>et al</it>.; however, for a KNN (<it>k </it>= 3) classifier in the same example, this parameter value would not fall within the range of the minimum error rate. Instead, a value of <it>&#949; </it>= 20,000 would provide a local minimum error of 14.08%, as compared to the global minimum error of 13.79%.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Parameter optimization plot for microarray example</p>
               </caption>
               <text>
                  <p><b>Parameter optimization plot for microarray example</b>. Classification error rates for primary fibroblast cell lines between three separate species determined by different epsilon values for <b>(a) </b>the spectral method from Ng <it>et al</it>. and <b>(b) </b>kernel PCA. Epsilon values were increased to 5 &#215; 10<sup>9 </sup>to assess classification accuracy (data not shown), but truncated for the plot to better visualize the global minima for each classifier method. The spectral method from Lafon has a 0% error rate across all methods except LDA, where only one sample is misclassified.</p>
               </text>
               <graphic file="1471-2105-7-74-4"/>
            </fig>
            <p>The parameter optimization for kernel PCA shows similar trends to the optimal values from the spectral method from Ng <it>et al</it>. However, the differences in optimal parameter solutions between the KNN classifier and both QDA and LDA is much more pronounced with this DR method. For the QDA classifier, the epsilon value is optimized to provide a minimal error rate (0%) in the window of <it>&#949; </it>= 67,000 &#8211; 68,000, while for the LDA classifier, the epsilon value is optimized to provide a minimal error rate (2.22%) in the window of <it>&#949; </it>= 73,000 &#8211; 92,000. In addition, the epsilon values that provide the minimal KNN error rates for <it>k </it>= 2 (2.22%&#8211;6.79%) and <it>k </it>= 3 (8.56%) are at <it>&#949; </it>= 174,000 (indicated by a dashed vertical line in Figure <figr fid="F4">4</figr>) and <it>&#949; </it>= 57,000 (indicated by a solid vertical line in Figure <figr fid="F4">4</figr>), respectively. This result from kernel PCA is consistent with those obtained from the spectral methods of Ng <it>et al</it>., and demonstrates that each of these two nonlinear approaches have a dependence between the outcome of the classification algorithm and an appropriately optimized parameter. In addition, when comparing training set classification results to LOO-CV, the optimal parameter values are drastically different for both kernel PCA and the spectral method of Ng <it>et al</it>. More importantly, the parameter selection is completely dataset-dependent. Note that here the scale is vastly different from that seen in the image example, thus the optimal parameter is as well. For these examples we examined two very disparate data types, but the same conclusion of dataset-dependence would almost certainly occur if two microarray datasets were compared, since there would still be differences in distance distributions (the possibility of two microarray datasets having identical distributions of Euclidean distances is highly unlikely). In other work we have examined additional microarray datasets and examined functional sub categories as well as disease state, cited here for those who are interested <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>.</p>
            <p>All individual classification results are summarized in Table <tblr tid="T2">2</tblr>, and the two-dimensional projections for each method are shown (Figure <figr fid="F5">5</figr>). For the kernel PCA and spectral method from Ng <it>et al</it>., the projection plots were generated with the epsilon terms optimized for QDA (<it>&#949; </it>= 67,000 for kernel PCA and <it>&#949; </it>= 20,000 for the spectral method from Ng <it>et al</it>). From the total error rate results reported in Table <tblr tid="T2">2</tblr>, it is apparent that the nonlinear DR methods of kernel PCA and the spectral method from Lafon perform more accurately than do the two linear methods (and nonmetric MDS) across all three of the classification algorithms (though using KNN with <it>k </it>= 2, PCA-covariance and nonmetric MDS can achieve a minimal error rate of 2.22%, similar to kernel PCA). Of these three nonlinear DR methods tested (not including nonmetric MDS), the spectral method from Lafon <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp> outperforms both kernel PCA and the spectral method from Ng <it>et al</it>. Not only does the Lafon spectral method project the different species into well-partitioned groups (Figure <figr fid="F5">5f</figr>) for a 0% error rate across all classification algorithms (excepting a single misclassified sample with the LDA classifier), but the KNN classifier does not exhibit any deviation in classification results across 1,000 iterations, unlike the results obtained with any of the other methods. These results, in addition to the property that tuning parameter optimization is only dependent on the distribution of squared Euclidean distances (for this example <it>&#949; </it>= min<sub><it>i</it>&#8800;<it>j </it></sub>||<it>x</it><sub><it>i </it></sub>- <it>x</it><sub><it>j</it></sub>||<sup>2 </sup>= 1.29 &#215; 10<sup>9</sup>, indicates that the spectral method from Lafon has significant advantages in tuning parameter fitting as a nonlinear DR method by our two criteria.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Two-dimensional projections of the Affymetrix primary fibroblast cell lines, using six DR methods</p>
               </caption>
               <text>
                  <p><b>Two-dimensional projections of the Affymetrix primary fibroblast cell lines, using six DR methods</b>. For each plot, red 'x' symbols denote samples from bonobo (<it>Pan paniscus</it>), blue 'o' symbols denote samples from gorilla (<it>Gorilla gorilla</it>), and black '*' symbols denote samples from human (<it>Homo sapien</it>). <b>(a) </b>Two-dimensional projection calculated with PCA-correlation. <b>(b) </b>Two-dimensional projection calculated with PCA-covariance. <b>(c) </b>Two-dimensional projection calculated with nonmetric MDS. <b>(d) </b>Two-dimensional projection calculated with kernel PCA. Epsilon parameter was selected at optimal classification using QDA (<it>&#949; </it>= 67,000). <b>(e) </b>Two-dimensional projection calculated with the spectral method from Ng et al. Epsilon parameter was selected at optimal classification using QDA (<it>&#949; </it>= 20,000). <b>(f) </b>Two-dimensional projection calculated with the spectral method from Lafon.</p>
               </text>
               <graphic file="1471-2105-7-74-5"/>
            </fig>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>LOO-CV classification error rates for six DR methods and three classifiers (fibroblast data)</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>PCA (cor)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>PCA (cov)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Nonmetric MDS</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>kernel PCA</b>
                           <sup>&#8224;</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Spectral Ng <it>et al</it>.</b>
                           <sup>&#8225;</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Spectral Lafon</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>*<b>KNN (<it>k </it>= 2)</b></p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Pan paniscus</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>27.27%&#8211;41.02%</p>
                     </c>
                     <c ca="center">
                        <p>0%&#8211;13.71%</p>
                     </c>
                     <c ca="center">
                        <p>0%&#8211;13.69%</p>
                     </c>
                     <c ca="center">
                        <p>0%&#8211;9.13%</p>
                     </c>
                     <c ca="center">
                        <p>0%&#8211;9.02%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Gorilla gorilla</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>8.33%&#8211;29.39%</p>
                     </c>
                     <c ca="center">
                        <p>8.33%&#8211;12.61%</p>
                     </c>
                     <c ca="center">
                        <p>8.33%&#8211;12.42%</p>
                     </c>
                     <c ca="center">
                        <p>8.33%&#8211;12.49%</p>
                     </c>
                     <c ca="center">
                        <p>8.33%&#8211;29.33%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Homo sapiens</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>4.55%</p>
                     </c>
                     <c ca="center">
                        <p>0%&#8211;2.28%</p>
                     </c>
                     <c ca="center">
                        <p>0%&#8211;2.22%</p>
                     </c>
                     <c ca="center">
                        <p>0%&#8211;2.26%</p>
                     </c>
                     <c ca="center">
                        <p>0%&#8211;4.66%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="center">
                        <p>20.09%</p>
                     </c>
                     <c ca="center">
                        <p>2.22%&#8211;7.83%</p>
                     </c>
                     <c ca="center">
                        <p>2.22%&#8211;7.74%</p>
                     </c>
                     <c ca="center">
                        <p>2.22%&#8211;6.79%</p>
                     </c>
                     <c ca="center">
                        <p>2.22%&#8211;13.93%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>*<b>KNN (<it>k </it>= 3)</b></p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Pan paniscus</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>27.27%</p>
                     </c>
                     <c ca="center">
                        <p>18.18%</p>
                     </c>
                     <c ca="center">
                        <p>18.18%</p>
                     </c>
                     <c ca="center">
                        <p>9.09%</p>
                     </c>
                     <c ca="center">
                        <p>9.09%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Gorilla gorilla</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>8.33%</p>
                     </c>
                     <c ca="center">
                        <p>16.67%</p>
                     </c>
                     <c ca="center">
                        <p>16.67%</p>
                     </c>
                     <c ca="center">
                        <p>16.67%</p>
                     </c>
                     <c ca="center">
                        <p>25%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Homo sapiens</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>4.55%</p>
                     </c>
                     <c ca="center">
                        <p>4.55%</p>
                     </c>
                     <c ca="center">
                        <p>4.55%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>0%&#8211;3.04%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="center">
                        <p>11.11%</p>
                     </c>
                     <c ca="center">
                        <p>11.11%</p>
                     </c>
                     <c ca="center">
                        <p>11.11%</p>
                     </c>
                     <c ca="center">
                        <p>8.56%</p>
                     </c>
                     <c ca="center">
                        <p>8.88%&#8211;13.79%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>QDA</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Pan paniscus</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>72.73%</p>
                     </c>
                     <c ca="center">
                        <p>9.09%</p>
                     </c>
                     <c ca="center">
                        <p>9.09%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Gorilla gorilla</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>16.67%</p>
                     </c>
                     <c ca="center">
                        <p>8.33%</p>
                     </c>
                     <c ca="center">
                        <p>8.33%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>8.33%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Homo sapiens</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>4.55%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="center">
                        <p>24.44%</p>
                     </c>
                     <c ca="center">
                        <p>4.44%</p>
                     </c>
                     <c ca="center">
                        <p>4.44%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>2.22%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>LDA</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Pan paniscus</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>18.18%</p>
                     </c>
                     <c ca="center">
                        <p>9.09%</p>
                     </c>
                     <c ca="center">
                        <p>9.09%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>**9.09%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Gorilla gorilla</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>25%</p>
                     </c>
                     <c ca="center">
                        <p>8.33%</p>
                     </c>
                     <c ca="center">
                        <p>8.33%</p>
                     </c>
                     <c ca="center">
                        <p>8.33%</p>
                     </c>
                     <c ca="center">
                        <p>41.67%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Homo sapiens</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>9.09%</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="center">
                        <p>11.11%</p>
                     </c>
                     <c ca="center">
                        <p>4.44%</p>
                     </c>
                     <c ca="center">
                        <p>4.44%</p>
                     </c>
                     <c ca="center">
                        <p>2.22%</p>
                     </c>
                     <c ca="center">
                        <p>15.56%</p>
                     </c>
                     <c ca="center">
                        <p>**2.22%</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*The <it>k</it>-nearest neighbors (KNN) classifier was calculated over 1,000 independent trials for the mean error rates to be determined. Range of error values indicate that one or more trials provided a smaller error rate, though the average was computed.</p>
                  <p>&#8224; KNN <it>k </it>= 2: <it>&#949; </it>= 174,000; KNN <it>k </it>= 3: <it>&#949; </it>= 57,000; QDA: <it>&#949; </it>= 67,000; LDA: <it>&#949; </it>= 73,000</p>
                  <p><sup>&#8225; </sup>KNN <it>k </it>= 2: <it>&#949; </it>= 21,000; KNN <it>k </it>= 3: <it>&#949; </it>= 21,000; QDA: <it>&#949; </it>= 20,000; LDA: <it>&#949; </it>= 20,000</p>
                  <p>**The 9.09% error rate corresponds to a single <it>Pan paniscus </it>sample that is misclassified by LOO-CV using LDA as a classifier.</p>
               </tblfn>
            </tbl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Within these examples, the spectral method from Lafon is demonstrated to extract more meaningful structure, relative to two linear and three nonlinear methods, for calculating low dimensional representations of high dimensional data types, such as image and microarray data, for purposes of determining ordered patterns or classification. As a nonlinear method it is shown to be a reasonable choice for biological and image data types, where it is important to preserve nonlinear relationships and local geometries in a low dimensional embedding. Though the nonlinear methods of kernel PCA and Ng's spectral method also may be well suited for these data types, they suffer the primary drawback of requiring dataset- and classifier-specific tuning parameter optimization, making the validity of cross-experiment comparisons problematic. Other nonlinear manifold methods, such as ISOMAP <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> and Local Linear Embedding (LLE) <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, have similar optimization requirements as drawbacks, although tuning parameter optimization and classification accuracy for these two methods was not assessed here. This data fitting step can be not only time consuming, but also, as we have shown, varies according to which classification algorithm is used as well as which dataset is examined. In this work, the spectral method from Lafon is shown to outperform competing methods and exhibit independence to tuning parameter fitting across three separate classifiers and two unrelated high dimensional data types. Much like any method of DR, this method is not proposed to always elucidate the most meaningful structure across all high dimensional data types. Methods such as boosting and bagging <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> and the relative distance plane (RDP) <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> may be better suited for certain high dimensional datasets. Rather, the results presented here demonstrate success in two disparate datasets of high dimensionality and the authors' hope is that this presentation will encourage others to extend applications of this method in research within the computational biology community.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Data types</p>
            </st>
            <p>The image data was obtained from the Computer Vision Laboratory at the University of Massachusetts at Amherst <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. A total of 54 slice-by-slice CATSCAN images were obtained for the human head, where each image has dimensions of 128 &#215; 128 pixels. The average for each image was calculated and the median of these averages was determined to be 11.82. Each image was then scaled to a target mean of this value.</p>
            <p>The microarray data are from genomic primary fibroblast cell lines <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp> and were generated with Affymetrix oligonucleotide hgu95v2 arrays for 46 samples: 23 human (<it>Homo sapien</it>), 11 bonobo (<it>Pan paniscus</it>), and 12 gorilla (<it>Gorilla gorilla</it>) donors. This is a publicly available dataset within the 'fibroEset' package in R <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. It should be noted that two identical human donor arrays are in this dataset, so one was removed, reducing the dataset to 45 total samples.</p>
            <p>The data was provided in R already normalized by the Affymetrix GeneChip MAS 5.0 algorithm. "Normalization was done by calculating multiplicative scaling factors on the basis of the median intensity of the 60th to 95th percentile of gene-expression scores" and intensities were floored to 100 fluorescent units <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. No further filtering or scaling was conducted on this dataset, which consists of 12,625 expression points for each of 45 arrays.</p>
         </sec>
         <sec>
            <st>
               <p>Spectral methods</p>
            </st>
            <p>The spectral implementation of the weighted graph Laplacian from Lafon <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp> is calculated as follows: Given a set of points X = {<it>x</it><sub>1</sub>, <it>x</it><sub>2</sub>, ..., <it>x</it><sub><it>n</it></sub>} &#8712; &#8477;<sup><it>q</it></sup>, let <it>G </it>= (<it>E</it>, <it>V</it>) be a graph with edge weights or lines <it>E</it>, between pairs of vertices <it>V</it>. Consistent with standard terminology from graph theory, we can construct a graph, where each pair of vertices (<it>x</it><sub><it>i</it></sub>, <it>x</it><sub><it>j </it></sub>&#8712; <it>V</it>(<it>G</it>)) is assigned a weight specific to the distance between them <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. The matrix, <it>K</it><sub>1 </sub>is calculated from these edge weights by a Gaussian kernel estimate</p>
            <p><m:math name="1471-2105-7-74-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>K</m:mi><m:mn>1</m:mn></m:msub><m:mo>=</m:mo><m:msub><m:mi>K</m:mi><m:mn>1</m:mn></m:msub><m:mo stretchy="false">(</m:mo><m:mi>G</m:mi><m:mo stretchy="false">)</m:mo><m:mo>=</m:mo><m:mo stretchy="false">(</m:mo><m:msup><m:mi>e</m:mi><m:mrow><m:mo>&#8722;</m:mo><m:mo stretchy="false">(</m:mo><m:msup><m:mrow><m:mrow><m:mo>&#8214;</m:mo><m:mrow><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>&#8722;</m:mo><m:msub><m:mi>x</m:mi><m:mi>j</m:mi></m:msub></m:mrow><m:mo>&#8214;</m:mo></m:mrow></m:mrow><m:mn>2</m:mn></m:msup><m:mo>/</m:mo><m:mi>&#949;</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:msup><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGlbWsdaWgaaWcbaGaeGymaedabeaakiabg2da9iabdUealnaaBaaaleaacqaIXaqmaeqaaOGaeiikaGIaem4raCKaeiykaKIaeyypa0JaeiikaGIaemyzau2aaWbaaSqabeaacqGHsislcqGGOaakdaqbdaqaaiabdIha4naaBaaameaacqWGPbqAaeqaaSGaeyOeI0IaemiEaG3aaSbaaWqaaiabdQgaQbqabaaaliaawMa7caGLkWoadaahaaadbeqaaiabikdaYaaaliabc+caVGGaciab=v7aLjabcMcaPaaakiabcMcaPaaa@49D3@</m:annotation></m:semantics></m:math> if <it>i </it>&#8800; <it>j</it>, where <it>K</it><sub>1<it>ii </it></sub>= 0 and <it>&#949; </it>= min<sub><it>i</it>&#8800;<it>j </it></sub>||<it>x</it><sub><it>i </it></sub>- <it>x</it><sub><it>j</it></sub>||<sup>2 </sup>> 0</p>
            <p>The epsilon term is chosen at the minimum squared Euclidean distance, as opposed to the average minimum distance (specified by Lafon) to induce maximum convergence of distance (see section on <b>Comparison of transformed distances </b>for an example). The vector, <it>v </it>is calculated from the square root of the product between matrix components from <it>K</it><sub>1 </sub>and vector components from <it>e</it>. The matrix <it>P </it>is then formed by the product of <it>v </it>and <it>v</it><sub><it>T</it></sub>. Then, the weighted graph Laplacian matrix, <it>K </it>is calculated by component division of the matrix <it>K</it><sub>1 </sub>elements</p>
            <p>(<it>K</it><sub>1<it>ij </it></sub>where <it>i </it>= 1, ..., <it>q </it>and <it>j </it>= 1, ..., <it>n</it>) by the matrix <it>P </it>elements</p>
            <p>(<it>P</it><sub><it>ij </it></sub>where <it>i </it>= 1, ..., <it>q </it>and <it>j </it>= 1, ..., <it>n</it>). The calculations are given as the following:</p>
            <p><m:math name="1471-2105-7-74-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>v</m:mi><m:mo>=</m:mo><m:msqrt><m:mrow><m:msub><m:mi>K</m:mi><m:mrow><m:mn>1</m:mn><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub><m:msub><m:mi>e</m:mi><m:mi>i</m:mi></m:msub></m:mrow></m:msqrt></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWG2bGDcqGH9aqpdaGcaaqaaiabdUealnaaBaaaleaacqaIXaqmcqWGPbqAcqWGQbGAaeqaaOGaemyzau2aaSbaaSqaaiabdMgaPbqabaaabeaaaaa@370E@</m:annotation></m:semantics></m:math> where <it>i </it>= 1, ..., <it>q</it>; <it>j </it>= 1, ..., <it>n</it>; and <it>e </it>= (1, 1, ..., 1)<sup><it>T</it></sup>, then</p>
            <p><it>P </it>= <it>vv</it><sup><it>T </it></sup>and</p>
            <p><it>K </it>= <it>K</it><sub>1<it>ij</it></sub>/<it>P</it><sub><it>ij </it></sub>where <it>i </it>= 1, ..., <it>q </it>and <it>j </it>= 1, ..., <it>n</it>.</p>
            <p>The <it>K </it>matrix is decomposed by singular value decomposition (<it>svd</it>)</p>
            <p><it>svd</it>(<it>K</it>) = <it>XHV</it><sup><it>T </it></sup>(Note that for this symmetric positive semidefinite matrix <it>K</it>, the <it>svd </it>is the spectral decomposition, however, to remain consistent with the nomenclature specified by Lafon, the calculations with <it>svd </it>are used.)</p>
            <p>and the <it>n </it>columns of the <it>X </it>matrix which define the left singular vectors of <it>K </it>are scaled by the first column of <it>X</it>, given by</p>
            <p><m:math name="1471-2105-7-74-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>&#934;</m:mi><m:mi>j</m:mi></m:msub><m:mo>=</m:mo><m:mfrac><m:mrow><m:mi>X</m:mi><m:mo stretchy="false">[</m:mo><m:mo>,</m:mo><m:mi>j</m:mi><m:mo stretchy="false">]</m:mo></m:mrow><m:mrow><m:mi>X</m:mi><m:mo stretchy="false">[</m:mo><m:mo>,</m:mo><m:mn>1</m:mn><m:mo stretchy="false">]</m:mo></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqqHMoGrdaWgaaWcbaGaemOAaOgabeaakiabg2da9maalaaabaGaemiwaGLaei4waSLaeiilaWIaemOAaOMaeiyxa0fabaGaemiwaGLaei4waSLaeiilaWIaeGymaeJaeiyxa0faaaaa@3C4E@</m:annotation></m:semantics></m:math> where <it>j </it>= 1, ..., <it>n</it>.</p>
            <p>This provides <it>n </it>- 1 characteristic roots of the matrix <it>K </it>given by &#934;<sub><it>j</it></sub>. It should be noted that the first column of <it>X </it>is scaled by itself, creating a vector of values equal to 1. By convention, this vector is designated by &#934;<sub>0</sub>. As such, the second column in the matrix <it>X </it>will be considered the primary &#934; vector, and designated by &#934;<sub>1</sub>. Utilizing these primary &#934; vectors, the data can be embedded as points in &#8477;<sup>&#934;</sup>.</p>
            <p>The spectral implementation of the weighted graph Laplacian from Ng <it>et al</it>. is similar up to the calculation of the <it>K</it><sub>1 </sub>matrix, however, the kernel is defined by a denominator term of 2<it>&#949;</it><sup>2 </sup>as opposed to simply <it>&#949; </it>in Lafon's method. In addition, the epsilon smoothing term (<it>&#949;</it>) (as will be demonstrated in the <b>Results </b>section) is not optimized at <it>&#949; </it>= min <sub><it>i</it>&#8800;<it>j </it></sub>||<it>x</it><sub><it>i </it></sub>- <it>x</it><sub><it>j</it></sub>||<sup>2 </sup>> 0, as it is in our modification to Lafon's method, and thus requires fitting for each example analyzed and classifier utilized. Following the calculation of the <it>K</it><sub>1 </sub>matrix, the matrix <it>D </it>is a diagonal matrix calculated from the row sums of <it>K</it><sub>1</sub>,</p>
            <p><m:math name="1471-2105-7-74-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>D</m:mi><m:mrow><m:mi>i</m:mi><m:mi>i</m:mi></m:mrow></m:msub><m:mo>=</m:mo><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>j</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover><m:mrow><m:msub><m:mi>K</m:mi><m:mrow><m:mn>1</m:mn><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGebardaWgaaWcbaGaemyAaKMaemyAaKgabeaakiabg2da9maaqahabaGaem4saS0aaSbaaSqaaiabigdaXiabdMgaPjabdQgaQbqabaaabaGaemOAaOMaeyypa0JaeGymaedabaGaemOBa4ganiabggHiLdaaaa@3D91@</m:annotation></m:semantics></m:math> where <it>i </it>= 1, ..., <it>q</it>. Then the normalized Laplacian matrix, <it>L </it>is calculated as</p>
            <p><it>L </it>= <it>D</it><sup>-1/2</sup><it>K</it><sub>1</sub><it>D</it><sup>-1/2</sup></p>
            <p>The Laplacian matrix, <it>L </it>is decomposed by <it>svd</it>, and the <it>n </it>columns of the <it>X </it>matrix which define the left singular vectors of <it>L </it>have rows scaled to unit length into the matrix <it>Y</it></p>
            <p><it>svd</it>(<it>L</it>) = <it>XHV</it><sup><it>T </it></sup>(Note the point specified in the decomposition of the weighted graph Laplacian as provided by Lafon.)</p>
            <p><m:math name="1471-2105-7-74-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>Y</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub><m:mo>=</m:mo><m:mfrac><m:mrow><m:msub><m:mi>X</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow><m:mrow><m:msup><m:mrow><m:mo stretchy="false">(</m:mo><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>j</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:munderover><m:mrow><m:msubsup><m:mi>X</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow></m:mstyle><m:mo stretchy="false">)</m:mo></m:mrow><m:mrow><m:mn>1</m:mn><m:mo>/</m:mo><m:mn>2</m:mn></m:mrow></m:msup></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGzbqwdaWgaaWcbaGaemyAaKMaemOAaOgabeaakiabg2da9maalaaabaGaemiwaG1aaSbaaSqaaiabdMgaPjabdQgaQbqabaaakeaacqGGOaakdaaeWbqaaiabdIfaynaaDaaaleaacqWGPbqAcqWGQbGAaeaacqaIYaGmaaaabaGaemOAaOMaeyypa0JaeGymaedabaGaemOBa4ganiabggHiLdGccqGGPaqkdaahaaWcbeqaaiabigdaXiabc+caViabikdaYaaaaaaaaa@46C2@</m:annotation></m:semantics></m:math> where <it>i </it>= 1, ..., <it>q </it>and <it>j </it>=1, ..., <it>n</it>.</p>
            <p>Utilizing the primary &#934; columns of this <it>Y </it>matrix, the data can be embedded as points in &#8477;<sup>&#934;</sup>.</p>
         </sec>
         <sec>
            <st>
               <p>Additional DR (projection) methods</p>
            </st>
            <p>Two implementations of PCA were calculated for comparison: correlation and covariance. These calculations were conducted with the <it>prcomp </it>function in the 'stats' package of R <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Since classical MDS is synonymous to PCA calculated on a Euclidean distance matrix, nonmetric MDS was performed instead (based on ranking of dissimilarities), to avoid redundant information and for an additional comparison with another nonlinear approach, using the <it>isoMDS </it><abbrgrp><abbr bid="B31">31</abbr></abbrgrp> function in the 'MASS' package of R <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Kernel PCA was calculated with a Gaussian radial basis function kernel using the <it>k.pca </it>function in the 'kmethods' package of R <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. This kernel function was chosen to maintain consistency with the kernel used in both spectral methods. The weighted graphs Laplacian for the two spectral methods are anchored on some form of a Gaussian kernel (see <b>Spectral methods </b>section for the difference between the kernels).</p>
         </sec>
         <sec>
            <st>
               <p>Comparison of transformed distances</p>
            </st>
            <p>In order to better demonstrate how each of the three DR methods with a kernel function transforms Euclidean distances to its respective weighting values (e.g. Gaussian radial basis function, weighted graph Laplacian <it>L </it>matrix-Ng <it>et al</it>., weighted graph Laplacian <it>K </it>matrix-Lafon), an example was generated with simulated data. A data matrix of five observations was generated (each composed of 10 variables) and the three methods with a kernel function were calculated on the data matrix to compare between the final transformed weight matrix that is decomposed (i.e. <it>svd</it>) and the standard Euclidean distance matrix (Table <tblr tid="T3">3</tblr>). This evaluation illustrates how distances within each method are transformed in the final step prior to decomposition, and shows where on the distribution local and remote distances converge in this transformation. For example, in kernel PCA, a Gaussian radial basis function kernel is computed from the distance matrix and these entries are plotted against their respective Euclidean distances to represent the transformed space that the eigenfunctions are calculated on, in order to provide a low dimensional embedding. For both weighted graphs Laplacian from Ng <it>et al</it>. and Lafon, the weight values in the <it>L </it>and <it>K </it>matrix, respectively, are compared against the Euclidean distance matrix.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Euclidean distance matrix for simulated example</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>4</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>5</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0</p>
                     </c>
                     <c ca="center">
                        <p>5.4</p>
                     </c>
                     <c ca="center">
                        <p>26.8</p>
                     </c>
                     <c ca="center">
                        <p>31.4</p>
                     </c>
                     <c ca="center">
                        <p>56.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>5.4</p>
                     </c>
                     <c ca="center">
                        <p>0.0</p>
                     </c>
                     <c ca="center">
                        <p>30.2</p>
                     </c>
                     <c ca="center">
                        <p>35.6</p>
                     </c>
                     <c ca="center">
                        <p>59.5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>26.8</p>
                     </c>
                     <c ca="center">
                        <p>30.2</p>
                     </c>
                     <c ca="center">
                        <p>0.0</p>
                     </c>
                     <c ca="center">
                        <p>12.2</p>
                     </c>
                     <c ca="center">
                        <p>39.1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>4</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>31.4</p>
                     </c>
                     <c ca="center">
                        <p>35.6</p>
                     </c>
                     <c ca="center">
                        <p>12.2</p>
                     </c>
                     <c ca="center">
                        <p>0.0</p>
                     </c>
                     <c ca="center">
                        <p>40.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>5</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>56.4</p>
                     </c>
                     <c ca="center">
                        <p>59.5</p>
                     </c>
                     <c ca="center">
                        <p>39.1</p>
                     </c>
                     <c ca="center">
                        <p>40.0</p>
                     </c>
                     <c ca="center">
                        <p>0.0</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>The Euclidean matrix in Table <tblr tid="T3">3</tblr> contains 10 unique distance values. Two values are less than the 25% of the distribution, three values are greater than the 75% of the distribution, and the remaining values are within the interquartile range. This range of distances is utilized to convey both the subtle and apparent differences within the transformed space between methods.</p>
            <p>The results are shown in Figure <figr fid="F6">6</figr>. For each plot, the points represent the relationship between the transformed space (y-axis) and Euclidean distance (x-axis). The trend lines in each plot are determined by ordering both the x and y axis vectors, so an ordered transformation from Euclidean distance values to the weighted values would be depicted with a line that passes through each point. Those plots that do not exhibit this line pattern do not maintain the identical ordering from distances to weights. The latter statement does not imply that there is either a disadvantage or an advantage to the method. It simply provides a means of comparison for those methods that distort the ordering of some distances when transformed into a weight value. The black line in each plot is calculated based on assigning the epsilon smoothing term to the minimum non-zero squared Euclidean distance (<it>&#949; </it>= min<sub><it>i</it>&#8800;<it>j </it></sub>||<it>x</it><sub><it>i </it></sub>- <it>x</it><sub><it>j</it></sub>||<sup>2 </sup>> 0). Each subsequent line that is shaded by a portion of the rainbow spectrum (ROYGB) is then calculated by increasing this epsilon term by 1% (determined from the distribution of the squared Euclidean distances) increments up to the 50% of the distribution of squared Euclidean distances. Each line color is assigned to bins of 10% incremental values from 1% to 50% (e.g. weight value lines calculated with epsilon = 1%&#8211;10% values from the distribution of squared Euclidean distances are shaded red, epsilon = 11%&#8211;20% values from the distribution of squared Euclidean distances are shaded orange, etc.).</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Weighted values vs. Euclidean distance</p>
               </caption>
               <text>
                  <p><b>Weighted values vs. Euclidean distance</b>. For each plot, the black trend line is drawn based on ordering the values for each vector (weighted values and Euclidean distance values); this gives a general fitting of each curve. The portion of the rainbow spectrum (ROYGB) shading for the lines are drawn by the same criterion, however, each line is calculated based on a set of increasing epsilon smoothing values. For example, epsilon values are increased from the 1% quantile to the 50% quantile of the squared Euclidean distance distribution, where each set of 10% values are plotted with a separate color in the rainbow spectrum, staring at red and ending at blue. This method of line shading illustrates how the transformed Euclidean distances are adjusted across a dynamic range of epsilon values for each method. <b>(a) </b>Gaussian radial basis function kernel values vs. Euclidean distances. <b>(b) </b>Entries in the matrix of the weighted graph Laplacian, <it>L </it>from Ng <it>et al</it>., vs. Euclidean distance. <b>(c) </b>Entries in the matrix of the weighted graph Laplacian, <it>K </it>from Lafon vs. Euclidean distance.</p>
               </text>
               <graphic file="1471-2105-7-74-6"/>
            </fig>
            <p>From the kernel PCA plot in Figure <figr fid="F6">6a</figr>, a Gaussian radial basis function demonstrates convergence of local Euclidean distances in the transformation to weight values, while remote distances exhibit more linearity with these weight values. In addition, as the epsilon smoothing term is increased at very small values, local distances converge to similar weight values much faster than remote distances. This example illustrates how local distance structure is better preserved at levels specific to the epsilon smoothing term utilized, than remote distances. Small distances are collapsed to a similar weighted value, while large distances maintain relatively stronger linearity with Euclidean distances, though this is reduced as the epsilon smoothing term is increased to values greater than the 10% of the distribution of squared Euclidean distances (denoted by the change in line shading from red to orange). The two smallest Euclidean distances converge to similar values rather quickly as the epsilon smoothing term is increased, while the points in the distance distribution converge more slowly. At epsilon values greater than the 20% (line shading of yellow, green, and blue) of the distribution of squared Euclidean distances, the weight values all converge to the same value of 1.</p>
            <p>The Laplacian matrix, <it>L </it>from Ng <it>et al</it>., in Figure <figr fid="F6">6b</figr> shows that at minimal values of the epsilon smoothing term, there is near linearity between Euclidean distances and values in the matrix <it>L</it>. However, as the smoothing epsilon term is increased, the convergence occurs with both local and remote distances, while the points in the middle of this distribution maintain similar values (represented by the approximate slope of 0 from the values within the interquartile range of the plot). Both tails of the trend line quickly approach a weight value of ~0.25 with epsilon values at less than 20% of the distribution of squared Euclidean distances (denoted by line shading of only red and orange). This example illustrates that values within the middle region of the distribution (as compared the right and left tails of the distribution) are transformed to similar weight values in matrix <it>L</it>, with rather small values of the smoothing epsilon term, and then as this epsilon value is increased to only slightly larger values, all points in the function converge to a similar weight value. In addition, the Laplacian matrix, <it>L </it>does not preserve the same ordering of Euclidean distances.</p>
            <p>The Laplacian matrix, <it>K </it>from Lafon (Figure <figr fid="F6">6c</figr>), in contrast to the matrix <it>L</it>, demonstrates convergence of weight values on both extremes of the Euclidean distance vector at very small values of the epsilon smoothing term, as illustrated by the small variance in red lines (i.e. epsilon vales at less than the 10% of the distribution of squared Euclidean distance). Additionally, opposite to the matrix <it>L</it>, these weight values become more linear (less converged) on both right and left tails of the distribution as the epsilon smoothing term is <it>increased </it>to values greater than the 20% of the distribution of squared Euclidean distances (line shading corresponding to colors of yellow, green, and blue). It is interesting to note that at maximal epsilon values (indicated by yellow, green, and blue lines in Figure <figr fid="F6">6c</figr>), the function generated by the weight values in the <it>K </it>matrix resemble the function generated by the weight values in the <it>L </it>matrix at the minimum epsilon value (indicated by the black line in Figure <figr fid="F6">6b</figr>). However, similar to the <it>L </it>matrix values, the ordering of Euclidean distances is not preserved. This example illustrates how the Lafon method differs from the other two, in that maximal convergence of both local and remote distances is optimized at minimal epsilon values (i.e. values at less than 10% of the distribution of squared Euclidean distance for this example). Based on the partitioned structure in the data, this reduces the dynamic range of Euclidean distances, particularly along the middle of the distribution, and transforms the distance structure to weight values (in matrix <it>K</it>) at both extremes of the distribution. Such a transformation, that acts to create a large gap between local and remote distances (at a partitioning threshold driven by the data) is shown to provide an optimal distance transformation for subsequent spectral decomposition for purposes of elucidating meaningful structure in image seriation and microarray species classification examples presented. Since the convergence of local and remote distances in the <it>K </it>matrix is highly dependent upon the primary partitioning point (for this example, between the Euclidean distances of 12.2 and 26.8), it is assumed that the most meaningful structures are defined by the difference between the smallest two Euclidean distances and the remaining eight Euclidean distances.</p>
         </sec>
         <sec>
            <st>
               <p>Classifiers</p>
            </st>
            <p>The <it>k</it>-nearest neighbor (KNN) algorithm (for <it>k </it>= 2 and 3) was calculated for 1,000 independent trials for each method (to address the problem of random assignment when ties occur in nearest neighbor voting) and the mean error was computed using the <it>knn </it>function in the 'class' package of R <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. The discriminant analysis classifiers both for separate variances (QDA) and pooled variances (LDA) were computed using the <it>qda </it>and <it>lda </it>functions, respectively, in the 'MASS' package of R <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. All classification models were trained and tested with leave-one-out cross-validation (LOO-CV).</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>BWH conducted the data analysis, created the visuals, and was involved in drafting the manuscript. Both JW and JLS were involved in the revisions, edits, and critically assessing the manuscript for technical and general content. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We are thankful to Alan Berger, David Johannsen, David Marchette, the reviewers of this manuscript, and the BMC Editorial Staff for their useful criticism.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>A new approach to feature selection based on the Karhunen-Loeve expansion</p>
            </title>
            <aug>
               <au>
                  <snm>Kittler</snm>
                  <fnm>JV</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>PC</fnm>
               </au>
            </aug>
            <source>Pattern Recognition</source>
            <pubdate>1973</pubdate>
            <volume>5</volume>
            <fpage>335</fpage>
            <lpage>352</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/0031-3203(73)90025-3</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <aug>
               <au>
                  <snm>Cox</snm>
                  <fnm>TF</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>MAA</fnm>
               </au>
            </aug>
            <source>Multidimensional Scaling</source>
            <publisher>London: Chapman and Hall</publisher>
            <edition>Second</edition>
            <pubdate>1994</pubdate>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Diffusion Maps and Geometric Harmonics</p>
            </title>
            <aug>
               <au>
                  <snm>Lafon</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>PhD thesis</source>
            <publisher>Yale University, Mathematics Department</publisher>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Geometric diffusions as a tool for harmonic analysis and structure definition of data: Diffusion Maps</p>
            </title>
            <aug>
               <au>
                  <snm>Coifman</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Lafon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Maggioni</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nadler</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Warner</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Zucker</snm>
                  <fnm>SW</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>21</issue>
            <fpage>7426</fpage>
            <lpage>7431</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1140422</pubid>
                  <pubid idtype="pmpid" link="fulltext">15899970</pubid>
                  <pubid idtype="doi">10.1073/pnas.0500334102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Deriving Meaningful Structure from Spectral Embedding</p>
            </title>
            <aug>
               <au>
                  <snm>Higgs</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>PhD thesis</source>
            <publisher>George Mason University, School of Computational Sciences</publisher>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Geometric Method for Feature Extraction and Dimensional Reduction: A Guided Tour</p>
            </title>
            <aug>
               <au>
                  <snm>Burges</snm>
                  <fnm>CJC</fnm>
               </au>
            </aug>
            <source>Microsoft Research Technical Report No. MSR-TR-2004-55</source>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B7">
            <aug>
               <au>
                  <snm>Chung</snm>
                  <fnm>FRK</fnm>
               </au>
            </aug>
            <source>Spectral Graph Theory (CBMS Regional Conference Series in Mathematics, No. 92)</source>
            <publisher>Providence: American Mathematical Society</publisher>
            <pubdate>1997</pubdate>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Semi-Supervised Learning on Riemannian Manifolds</p>
            </title>
            <aug>
               <au>
                  <snm>Belkin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Niyogi</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Machine Learning</source>
            <pubdate>2004</pubdate>
            <volume>56</volume>
            <fpage>209</fpage>
            <lpage>239</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1023/B:MACH.0000033120.25363.1e</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Laplacian Eigenmaps for dimensionality reduction and data representation</p>
            </title>
            <aug>
               <au>
                  <snm>Belkin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Niyogi</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Neural Computation</source>
            <pubdate>2003</pubdate>
            <volume>15</volume>
            <issue>6</issue>
            <fpage>1373</fpage>
            <lpage>1396</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1162/089976603321780317</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Geometric diffusions as a tool for harmonic analysis and structure definition of data: Multiscale Methods</p>
            </title>
            <aug>
               <au>
                  <snm>Coifman</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Lafon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Maggioni</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nadler</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Warner</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Zucker</snm>
                  <fnm>SW</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>21</issue>
            <fpage>7432</fpage>
            <lpage>7437</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1140426</pubid>
                  <pubid idtype="pmpid" link="fulltext">15899969</pubid>
                  <pubid idtype="doi">10.1073/pnas.0500896102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>On Spectral Clustering: Analysis and an Algorithm</p>
            </title>
            <aug>
               <au>
                  <snm>Ng</snm>
                  <fnm>AY</fnm>
               </au>
               <au>
                  <snm>Jordan</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>NIPS</source>
            <pubdate>2001</pubdate>
            <fpage>14</fpage>
         </bibl>
         <bibl id="B12">
            <title>
               <p>A kernel view of the dimensionality reduction of manifolds</p>
            </title>
            <aug>
               <au>
                  <snm>Ham</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Mika</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Scholkopf</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Max Planck Technical Report No. TR-110</source>
            <pubdate>2003</pubdate>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Segmentation using eigenvectors: A unifying view</p>
            </title>
            <aug>
               <au>
                  <snm>Weiss</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>IEEE International Conference on Computer Vision</source>
            <pubdate>1999</pubdate>
            <fpage>975</fpage>
            <lpage>982</lpage>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Spectral Kernel Methods for Clustering</p>
            </title>
            <aug>
               <au>
                  <snm>Cristianini</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Shawe-Taylor</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kandola</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>NIPS</source>
            <pubdate>2002</pubdate>
            <fpage>14</fpage>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Kernel PCA and de-noising in feature spaces</p>
            </title>
            <aug>
               <au>
                  <snm>Mika</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Scholkopf</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Smola</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Scholz</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ratsch</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>NIPS</source>
            <pubdate>1999</pubdate>
            <fpage>11</fpage>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Normalized cuts and image segmentation</p>
            </title>
            <aug>
               <au>
                  <snm>Shi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Malik</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proc IEEE Transactions on Pattern Analysis and Machine Intelligence</source>
            <pubdate>2000</pubdate>
            <volume>22</volume>
            <issue>8</issue>
            <fpage>731</fpage>
            <lpage>737</lpage>
         </bibl>
         <bibl id="B17">
            <title>
               <p>A factorization approach to grouping</p>
            </title>
            <aug>
               <au>
                  <snm>Perona</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Freeman</snm>
                  <fnm>WT</fnm>
               </au>
            </aug>
            <source>Proc 5th ECCV</source>
            <editor>Burkardt H, Neumann B</editor>
            <pubdate>1998</pubdate>
            <fpage>655</fpage>
            <lpage>670</lpage>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Cluster analysis and display of genome-wide expression patterns</p>
            </title>
            <aug>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Spellman</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>14863</fpage>
            <lpage>14868</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">24541</pubid>
                  <pubid idtype="pmpid" link="fulltext">9843981</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.25.14863</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>A genome-wide transcriptional analysis of the mitotic cell cycle</p>
            </title>
            <aug>
               <au>
                  <snm>Cho</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Winzeler</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Steinmetz</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Conway</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wodicka</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wolfsberg</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Gabrielian</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Landsman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lockhart</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>1998</pubdate>
            <volume>2</volume>
            <fpage>65</fpage>
            <lpage>73</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1097-2765(00)80114-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">9702192</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Comprehensive Identification of Cell Cycle-regulated Genes of the Yeast <it>Saccharomyces cerevisiae </it>by Microarray Hybridization</p>
            </title>
            <aug>
               <au>
                  <snm>Spellman</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>MQ</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>VR</fnm>
               </au>
               <au>
                  <snm>Anders</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Futcher</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Molecular Biology of the Cell</source>
            <pubdate>1998</pubdate>
            <volume>9</volume>
            <issue>12</issue>
            <fpage>3273</fpage>
            <lpage>3297</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">25624</pubid>
                  <pubid idtype="pmpid" link="fulltext">9843569</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Approximate geodesic distances reveal biologically relevant structures in microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Nilsson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fioretos</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hoglund</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fontes</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>874</fpage>
            <lpage>880</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg496</pubid>
                  <pubid idtype="pmpid" link="fulltext">14752004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>A Global Geometric Framework for Nonlinear Dimensionality Reduction</p>
            </title>
            <aug>
               <au>
                  <snm>Tenenbaum</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Langford</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>290</volume>
            <fpage>2319</fpage>
            <lpage>2322</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.290.5500.2319</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125149</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Nonlinear Dimensionality Reduction by Local Linear Embedding</p>
            </title>
            <aug>
               <au>
                  <snm>Roweis</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Saul</snm>
                  <fnm>LK</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>290</volume>
            <fpage>2323</fpage>
            <lpage>2326</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.290.5500.2323</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125150</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>An empirical comparison of voting classification algorithms: Bagging, boosting, and variants</p>
            </title>
            <aug>
               <au>
                  <snm>Bauer</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Kohavi</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Machine Learning</source>
            <pubdate>1999</pubdate>
            <volume>36</volume>
            <issue>1&#8211;2</issue>
            <fpage>105</fpage>
            <lpage>139</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1023/A:1007515423169</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Mapping high dimensional data onto a relative distance plane-an exact method for visualizing and characterizing high-dimensional patterns</p>
            </title>
            <aug>
               <au>
                  <snm>Somorjai</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Dolenko</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Demko</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mandelzweig</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nikulin</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Baumgartner</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Pizzi</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <source>Journal of Biomedical Informatics</source>
            <pubdate>2004</pubdate>
            <volume>37</volume>
            <fpage>366</fpage>
            <lpage>379</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jbi.2004.07.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">15488750</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <publisher>University of Massachusetts at Amherst CATSCAN images</publisher>
            <url>http://vis-www.cs.umass.edu/files.html</url>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Comparative Analysis of Gene-Expression Patterns in Human and African Great Ape Cultured Fibroblasts</p>
            </title>
            <aug>
               <au>
                  <snm>Karaman</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Houck</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Chemnick</snm>
                  <fnm>LG</fnm>
               </au>
               <au>
                  <snm>Nagpal</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chawannakul</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Sudano</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Pike</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Ho</snm>
                  <fnm>VV</fnm>
               </au>
               <au>
                  <snm>Ryder</snm>
                  <fnm>OA</fnm>
               </au>
               <au>
                  <snm>Hacia</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Genome Research</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>1619</fpage>
            <lpage>1630</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403735</pubid>
                  <pubid idtype="pmpid" link="fulltext">12840040</pubid>
                  <pubid idtype="doi">10.1101/gr.1289803</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Evolutionary sequence comparisons using high-density oligonucleotide arrays</p>
            </title>
            <aug>
               <au>
                  <snm>Hacia</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Makalowski</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Edgemon</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Erdos</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Robbins</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Fodor</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Brody</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>FS</fnm>
               </au>
            </aug>
            <source>Nature Genetics</source>
            <pubdate>1998</pubdate>
            <volume>18</volume>
            <fpage>155</fpage>
            <lpage>158</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng0298-155</pubid>
                  <pubid idtype="pmpid" link="fulltext">9462745</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <aug>
               <au>
                  <cnm>R Development Core Team</cnm>
               </au>
            </aug>
            <source>R: A Language and Environment for Statistical Computing</source>
            <publisher>R Foundation for Statistical Computing, Vienna, Austria</publisher>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The Laplacian spectrum of graphs</p>
            </title>
            <aug>
               <au>
                  <snm>Mohar</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Graph Theory, Combinatorics, and Applications</source>
            <publisher>Wiley</publisher>
            <editor>Alavi Y, Schwenk A</editor>
            <pubdate>1991</pubdate>
            <volume>2</volume>
            <fpage>871</fpage>
            <lpage>898</lpage>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Nonmetric multidimensional scaling: a numerical method</p>
            </title>
            <aug>
               <au>
                  <snm>Kruskal</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Psychometrika</source>
            <pubdate>1964</pubdate>
            <volume>29</volume>
            <fpage>115</fpage>
            <lpage>129</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/BF02289694</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
