<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1742-4682-3-19</ui>
   <ji>1742-4682</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>A statistical method for predicting splice variants between two groups of samples using GeneChip<sup>&#174; </sup>expression array data</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Fan</snm>
               <fnm>Wenhong</fnm>
               <insr iid="I1"/>
               <email>wfan@fhcrc.org</email>
            </au>
            <au id="A2">
               <snm>Khalid</snm>
               <fnm>Najma</fnm>
               <insr iid="I1"/>
               <email>nkhalid@fhcrc.org</email>
            </au>
            <au id="A3">
               <snm>Hallahan</snm>
               <mi>R</mi>
               <fnm>Andrew</fnm>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>a.hallaham@uq.edu.au</email>
            </au>
            <au id="A4">
               <snm>Olson</snm>
               <mi>M</mi>
               <fnm>James</fnm>
               <insr iid="I2"/>
               <email>jolson@fhcrc.org</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Zhao</snm>
               <mnm>Ping</mnm>
               <fnm>Lue</fnm>
               <insr iid="I1"/>
               <email>lzhao@fhcrc.org</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N., Seattle, WA 98109, USA</p>
            </ins>
            <ins id="I2">
               <p>Clinical Research Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N., Seattle, WA 98109, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Paediatrics and Child Health, University of Queensland, QLD, 4029, Australia</p>
            </ins>
         </insg>
         <source>Theoretical Biology and Medical Modelling</source>
         <issn>1742-4682</issn>
         <pubdate>2006</pubdate>
         <volume>3</volume>
         <issue>1</issue>
         <fpage>19</fpage>
         <url>http://www.tbiomed.com/content/3/1/19</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16603076</pubid>
               <pubid idtype="doi">10.1186/1742-4682-3-19</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>23</day>
               <month>1</month>
               <year>2006</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>07</day>
               <month>4</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>07</day>
               <month>4</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Fan et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Alternative splicing of pre-messenger RNA results in RNA variants with combinations of selected exons. It is one of the essential biological functions and regulatory components in higher eukaryotic cells. Some of these variants are detectable with the Affymetrix GeneChip<sup>&#174; </sup>that uses multiple oligonucleotide probes (i.e. probe set), since the target sequences for the multiple probes are adjacent within each gene. Hybridization intensity from a probe correlates with abundance of the corresponding transcript. Although the multiple-probe feature in the current GeneChip<sup>&#174; </sup>was designed to assess expression values of individual genes, it also measures transcriptional abundance for a sub-region of a gene sequence. This additional capacity motivated us to develop a method to predict alternative splicing, taking advance of extensive repositories of GeneChip<sup>&#174; </sup>gene expression array data.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We developed a two-step approach to predict alternative splicing from GeneChip<sup>&#174; </sup>data. First, we clustered the probes from a probe set into pseudo-exons based on similarity of probe intensities and physical adjacency. A pseudo-exon is defined as a sequence in the gene within which multiple probes have comparable probe intensity values. Second, for each pseudo-exon, we assessed the statistical significance of the difference in probe intensity between two groups of samples. Differentially expressed pseudo-exons are predicted to be alternatively spliced. We applied our method to empirical data generated from GeneChip<sup>&#174; </sup>Hu6800 arrays, which include 7129 probe sets and twenty probes per probe set. The dataset consists of sixty-nine medulloblastoma (27 metastatic and 42 non-metastatic) samples and four cerebellum samples as normal controls. We predicted that 577 genes would be alternatively spliced when we compared normal cerebellum samples to medulloblastomas, and predicted that thirteen genes would be alternatively spliced when we compared metastatic medulloblastomas to non-metastatic ones. We checked the consistency of some of our findings with information in UCSC Human Genome Browser.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The two-step approach described in this paper is capable of predicting some alternative splicing from multiple oligonucleotide-based gene expression array data with GeneChip<sup>&#174; </sup>technology. Our method employs the extensive repositories of gene expression array data available and generates alternative splicing hypotheses, which can be further validated by experimental studies.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Alternative splicing of pre-messenger RNA is an essential biological functional and regulatory component in higher eukaryotic cells. It increases the complexity of biological processes and gives the cells enhanced capability to respond to various factors, such as developmental changes and environmental stimuli. Some splice variants have been associated with diseases, such as mammary tumorigenesis <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and ovarian cancer <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. About 15% of single nucleotide mutations in the exon recognition process are associated with human genetic diseases <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Understanding the alternative splicing mechanism may also lead to finding potential treatments for related diseases <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>In this paper, we describe a method for detecting alternative splicing variants using the GeneChip<sup>&#174; </sup>gene expression array data. Affymetrix GeneChip<sup>&#174; </sup>technology employs multiple probes per gene to measure gene expression. These multiple probes are short sequences located in different positions within each gene. Even though distributions of these probe sequences are not optimized for detecting alternative splicing, the probe sequence data obtained by the current GeneChip<sup>&#174; </sup>technology can be used to assess alternative splicing. In our method, we infer "pseudo-exons" from hybridization intensities of multiple probes that are spread over a probe set. A pseudo-exon is defined as a range of expressed sequence on the genome that we infer to be an exon based on probe intensities and physical adjacency.</p>
         <p>Figure <figr fid="F1">1</figr> illustrates how GeneChip<sup>&#174; </sup>expression array data can be used to detect alternative splicing. We show the probe locations for a hypothetical gene in Figure <figr fid="F1">1A</figr> and their corresponding hybridization intensities in Figure <figr fid="F1">1B</figr>. From the probe intensities, we infer that three clusters of probes represent three pseudo-exons (Figure 1C). For each of the pseudo-exons, we test whether the difference in probe intensities between tissue 1 and tissue 2 is significant. If the difference is statistically significant, we infer that there is alternative splicing between the two tissues for the region corresponding to the selected pseudo-exon. In our illustration, the region between probe #7 and probe #14, i.e. pseudo-exon 2 is predicted to be alternatively spliced between tissue 1 and tissue 2.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>A Multiple probes are used to quantify the expression value for a gene in GeneChip<sup>&#174; </sup>technology</p>
            </caption>
            <text>
               <p><b>A </b>Multiple probes are used to quantify the expression value for a gene in GeneChip<sup>&#174; </sup>technology. Currently the probe design has a 3' bias, i.e. probes are selected from the sequence at the 3'end of the gene. In the Hu6800 array, twenty probes are used for a single gene. 1 <b>B </b>Intensities of the twenty probes are plotted for both tissues 1 and 2. 1 <b>C </b>The twenty probes are clustered into three groups based on the similarity of probe intensity and probe adjacency. Each cluster, called a pseudo-exon in this paper, represents a sub-region of the gene.</p>
            </text>
            <graphic file="1742-4682-3-19-1"/>
         </fig>
         <p>Previously, Hu et al reported a method, based on fold changes, to predict alternative splicing from GeneChip<sup>&#174; </sup>expression array data on ten tissue types <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. For each probe, they calculated the difference in the fold change between each tissue type and the average of the remaining tissue types for the corresponding probe. If the fold change was greater than an empirically-determined threshold value R, they selected the gene sequence corresponding to that selected probe as an alternative splicing site for that tissue type. However, there are some problems with Hu's approach. First, the fold-change approach does not take into account sample variation and thus is less reliable when sample-to-sample variations are large. Second, their method is designed to predict splice variants in a dataset with multiple tissue types. Hu et al reported that prediction power decreased for a dataset that contained only three tissue types compared to a dataset that consisted of ten tissue types. The robustness of their method depended on the number of the tissue types in the dataset. Thus, their method is not suitable for the comparison of two tissue types such as detection of splice variants between two phenotypes, or two disease status, or two experimental stimuli.</p>
         <p>In this paper, we propose an approach to predict splice variants between two groups of samples from GeneChip<sup>&#174; </sup>expression array data, taking into consideration sample variation. Our t-test based approach is more statistically vigorous and reliable than fold-change based methods. Furthermore, our method does not rely on a large number of tissue types. We implemented the method from Hu et al and compared the splice variants predicted from the two approaches. Our dataset consists of normal cerebellum, non-metastatic medulloblastomas, and metastatic medulloblastomas. The comparisons were made between normal cerebellum versus medulloblastomas, and non-metastatic medulloblastomas versus metastatic medulloblastomas.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>The computational algorithms</p>
            </st>
            <p>Our approach has two steps. In STEP 1, we infer pseudo-exons using multiple probe intensities. In STEP 2, we identify pseudo-exons that are differentially expressed between two groups of samples. In STEP 1, for each probe, we first compute the average of the difference in probe intensities between the two groups of samples. Then, based on the similarity of probe intensities and probe adjacency on the gene sequence, we merge probes into clusters that represent one pseudo-exon. In STEP 2, we test if the pseudo-exons are differentially expressed between the two groups of samples. The expression value from a pseudo-exon is treated as an entity in the current analysis, comparable to the gene expression from a complete probe set in customary analyses of gene expression data. The selected pseudo-exons are interpreted as an indication of alternative splicing at this region of the gene between the two comparison groups.</p>
         </sec>
         <sec>
            <st>
               <p>Predicting splice variants between normal cerebellum and medulloblastomas</p>
            </st>
            <p>For illustrative purposes, we applied the above method to predict splice variants between the normal cerebellum and medulloblastoma tumor samples, which included both non-metastatic and metastatic tumors. In STEP 1, using a significance level of 0.05 in the t-test, we identified 10,838 pseudo-exons out of a total of 142,580 (7129 &#215; 20) probes that represent the 7,129 probe sets on the Hu6800 GeneChip<sup>&#174;</sup>. In STEP 2, we compared the difference in expression values between the two groups for each pseudo-exon. The histogram of Z-scores from these tests is shown in Figure <figr fid="F2">2</figr>. With the significance threshold of the Z-score set to 4.8 (equivalent to one false positive error in the discovery), we discovered 811 pseudo-exons, derived from 577 genes, were significantly different between normal cerebellum and medulloblastoma tumor samples. Note that for some genes more than one pseudo-exon was selected.</p>
         </sec>
         <sec>
            <st>
               <p>Predicting splice variants between non-metastatic medulloblastomas and metastatic medulloblastomas</p>
            </st>
            <p>Following the same procedure, we predicted splice variants between the non-metastatic and the metastatic medulloblastomas. We identified 8,319 pseudo-exons, thirteen of which were significantly different between non-metastatic and metastatic medulloblastomas (Table <tblr tid="T1">1</tblr>). Instead of conducting validation in a biological experiment, we searched two genome browsers for supportive evidence for our prediction. We input the thirteen genes in Table <tblr tid="T1">1</tblr> into the Integrated Genome Browser (IGB) from Affymetrix <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> to see whether the probes in the identified pseudo-exons were positioned on separate exons within the same gene, which is a pre-requisite for alternative splicing. For further consistency, we checked whether the predicted pseudo-exons were reported as splice variants in the UCSC Human Genome Browser <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> under the track named "mRNA sequences from GenBank". In the IGB, we found four out of thirteen genes with predicted alternatively spliced pseudo-exons resided on different exons. These four genes were glutaredoxin (GLRX), carboxypeptidase N polypeptide 1 (CPN1), Keratin 7 (KRT7) and killer cell lectin-like receptor subfamily C member 3 (KLRC3). For instance, we predicted the last three probes for GLRX were within one pseudo-exon. In IGB, based on RefSeq information, these three probes are on a different exon. We searched alternatively transcribed variants deposited in GenBank in the "mRNA sequences from GenBank" track in UCSC Human Genome Browser for the genes confirmed by IGB. All of them except for CPN1 have at least two transcript sequences in the GenBank database. At least one of these sequences lack the region that we predicted to be alternatively spliced, and at least one of these sequences contain the predicted region. We also searched PubMed for reported splice variants for the thirteen identified genes. Five of out of the thirteen genes were reported in the literature to have splice variants. They are nitric oxide synthase 1 (NOS1) <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, low density lipoprotein receptor (LDLR) <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, thrombopoietin (THPO) <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, Down syndrome critical region gene 1 (DSCR1) <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, paired box gene 2 (PAX2) <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Alternative spliced genes selected by our method: Comparison of non-metastatic medulloblastomas with metastatic medulloblastomas</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c ca="center">
                        <p>Affymetrix Probe Set ID</p>
                     </c>
                     <c ca="center">
                        <p>Gene Symbol</p>
                     </c>
                     <c ca="center">
                        <p>Number of Affymetrix Probes in the Predicted Pseudo-exon</p>
                     </c>
                     <c ca="center">
                        <p>Nucleotide Positions of Predicted Pseudo-exon in the Gene</p>
                     </c>
                     <c ca="center">
                        <p>Mean Difference</p>
                     </c>
                     <c ca="center">
                        <p>Standard Error</p>
                     </c>
                     <c ca="center">
                        <p>Z-score</p>
                     </c>
                     <c ca="left">
                        <p>Description of the Genes</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>M81882_at</p>
                     </c>
                     <c ca="center">
                        <p>GAD2</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>(2135&#8211;2285)</p>
                     </c>
                     <c ca="center">
                        <p>-1.28</p>
                     </c>
                     <c ca="center">
                        <p>0.20</p>
                     </c>
                     <c ca="center">
                        <p>-6.45</p>
                     </c>
                     <c ca="left">
                        <p>glutamate decarboxylase 2 (pancreatic islets and brain, 65 kDa)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>M13955_at</p>
                     </c>
                     <c ca="center">
                        <p>KRT7</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>(1402&#8211;1474)</p>
                     </c>
                     <c ca="center">
                        <p>-0.63</p>
                     </c>
                     <c ca="center">
                        <p>0.12</p>
                     </c>
                     <c ca="center">
                        <p>-5.23</p>
                     </c>
                     <c ca="left">
                        <p>keratin 7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>U17327_at</p>
                     </c>
                     <c ca="center">
                        <p>NOS1</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>(6805&#8211;7003)</p>
                     </c>
                     <c ca="center">
                        <p>-0.66</p>
                     </c>
                     <c ca="center">
                        <p>0.13</p>
                     </c>
                     <c ca="center">
                        <p>-5.19</p>
                     </c>
                     <c ca="left">
                        <p>nitric oxide synthase 1 (neuronal)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>X14329_at</p>
                     </c>
                     <c ca="center">
                        <p>CPN1</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>(1569&#8211;1665)</p>
                     </c>
                     <c ca="center">
                        <p>-0.62</p>
                     </c>
                     <c ca="center">
                        <p>0.12</p>
                     </c>
                     <c ca="center">
                        <p>-5.18</p>
                     </c>
                     <c ca="left">
                        <p>carboxypeptidase N, polypeptide 1, 50 kD</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>M89470_s_at</p>
                     </c>
                     <c ca="center">
                        <p>PAX2</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>(2855&#8211;2972)</p>
                     </c>
                     <c ca="center">
                        <p>-0.92</p>
                     </c>
                     <c ca="center">
                        <p>0.19</p>
                     </c>
                     <c ca="center">
                        <p>-4.91</p>
                     </c>
                     <c ca="left">
                        <p>paired box gene 2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>L14542_at</p>
                     </c>
                     <c ca="center">
                        <p>KLRC3</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>(916&#8211;1006)</p>
                     </c>
                     <c ca="center">
                        <p>-1.18</p>
                     </c>
                     <c ca="center">
                        <p>0.24</p>
                     </c>
                     <c ca="center">
                        <p>-4.91</p>
                     </c>
                     <c ca="left">
                        <p>killer cell lectin-like receptor subfamily C, member 3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>X76648_at</p>
                     </c>
                     <c ca="center">
                        <p>GLRX</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>(704&#8211;776)</p>
                     </c>
                     <c ca="center">
                        <p>-1.35</p>
                     </c>
                     <c ca="center">
                        <p>0.28</p>
                     </c>
                     <c ca="center">
                        <p>-4.86</p>
                     </c>
                     <c ca="left">
                        <p>glutaredoxin (thioltransferase)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>U82987_at</p>
                     </c>
                     <c ca="center">
                        <p>BBC3</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>(1578&#8211;1638)</p>
                     </c>
                     <c ca="center">
                        <p>2.25</p>
                     </c>
                     <c ca="center">
                        <p>0.32</p>
                     </c>
                     <c ca="center">
                        <p>6.98</p>
                     </c>
                     <c ca="left">
                        <p>BCL2 binding component 3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>U01102_at</p>
                     </c>
                     <c ca="center">
                        <p>SCGB1A1</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(409&#8211;439)</p>
                     </c>
                     <c ca="center">
                        <p>1.42</p>
                     </c>
                     <c ca="center">
                        <p>0.25</p>
                     </c>
                     <c ca="center">
                        <p>5.62</p>
                     </c>
                     <c ca="left">
                        <p>secretoglobin, family 1A, member 1 (uteroglobin)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>M28219_at</p>
                     </c>
                     <c ca="center">
                        <p>LDLR</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>(67&#8211;277)</p>
                     </c>
                     <c ca="center">
                        <p>0.77</p>
                     </c>
                     <c ca="center">
                        <p>0.14</p>
                     </c>
                     <c ca="center">
                        <p>5.42</p>
                     </c>
                     <c ca="left">
                        <p>low density lipoprotein receptor (familial hypercholesterolemia)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>X68194_at</p>
                     </c>
                     <c ca="center">
                        <p>SYPL</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>(1915&#8211;2089)</p>
                     </c>
                     <c ca="center">
                        <p>1.67</p>
                     </c>
                     <c ca="center">
                        <p>0.31</p>
                     </c>
                     <c ca="center">
                        <p>5.42</p>
                     </c>
                     <c ca="left">
                        <p>synaptophysin-like protein</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>U85267_at</p>
                     </c>
                     <c ca="center">
                        <p>DSCR1</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>(64&#8211;169)</p>
                     </c>
                     <c ca="center">
                        <p>1.20</p>
                     </c>
                     <c ca="center">
                        <p>0.24</p>
                     </c>
                     <c ca="center">
                        <p>5.08</p>
                     </c>
                     <c ca="left">
                        <p>Down syndrome critical region gene 1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>L36051_at</p>
                     </c>
                     <c ca="center">
                        <p>THPO</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>(1647&#8211;1809)</p>
                     </c>
                     <c ca="center">
                        <p>1.05</p>
                     </c>
                     <c ca="center">
                        <p>0.21</p>
                     </c>
                     <c ca="center">
                        <p>4.96</p>
                     </c>
                     <c ca="left">
                        <p>thrombopoietin (myeloproliferative leukemia virus oncogene ligand, megakaryocyte growth and development factor)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Number of Affymetrix Probes in the Predicted Pseudo-exon: number of probes that are contained in a predicted alternatively spliced pseudo-exon. Nucleotide Positions of Predicted Pseudo-exon in the Gene: nucleotide positions of the pseudo-exon from the beginning of the gene it resides. Mean difference: Mean difference of the expression values between the two tissue types being compared for each predicted pseudo-exon in the t-test in STEP 2. Standard Error: the standard error calculated in the same t-test. Z-score: the ratio of mean difference over standard error (noise), a measure of significance of the difference between the two tissues being compared. The sign of the Z-scores indicate direction of the difference. A negative Z-score means a lower expression in metastatic medulloblastomas than in non-metastatic medulloblastomas, and vice-versa for a positive Z-score.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Comparison with Hu et al's approach</p>
            </st>
            <p>To compare our method with the Hu et al's, we implemented their method and applied it to our dataset. When comparing normal cerebellum and medulloblastomas samples using Hu et al's method, we inferred 31 alternatively spliced genes with the selection criterion used by Hu et al in their paper (Table <tblr tid="T2">2</tblr>). Among these 31 genes, seven overlapped with the findings from our approach (Table <tblr tid="T3">3</tblr>). For four of them, D87119_at, U14971_at, U29953_rna1_at, X04828_at, the locations of the alternative splicing were consistent between the two methods. In the comparison between non-metastatic and metastatic medulloblastoma samples, we did not find any gene that was alternatively spliced by Hu et al's method. We also investigated the effect of different selection criteria in Hu et al's method (i.e. the R threshold, which is the ratio of the probe intensity in a tissue over the mean of the probe intensities in the remaining nine tissue types for the same probe). Table <tblr tid="T4">4</tblr> shows the relation between the 577 genes predicted by our approach and the genes selected with different R thresholds in Hu's approach. Numbers of predicted alternatively spliced genes increase as smaller R values (less stringent) are used.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Alternative spliced genes inferred by applying Hu's method to our dataset: Comparison of normal cerebellum with medulloblastoma samples</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="center">
                        <p>Affy Probe Set ID</p>
                     </c>
                     <c ca="left">
                        <p>Gene Symbol</p>
                     </c>
                     <c ca="center">
                        <p>Number of Affymetrix Probes in the Predicted Pseudo-exon</p>
                     </c>
                     <c ca="center">
                        <p>Nucleotide Positions of Predicted Pseudo-exon in the Gene</p>
                     </c>
                     <c ca="left">
                        <p>Description of the Genes</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>X51362_s_at</p>
                     </c>
                     <c ca="left">
                        <p>DRD2</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(2541&#8211;2574)</p>
                     </c>
                     <c ca="left">
                        <p>dopamine receptor D2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>M15517_cds5_at</p>
                     </c>
                     <c ca="left">
                        <p>TTR</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>(155&#8211;197)</p>
                     </c>
                     <c ca="left">
                        <p>transthyretin (prealbumin, amyloidosis type I)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Y10141_s_at</p>
                     </c>
                     <c ca="left">
                        <p>SLC6A3</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(96&#8211;125)</p>
                     </c>
                     <c ca="left">
                        <p>solute carrier family 6 (neurotransmitter transporter, dopamine), member 3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Z14982_rna1_at</p>
                     </c>
                     <c ca="left">
                        <p>PSMB8</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(820&#8211;850)</p>
                     </c>
                     <c ca="left">
                        <p>proteasome (prosome, macropain) subunit, beta type, 8 (large multifunctional protease 7)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>X69654_at</p>
                     </c>
                     <c ca="left">
                        <p>RPS26</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(9&#8211;35)</p>
                     </c>
                     <c ca="left">
                        <p>ribosomal protein S26</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>U63842_at</p>
                     </c>
                     <c ca="left">
                        <p>NEUROG1</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(834&#8211;891)</p>
                     </c>
                     <c ca="left">
                        <p>neurogenin 1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>M97815_at</p>
                     </c>
                     <c ca="left">
                        <p>CRABP2</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(524&#8211;554)</p>
                     </c>
                     <c ca="left">
                        <p>cellular retinoic acid binding protein 2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>D00017_at</p>
                     </c>
                     <c ca="left">
                        <p>ANXA2</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(1229&#8211;1265)</p>
                     </c>
                     <c ca="left">
                        <p>annexin A2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>U13021_s_at</p>
                     </c>
                     <c ca="left">
                        <p>CASP2</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>(844&#8211;913)</p>
                     </c>
                     <c ca="left">
                        <p>caspase 2, apoptosis-related cysteine protease (neural precursor cell expressed, developmentally down-regulated 2)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>U30999_at</p>
                     </c>
                     <c ca="left">
                        <p>ALCAM</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(373&#8211;403)</p>
                     </c>
                     <c ca="left">
                        <p>activated leukocyte cell adhesion molecule</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>X04828_at</p>
                     </c>
                     <c ca="left">
                        <p>GNAI2</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>(1668&#8211;1701)</p>
                     </c>
                     <c ca="left">
                        <p>guanine nucleotide binding protein (G protein), alpha inhibiting activity polypeptide 2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>U14971_at</p>
                     </c>
                     <c ca="left">
                        <p>RPS9</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(319&#8211;373)</p>
                     </c>
                     <c ca="left">
                        <p>ribosomal protein S9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>U79299_at</p>
                     </c>
                     <c ca="left">
                        <p>OLFM1</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(1342&#8211;1372)</p>
                     </c>
                     <c ca="left">
                        <p>olfactomedin 1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>L20298_at</p>
                     </c>
                     <c ca="left">
                        <p>CBFB</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(2298&#8211;2334)</p>
                     </c>
                     <c ca="left">
                        <p>core-binding factor, beta subunit</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>X93017_at</p>
                     </c>
                     <c ca="left">
                        <p>SLC8A3</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(1725&#8211;1821)</p>
                     </c>
                     <c ca="left">
                        <p>solute carrier family 8 (sodium-calcium exchanger), member 3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>M17886_at</p>
                     </c>
                     <c ca="left">
                        <p>RPLP1</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(127&#8211;163)</p>
                     </c>
                     <c ca="left">
                        <p>ribosomal protein, large, P1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>D16480_at</p>
                     </c>
                     <c ca="left">
                        <p>HADHA</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(2335&#8211;2365)</p>
                     </c>
                     <c ca="left">
                        <p>hydroxyacyl-Coenzyme A dehydrogenase/3-ketoacyl-Coenzyme A thiolase/enoyl-Coenzyme A hydratase (trifunctional protein), alpha subunit</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>D38305_at</p>
                     </c>
                     <c ca="left">
                        <p>TOB1</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(707&#8211;749)</p>
                     </c>
                     <c ca="left">
                        <p>transducer of ERBB2, 1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>U32519_at</p>
                     </c>
                     <c ca="left">
                        <p>G3BP</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(1534&#8211;1564)</p>
                     </c>
                     <c ca="left">
                        <p>Ras-GTPase-activating protein SH3-domain-binding protein</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>U07919_at</p>
                     </c>
                     <c ca="left">
                        <p>ALDH1A3</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>(3363&#8211;3411)</p>
                     </c>
                     <c ca="left">
                        <p>aldehyde dehydrogenase 1 family, member A3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>U29953_rna1_at</p>
                     </c>
                     <c ca="left">
                        <p>SERPINF1</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(1288&#8211;1324)</p>
                     </c>
                     <c ca="left">
                        <p>serine (or cysteine) proteinase inhibitor, clade F (alpha-2 antiplasmin, pigment epithelium derived factor), member 1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>D55716_at</p>
                     </c>
                     <c ca="left">
                        <p>MCM7</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(2288&#8211;2396)</p>
                     </c>
                     <c ca="left">
                        <p>MCM7 minichromosome maintenance deficient 7 (S. cerevisiae)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>J05448_at</p>
                     </c>
                     <c ca="left">
                        <p>POLR2C</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(1575&#8211;1605)</p>
                     </c>
                     <c ca="left">
                        <p>polymerase (RNA) II (DNA directed) polypeptide C, 33 kDa</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>U46570_at</p>
                     </c>
                     <c ca="left">
                        <p>TTC1</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(1226&#8211;1262)</p>
                     </c>
                     <c ca="left">
                        <p>tetratricopeptide repeat domain 1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>D87119_at</p>
                     </c>
                     <c ca="left">
                        <p>TRB2</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(4022&#8211;4136)</p>
                     </c>
                     <c ca="left">
                        <p>tribbles homolog 2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>X69910_at</p>
                     </c>
                     <c ca="left">
                        <p>CKAP4</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(2543&#8211;2573)</p>
                     </c>
                     <c ca="left">
                        <p>cytoskeleton-associated protein 4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>U50078_at</p>
                     </c>
                     <c ca="left">
                        <p>HERC1</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(14885&#8211;14915)</p>
                     </c>
                     <c ca="left">
                        <p>hect (homologous to the E6-AP (UBE3A) carboxyl terminus) domain and RCC1 (CHC1)-like domain (RLD) 1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>J04164_at</p>
                     </c>
                     <c ca="left">
                        <p>IFITM1</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(798&#8211;828)</p>
                     </c>
                     <c ca="left">
                        <p>interferon induced transmembrane protein 1 (9&#8211;27)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>AFFX-HUMRGE/M10098_3_at</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(1562&#8211;1613)</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>HG2788-HT2896_at</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(N/A-N/A)</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>HG2994-HT4850_s_at</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>(N/A-N/A)</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Number of Affymetrix Probes in the Predicted Pseudo-exon: number of probes that are contained in a predicted alternatively spliced pseudo-exon. Nucleotide Positions of Predicted Pseudo-exon in the Gene: nucleotide positions of the pseudo-exon from the beginning of the gene it resides. Mean difference: Mean difference of the expression values between the two tissue types being compared for each predicted pseudo-exon in the t-test in STEP 2. Standard Error: the standard error calculated in the same t-test. Z-score: the ratio of mean difference over standard error (noise), a measure of significance of the difference between the two tissues being compared. The sign of the Z-scores indicate direction of the difference. A negative Z-score means a lower expression in metastatic medulloblastomas than in non-metastatic medulloblastomas, and vice-versa for a positive Z-score.</p>
               </tblfn>
            </tbl>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Overlapping of the predicted gene from our method and Hu's method for the comparison of normal cerebellum and medulloblastoma samples</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="left">
                        <p>Affy Probe Set ID</p>
                     </c>
                     <c ca="left">
                        <p>Gene Symbol</p>
                     </c>
                     <c cspan="2" ca="left">
                        <p>Number of Affymetrix Probes in the Predicted Pseudo-exon</p>
                     </c>
                     <c cspan="2" ca="left">
                        <p>Nucleotide Positions of Predicted Pseudo-exon in the Gene</p>
                     </c>
                     <c ca="left">
                        <p>Descriptions of the Genes</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Ours</p>
                     </c>
                     <c ca="left">
                        <p>Hu's</p>
                     </c>
                     <c ca="left">
                        <p>Ours</p>
                     </c>
                     <c ca="left">
                        <p>Hu's</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>X04828_at*</p>
                     </c>
                     <c ca="left">
                        <p>GNAI2</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>(1668&#8211;1701)</p>
                     </c>
                     <c ca="left">
                        <p>(1668&#8211;1701)</p>
                     </c>
                     <c ca="left">
                        <p>guanine nucleotide binding protein (G protein), alpha inhibiting activity polypeptide 2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>U14971_at*</p>
                     </c>
                     <c ca="left">
                        <p>RPS9</p>
                     </c>
                     <c ca="left">
                        <p>19</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>(103&#8211;685)</p>
                     </c>
                     <c ca="left">
                        <p>(319&#8211;373)</p>
                     </c>
                     <c ca="left">
                        <p>ribosomal protein S9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>U29953_rna1_at*</p>
                     </c>
                     <c ca="left">
                        <p>SERPINF1</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>(1288&#8211;1492)</p>
                     </c>
                     <c ca="left">
                        <p>(1288&#8211;1324)</p>
                     </c>
                     <c ca="left">
                        <p>serine (or cysteine) proteinase inhibitor, clade F (alpha-2 antiplasmin, pigment epithelium derived factor), member 1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>D87119_at*</p>
                     </c>
                     <c ca="left">
                        <p>TRB2</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>(3824&#8211;4184)</p>
                     </c>
                     <c ca="left">
                        <p>(4022&#8211;4136)</p>
                     </c>
                     <c ca="left">
                        <p>tribbles homolog 2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>X69910_at</p>
                     </c>
                     <c ca="left">
                        <p>CKAP4</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>(2789&#8211;2891)</p>
                     </c>
                     <c ca="left">
                        <p>(2543&#8211;2573)</p>
                     </c>
                     <c ca="left">
                        <p>cytoskeleton-associated protein 4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>U30999_at</p>
                     </c>
                     <c ca="left">
                        <p>ALCAM</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>(25&#8211;337)</p>
                     </c>
                     <c ca="left">
                        <p>(373&#8211;403)</p>
                     </c>
                     <c ca="left">
                        <p>activated leukocyte cell adhesion molecule</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>D55716_at</p>
                     </c>
                     <c ca="left">
                        <p>MCM7</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>(1952&#8211;2096)</p>
                     </c>
                     <c ca="left">
                        <p>(2288&#8211;2396)</p>
                     </c>
                     <c ca="left">
                        <p>MCM7 minichromosome maintenance deficient 7 (S. cerevisiae)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>* Consistent alternative splice sites between two methods.</p>
               </tblfn>
            </tbl>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Comparison of the results from our approach and those from Hu's using different R thresholds when normal cerebellum samples are compared with medulloblastomas</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>R used</p>
                     </c>
                     <c ca="left">
                        <p>Number of Genes Found in Hu's Approach</p>
                     </c>
                     <c ca="left">
                        <p>Number of Overlap Between Hu's and Our 577 Genes</p>
                     </c>
                     <c ca="left">
                        <p>Percentage of the overlapping genes based on number of genes found in Hu's method</p>
                     </c>
                     <c ca="left">
                        <p>Percentage of the overlapping genes based on our 577 selected genes</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>324</p>
                     </c>
                     <c ca="left">
                        <p>69</p>
                     </c>
                     <c ca="left">
                        <p>21%</p>
                     </c>
                     <c ca="left">
                        <p>11.9%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>103</p>
                     </c>
                     <c ca="left">
                        <p>28</p>
                     </c>
                     <c ca="left">
                        <p>27%</p>
                     </c>
                     <c ca="left">
                        <p>4.9%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>53</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="left">
                        <p>26%</p>
                     </c>
                     <c ca="left">
                        <p>2.4%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>31</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>23%</p>
                     </c>
                     <c ca="left">
                        <p>1.2%</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Genes found in Hu's methods using different R thresholds are compared to each other. Larger R value represents more stringent selection criterion. Genes found using smaller R values always include those found using larger R values, i.e. gene list of 324 genes contains gene list of 103 genes, etc. Genes obtained from Hu's method are also compared with 577 genes from our approach. Numbers of overlapping genes are presented in the third column for different R values. Similarly, overlapping genes for the smaller R values contains those for the larger R values, i.e. gene list of 69 genes contains gene list of 28 genes, etc.</p>
               </tblfn>
            </tbl>
            <p>We checked both IGB and UCSC Human Genome Browsers for supportive evidence for the seven predicted alternatively spliced variants in Table <tblr tid="T3">3</tblr>. We found four genes that had predicted pseudo-exons located on separate exons according to IGB and alternative spliced mRNA from GenBank in UCSC Human Genome Browser. They are guanine nucleotide binding protein alpha inhibiting activity polypeptide 2 (GNAI2), ribosomal protein S9 (RPS9), activated leukocyte cell adhesion molecule (ALCAM), and minichromosome maintenance deficient 7 (MCM7). There are splicing variants reported in PubMed literature for ALCAM <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>We have developed a two-step approach to predict splice variants between two groups of samples using GeneChip<sup>&#174; </sup>gene expression array data. We illustrated the method using empirical data from normal cerebellum, metastatic medulloblastoma and non-metastatic medulloblastoma samples. We predicted a total of 577 alternatively spliced genes when we compared normal cerebellum with medulloblastomas tumor samples and thirteen alternatively spliced genes when we compared non-metastatic medulloblastomas with metastatic medulloblastomas. A comparison of the results from our approach and the method described by Hu et al on the same dataset revealed some overlapping alternatively spliced genes.</p>
         <p>Our proposed method can be used to predict splice variants and takes advantage of the extensive repositories of gene expression array data. Inferred splice variants can be used to generate alternative splicing hypotheses for subsequent experimental validation. Higher signal quality in the newer generation GeneChip<sup>&#174;</sup>, such as U133 Plus 2.0 array, should make our predictions more robust. Recently, a genome-wide human exon array became available from Affymetrix <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> to detect known alternative splicing in a biological sample. Bypassing the need for defining "pseudo-exons" in the STEP 1 of our approach, one can directly use STEP 2 of our method to predict splice variants. As expected, such an exon array coupled with our rigorous statistical method may improve the power to predict more splice variants.</p>
         <p>There are some limitations associated with using GeneChip<sup>&#174; </sup>gene expression array data to detect alternatively spliced variants. Currently, GeneChip<sup>&#174; </sup>probes cover 600 base pairs in sequence from the 3' end. Thus we can only detect splice variants at the 3' end. Furthermore, some 3' end splice variants could be due to alternative polyadenylation sites, and our method does not differentiate between these in the analysis. The splice variants resulting from the 3' non-translational region could be removed by checking whether the predicted pseudo-exons on the 3' end are located in translational regions.</p>
         <p>Since our approach depends on probe intensities to cluster probes into pseudo-exons within a single gene, non-specific hybridization in an expression array could complicate this step (STEP 1), thus result in both false positive and false negative findings. Cross-hybridization can be partially addressed by excluding lower grade probe sets, such as probe sets with the suffix _s or _x, which could hybridize to multiple genes either before analysis or from the gene list after analysis.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>In this paper we describe a method that can generate hypotheses of alternative splicing for further investigation. Our approach overcomes two limitations of a previously proposed method <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>: 1) we use t-tests instead of fold changes, 2) we can predict splicing variants between two groups of samples. These differences make our inference more robust and not dependent on multiple tissue types to stabilize the inference.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Dataset</p>
            </st>
            <p>Our empirical dataset consists of GeneChip<sup>&#174; </sup>Hu6800 expression array data from sixty-nine medulloblastoma samples and four cerebellum samples as normal controls. Among the medulloblastoma samples, forty-two are from non-metastatic tumors and twenty-seven are from metastatic tumors. There are 7,129 probe sets in the Hu6800 expression array, and twenty probes in each probe set.</p>
         </sec>
         <sec>
            <st>
               <p>Inferring pseudo-exons within a gene (STEP 1)</p>
            </st>
            <p>In this step, we merge probes within a gene into clusters that represent pseudo-exons. First, we compute the difference in probe hybridization intensity between two groups of samples for each probe. Then, for each gene, we merge probes into clusters based on the similarity of the differences in probe intensity (between the two groups of samples) and the probe adjacency on the genome sequence. For a gene, let <b><it>Y</it></b><sub>(<it>i</it>, 1)</sub>, <graphic file="1742-4682-3-19-i1.gif"/> and <it>n</it><sub>1 </sub>be the probe intensity for the <it>i</it>th probe in sample group 1, variance, and sample size, respectively. Similarly, <b><it>Y</it></b><sub>(<it>i</it>, 2)</sub>, <graphic file="1742-4682-3-19-i2.gif"/> and <it>n</it><sub>2 </sub>are for the sample group 2. Within the gene, the index <it>i </it>increases from the direction of the 5' end to the 3' end. We start with the first probe from the 5' end and compute:</p>
            <p>
               <graphic file="1742-4682-3-19-i3.gif"/>
            </p>
            <p>
               <graphic file="1742-4682-3-19-i4.gif"/>
            </p>
            <p>where <graphic file="1742-4682-3-19-i5.gif"/> is the mean of probe intensities. If the absolute value of <b><it>t</it></b><sub>i </sub>does not exceed the threshold value at the significance level <it>&#945; </it>= 0.05, we merge the <it>i</it>th probe with the (<it>i</it>+1)th probe to generate a pseudo-exon. The resulting pseudo-exon becomes the new <it>i</it>th probe in the next iteration of the t-test. The pseudo-exon extends with each iteration until the results of the t-test become significant or reach the last probe within a probe set. If <b><it>t</it></b><sub><it>i </it></sub>exceeds the significance threshold value, we do not merge the <it>i</it>th probe with the (<it>i</it>+1)th probe, but start generating a new pseudo-exon from this (<it>i</it>+1)th probe, using the same iteration procedure. After we finish the last probe at the 3' end, we may either have several pseudo-exons or only one pseudo-exon (i.e. the entire probe set) if every t-statistic within a probe set is not significant.</p>
         </sec>
         <sec>
            <st>
               <p>Testing for statistical significance (STEP 2)</p>
            </st>
            <p>For each pseudo-exon, we determine whether there is a difference in hybridization intensity between the two groups x<sub>1 </sub>and x<sub>2</sub>. Our null hypothesis is that, for any pseudo-exon, the difference in probe intensity between the two groups is zero. If we reject the null hypothesis for a pseudo-exon, meaning that the hybridization intensities between the two groups are significant different for that pseudo-exon, we then infer that there is a splice variant between the two groups of samples for the corresponding region within the gene.</p>
            <p>In the same vein as Li and Wong's model to analyze gene expression at the probe level <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, we propose a multiplicative heterogeneity factor model to associate the probe intensities of a pseudo-exon directly with the covariate, i.e. group indictor x<sub><it>k</it></sub>:</p>
            <p>
               <graphic file="1742-4682-3-19-i6.gif"/>
            </p>
            <p>where <it>Y</it><sub><it>jik </it></sub>is the hybridization intensity for the <it>i</it>th probe in the <it>j</it>th pseudo-exon in the <it>k</it>th sample, N is the number of probes in the <it>j</it>th pseudo-exon, <it>&#948;</it><sub><it>k </it></sub>and <it>&#955;</it><sub><it>k </it></sub>are heterogeneity factors for normalization,<it>x</it><sub><it>k </it></sub>is the group indicator for the <it>k</it>th sample, <it>&#946;</it><sub><it>j </it></sub>is the coefficient for <it>j</it>th pseudo-exon, <it>&#966;</it><sub><it>ji </it></sub>is the multiplicative probe-specific parameter for <it>i</it>th probe in <it>j</it>th pseudo-exon, and <it>&#958; </it>is random variation term. To avoid making any distributional assumptions, we applied estimating equation techniques to estimate the coefficients and their standard errors for making statistical inferences <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>IGB: Integrated Genome Browser; UCSC: University of California, Santa Cruz</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The author(s) declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>WF performed the data analysis, drafted the manuscript and developed method jointly with LPZ. NK revised the manuscript. ARH and JMO conceived the study. LPZ conceived the study and developed the method jointly with WF. All authors read and approved the final manuscript.</p>
         <suppl id="S1">
            <title>
               <p>Additional File 1</p>
            </title>
            <text>
               <p>Alternative spliced pseudo-exons selected by our method: Comparison of normal cerebellum with medulloblastomas. Complete results for the 811 pseudo-exons predicted to be alternatively spliced between normal cerebellum and medulloblastomas.</p>
            </text>
            <file name="1742-4682-3-19-S1.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Histogram of the Z-scores for all 10,838 pseudo-exons obtained in the comparison of normal cerebellum samples with medulloblastomas</p>
            </caption>
            <text>
               <p>Histogram of the Z-scores for all 10,838 pseudo-exons obtained in the comparison of normal cerebellum samples with medulloblastomas.</p>
            </text>
            <graphic file="1742-4682-3-19-2"/>
         </fig>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors thank Harvard and MIT researchers for allowing us to use their microarray data for this paper. This work was supported by grants from the National Institutes of Health.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Stage-specific changes in SR splicing factors and alternative splicing in mammary tumorigenesis</p>
            </title>
            <aug>
               <au>
                  <snm>Stickeler</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Kittrell</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Medina</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Berget</snm>
                  <fnm>SM</fnm>
               </au>
            </aug>
            <source>Oncogene</source>
            <pubdate>1999</pubdate>
            <volume>18</volume>
            <fpage>3574</fpage>
            <lpage>82</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">10380879</pubid>
                  <pubid idtype="doi">10.1038/sj.onc.1202671</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Cloning of a gene (SR-A1), encoding for a new member of the human Ser/Arg-rich family of pre-mRNA splicing factors: overexpression in aggressive ovarian cancer</p>
            </title>
            <aug>
               <au>
                  <snm>Scorilas</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kyriakopoulou</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Katsaros</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Diamandis</snm>
                  <fnm>EP</fnm>
               </au>
            </aug>
            <source>Br J Cancer</source>
            <pubdate>2001</pubdate>
            <volume>85</volume>
            <fpage>190</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1054/bjoc.2001.1885</pubid>
                  <pubid idtype="pmpid" link="fulltext">11461075</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The mutational spectrum of single base-pair substitutions in mRNA splice junctions of human genes: causes and consequences</p>
            </title>
            <aug>
               <au>
                  <snm>Krawczak</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Reiss</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>DN</fnm>
               </au>
            </aug>
            <source>Hum Genet</source>
            <pubdate>1992</pubdate>
            <volume>90</volume>
            <fpage>41</fpage>
            <lpage>54</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00210743</pubid>
                  <pubid idtype="pmpid">1427786</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Pre-mRNA splicing and human disease</p>
            </title>
            <aug>
               <au>
                  <snm>Faustino</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>TA</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2003</pubdate>
            <volume>17</volume>
            <fpage>419</fpage>
            <lpage>37</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gad.1048803</pubid>
                  <pubid idtype="pmpid" link="fulltext">12600935</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Predicting splice variant from DNA chip expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Hu</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Madore</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Moldover</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Jatkoe</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Balaban</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>1237</fpage>
            <lpage>45</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">311070</pubid>
                  <pubid idtype="pmpid" link="fulltext">11435406</pubid>
                  <pubid idtype="doi">10.1101/gr.165501</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>PathwayAssist</p>
            </title>
            <url>http://www.ariadnegenomics.com/products/pathway.html</url>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Affymetrix</p>
            </title>
            <url>http://www.affymetrix.com</url>
         </bibl>
         <bibl id="B8">
            <title>
               <p>UCSC Human Genome Browser</p>
            </title>
            <url>http://genome.ucsc.edu</url>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Neuronal NOS: gene structure, mRNA diversity, and functional relevance</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Newton</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Marsden</snm>
                  <fnm>PA</fnm>
               </au>
            </aug>
            <source>Crit Rev Neurobiol</source>
            <pubdate>1999</pubdate>
            <volume>13</volume>
            <fpage>21</fpage>
            <lpage>43</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10223522</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Exon/intron organization, chromosome localization, alternative splicing, and transcription units of the human apolipoprotein E receptor 2 gene</p>
            </title>
            <aug>
               <au>
                  <snm>Kim</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Magoori</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Inoue</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Mao</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fujita</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Endo</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Saeki</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yamamoto</snm>
                  <fnm>TT</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1997</pubdate>
            <volume>272</volume>
            <issue>13</issue>
            <fpage>8498</fpage>
            <lpage>504</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.272.13.8498</pubid>
                  <pubid idtype="pmpid" link="fulltext">9079678</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Genomic structure, chromosomal localization, and conserved alternative splice forms of thrombopoietin</p>
            </title>
            <aug>
               <au>
                  <snm>Gurney</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Kuang</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Malloy</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Eaton</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>de Sauvage</snm>
                  <fnm>FJ</fnm>
               </au>
            </aug>
            <source>Blood</source>
            <pubdate>1995</pubdate>
            <volume>85</volume>
            <fpage>981</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7849319</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Genomic organization, alternative splicing, and expression patterns of the DSCR1 (Down syndrome candidate region 1) gene</p>
            </title>
            <aug>
               <au>
                  <snm>Fuentes</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Pritchard</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Estivill</snm>
                  <fnm>X</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>1997</pubdate>
            <volume>44</volume>
            <fpage>358</fpage>
            <lpage>61</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/geno.1997.4866</pubid>
                  <pubid idtype="pmpid" link="fulltext">9325060</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Alternative splicing in PAX2 generates a new reading frame and an extended conserved coding region at the carboxy terminus</p>
            </title>
            <aug>
               <au>
                  <snm>Tavassoli</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ruger</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Horst</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Hum Genet</source>
            <pubdate>1997</pubdate>
            <volume>279</volume>
            <fpage>371</fpage>
            <lpage>5</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/s004390050644</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Molecular isolation and characterization of a soluble isoform of activated leukocyte cell adhesion molecule that modulates endothelial cell function</p>
            </title>
            <aug>
               <au>
                  <snm>Ikeda</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Quertermous</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2004</pubdate>
            <volume>279</volume>
            <fpage>55315</fpage>
            <lpage>23</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M407776200</pubid>
                  <pubid idtype="pmpid" link="fulltext">15496415</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>31</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">14539</pubid>
                  <pubid idtype="pmpid" link="fulltext">11134512</pubid>
                  <pubid idtype="doi">10.1073/pnas.011404098</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Statistical modeling of large microarray data sets to identify stimulus-response profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Zhao</snm>
                  <fnm>LP</fnm>
               </au>
               <au>
                  <snm>Prentice</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Breeden</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>5631</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">33264</pubid>
                  <pubid idtype="pmpid" link="fulltext">11344303</pubid>
                  <pubid idtype="doi">10.1073/pnas.101013198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Longitudinal data analysis using generalized linear models</p>
            </title>
            <aug>
               <au>
                  <snm>Liang</snm>
                  <fnm>KY</fnm>
               </au>
               <au>
                  <snm>Zeger</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Biometrika</source>
            <pubdate>1986</pubdate>
            <volume>73</volume>
            <fpage>13</fpage>
            <lpage>22</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2336267</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Estimating equations for parameters in means and covariances of multivariate discrete and continuous responses</p>
            </title>
            <aug>
               <au>
                  <snm>Prentice</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>LP</fnm>
               </au>
            </aug>
            <source>Biometrics</source>
            <pubdate>1991</pubdate>
            <volume>47</volume>
            <fpage>825</fpage>
            <lpage>39</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2307/2532642</pubid>
                  <pubid idtype="pmpid">1742441</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>A Class of Models for Analyzing GeneChip <sup>&#174; </sup>Gene Expression Analysis Array Data</p>
            </title>
            <aug>
               <au>
                  <snm>Fan</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Pritchard</snm>
                  <fnm>JI</fnm>
               </au>
               <au>
                  <snm>Olson</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Khalid</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>LP</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>16</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">553974</pubid>
                  <pubid idtype="pmpid" link="fulltext">15710039</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-6-16</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
