<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-10-131</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>SNEP: Simultaneous detection of nucleotide and expression polymorphisms using Affymetrix GeneChip</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Fujisawa</snm>
               <fnm>Hironori</fnm>
               <insr iid="I1"/>
               <insr iid="I4"/>
               <email>fujisawa@ism.ac.jp</email>
            </au>
            <au id="A2">
               <snm>Horiuchi</snm>
               <fnm>Youko</fnm>
               <insr iid="I2"/>
               <insr iid="I4"/>
               <email>yhoriuch@lab.nig.ac.jp</email>
            </au>
            <au id="A3">
               <snm>Harushima</snm>
               <fnm>Yoshiaki</fnm>
               <insr iid="I2"/>
               <insr iid="I4"/>
               <email>yharushi@lab.nig.ac.jp</email>
            </au>
            <au id="A4">
               <snm>Takada</snm>
               <fnm>Toyoyuki</fnm>
               <insr iid="I3"/>
               <insr iid="I4"/>
               <email>ttakada@lab.nig.ac.jp</email>
            </au>
            <au id="A5">
               <snm>Eguchi</snm>
               <fnm>Shinto</fnm>
               <insr iid="I1"/>
               <insr iid="I4"/>
               <email>eguchi@ism.ac.jp</email>
            </au>
            <au id="A6">
               <snm>Mochizuki</snm>
               <fnm>Takako</fnm>
               <insr iid="I2"/>
               <email>tmochidu@lab.nig.ac.jp</email>
            </au>
            <au id="A7">
               <snm>Sakaguchi</snm>
               <fnm>Takayuki</fnm>
               <insr iid="I1"/>
               <insr iid="I4"/>
               <email>t-saka@ism.ac.jp</email>
            </au>
            <au id="A8">
               <snm>Shiroishi</snm>
               <fnm>Toshihiko</fnm>
               <insr iid="I3"/>
               <insr iid="I4"/>
               <email>tshirois@lab.nig.ac.jp</email>
            </au>
            <au id="A9">
               <snm>Kurata</snm>
               <fnm>Nori</fnm>
               <insr iid="I2"/>
               <insr iid="I4"/>
               <email>nkurata@lab.nig.ac.jp</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>The Institute of Statistical Mathematics, Tokyo 106-8569, Japan</p>
            </ins>
            <ins id="I2">
               <p>Plant Genetics Laboratory, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan</p>
            </ins>
            <ins id="I3">
               <p>Mammalian Genetics Laboratory, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan</p>
            </ins>
            <ins id="I4">
               <p>Transdisciplinary Research Integration Center, Research Organization of Information and Systems, Tokyo 105-0001, Japan</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>1</issue>
         <fpage>131</fpage>
         <url>http://www.biomedcentral.com/1471-2105/10/131</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19419536</pubid>
               <pubid idtype="doi">10.1186/1471-2105-10-131</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>16</day>
               <month>12</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>06</day>
               <month>5</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>06</day>
               <month>5</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Fujisawa et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>High-density short oligonucleotide microarrays are useful tools for studying biodiversity, because they can be used to investigate both nucleotide and expression polymorphisms. However, when different strains (or species) produce different signal intensities after mRNA hybridization, it is not easy to determine whether the signal intensities were affected by nucleotide or expression polymorphisms. To overcome this difficulty, nucleotide and expression polymorphisms are currently examined separately.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We have developed SNEP, a new method that allows simultaneous detection of both nucleotide and expression polymorphisms. SNEP involves a robust statistical procedure based on the idea that a nucleotide polymorphism observed at the probe level can be regarded as an outlier, because the nucleotide polymorphism can reduce the hybridization signal intensity. To investigate the performance of SNEP, we used three species: barley, rice and mice. In addition to the publicly available barley data, we obtained new rice and mouse data from the strains with available genome sequences. The sensitivity and false positive rate of nucleotide polymorphism detection were estimated based on the sequence information. The robustness of expression polymorphism detection against nucleotide polymorphisms was also investigated.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>SNEP performed well regardless of the genome size and showed a better performance for nucleotide polymorphism detection, when compared with other previously proposed methods. The R-software 'SNEP' is available at <url>http://www.ism.ac.jp/~fujisawa/SNEP/</url>.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Affymetrix GeneChip expression arrays are high-density short oligonucleotide microarrays that were initially designed to monitor genome-wide expression profiles <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Affymetrix probe sets consist of several (typically 11) 25-mer short oligomer probes matching each gene [perfect match (PM) probes] and accompanying probes with single complementary substitutions in the 13th base of each PM probe [mismatch (MM) probes]. Signal intensities for the probes are obtained by hybridizing labeled genomic DNA (gDNA) or mRNA to the expression array. Recently, nucleotide polymorphisms have been detected with these probes by hybridizing gDNA from human malaria parasite <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, yeast <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, malaria mosquito <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, <it>Arabidopsis </it><abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>, and rice <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, and by hybridizing mRNA from yeast <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, <it>Arabidopsis </it><abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, barley <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>, maize <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, and mammals <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. A nucleotide polymorphism observed at a probe level was called a single feature polymorphism (SFP) by Borevitz <it>et al. </it><abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. An expression polymorphism was defined as a difference in gene expression levels between strains (or species), which can be used as a gene expression marker <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Because an expression array can be used for detecting both expression and nucleotide polymorphisms, the expression array has the potential to be a powerful tool for identifying functional variants that are associated with morphological, physiological, and/or ecological diversity within and between strains (or species).</p>
         <p>In contrast, when different strains (or species) produce different signal intensities after mRNA hybridization, it is not easy to determine whether the signal intensities are affected by nucleotide or expression polymorphisms. Thus, it has been noted that caution should be used when evaluating gene expression levels in cross-strain (or cross-species) hybridization using expression arrays <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. To overcome this difficulty, nucleotide and expression polymorphisms are currently examined separately. In this paper, we simultaneously examine these two types of polymorphism to effectively detect them.</p>
         <p>Ideally, we assume that the signal intensity ratios for two strains are almost the same on all the probes when no SFP probes are present in a probe set. This was similarly adopted in Ronald <it>et al. </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, Cui <it>et al. </it><abbrgrp><abbr bid="B11">11</abbr></abbrgrp> and Luo <it>et al. </it><abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Here, we suppose that this assumption may not hold for SFP probes because a nucleotide polymorphism can reduce hybridization signal intensity. Therefore, the signal intensities from the SFP probes may be regarded as outliers. Based on these premises, we have constructed a statistical model and then detected nucleotide and expression polymorphisms using a robust procedure against the outliers. The proposed method is referred to as 'Simultaneous detection of Nucleotide and Expression Polymorphisms (SNEP)'.</p>
         <p>SFPs between two strains are easily detected when gDNA hybridization is feasible, because the amounts of applied target DNA are thought to be almost the same between the two strains for each probe. This allows us to easily detect SFPs, e.g., by a simple <it>t</it>-test for each probe. However, gDNA hybridization can only be used with smaller genomes. For larger genomes, such as barley and mammals, mRNA hybridization should be used instead, because the significant cross hybridization is observed during whole genome hybridization <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. When mRNA hybridization is employed, the amount of applied target cRNA for a given gene is not always the same between the two strains, which makes simple <it>t</it>-tests infeasible; therefore some methods for detecting SFP probes have been developed. Rostoks <it>et al. </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp> adopted a standard testing procedure based on a standard interaction model with significance analysis of microarrays, SAM <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, and detected SFP probes by a significant interaction of probe by genotype. Similar to our study, Cui <it>et al. </it><abbrgrp><abbr bid="B11">11</abbr></abbrgrp> regarded the signal intensities from the SFP probes as outliers and adopted a robust projection pursuit to detect the SFP probes. These two groups used their methods to examine barley. Ronald <it>et al. </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp> focused on the ratio of signal intensity to the gene expression level for each probe; if the ratio was different between two genomes, then the probe was judged to be an SFP. They applied their method to <it>S. cerevisiae</it>. Luo <it>et al. </it><abbrgrp><abbr bid="B12">12</abbr></abbrgrp> used a similar strategy of analyzing the signal intensity ratio. Greenhall <it>et al. </it><abbrgrp><abbr bid="B14">14</abbr></abbrgrp> compensated for gene expression differences by appropriately scaling the PM minus MM values and detected the SFP probes by a simple <it>t</it>-test.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Model and hypothesis</p>
            </st>
            <p>Figure <figr fid="F1">1</figr> shows two typical sets of mRNA data. Based on these data, we have constructed a basic statistical model and next prepared some hypotheses. Hereafter, the log10 value of signal intensity is called the 'log-intensity' for simplicity.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>mRNA data by GeneChip</p>
               </caption>
               <text>
                  <p><b>mRNA data by GeneChip</b>. Each figure contains data for a different gene. The <it>x</it>-axis indicates the 11 probes. The <it>y</it>-axis is the log10 value of signal intensity. The blue and red lines correspond to two strains. The line number indicates the replicate number.</p>
               </text>
               <graphic file="1471-2105-10-131-1"/>
            </fig>
            <p>Let <it>x</it><sub><it>ijk </it></sub>be the log-intensity in the <it>k</it>th replicate on the <it>j</it>th probe for the <it>i</it>th strain, where <it>i </it>= 1, 2 stand for two strains. Let <it>&#956;</it><sub><it>ij </it></sub>be the mean log-intensity on the <it>j</it>th probe for the <it>i</it>th strain. Here we assume that the difference between the log-intensity and the mean does not depend on the position of the probe. This tendency was seen for many probe sets as well as in Figure <figr fid="F1">1</figr> and similarly adopted in <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. To express this characteristic difference, we use the parameter <it>&#957;</it><sub><it>ik </it></sub>in the <it>k</it>th replicate for the <it>i</it>th strain. Consequently, the basic statistical model can be expressed as</p>
            <p>
               <display-formula>
                  <graphic file="1471-2105-10-131-i1.gif"/>
               </display-formula>
            </p>
            <p>where <inline-formula><graphic file="1471-2105-10-131-i2.gif"/></inline-formula>, <it>&#949;</it><sub><it>ijk</it></sub>'s are the noise terms, <it>J </it>is the number of probes and <it>K </it>is the number of replicates. We assume that <it>&#949;</it><sub><it>ijk </it></sub>has a normal distribution with mean zero and variance <it>&#963;</it><sup>2</sup>.</p>
            <p>If no SFP is present in a probe set, then the log-intensity differences between two strains are expected to be almost the same on all the probes in the probe set, in other words, the signal intensity ratios for two strains are almost the same on all the probes in the probe set. This tendency was seen for many probe sets as well as in Figure <figr fid="F1">1</figr> with the exception of some SFP probes. Let the difference between two means be denoted by <it>&#955;</it><sub><it>j </it></sub>= <it>&#956;</it><sub>1<it>j </it></sub>- <it>&#956;</it><sub>2<it>j</it></sub>. The above expectation implies the hypothesis that</p>
            <p>
               <display-formula>
                  <graphic file="1471-2105-10-131-i3.gif"/>
               </display-formula>
            </p>
            <p>In this paper, the difference between <it>&#955;</it><sub><it>j </it></sub>and the common <it>&#955; </it>is called the 'probe effect'.</p>
            <p>In Figure <figr fid="F1">1</figr>, some <it>&#955;</it><sub><it>j</it></sub>'s are clearly larger than the common <it>&#955;</it>. This would be caused by SFPs, because the nucleotide polymorphism can reduce hybridization signal intensity. The alternative hypothesis can be expressed as a one-sided one, given by</p>
            <p>
               <display-formula>
                  <graphic file="1471-2105-10-131-i4.gif"/>
               </display-formula>
            </p>
            <p>which means that <it>&#955;</it><sub><it>j </it></sub>is larger (or smaller) than the common <it>&#955; </it>and the <it>&#955;</it><sub><it>j</it>' </sub>'s except for the <it>j</it>th probe are the same as the common <it>&#955;</it>. If we reject the null hypothesis <it>H </it>and accept the alternative hypothesis <it>K</it><sub><it>j</it></sub>, then we will judge the <it>j</it>th probe to be an SFP for the second (or first) strain. For the 9th probe in Figure <figr fid="F1">1(a)</figr> and the 10th and 11th probes in Figure <figr fid="F1">1(b)</figr>, we expect to accept the alternative hypothesis <it>K</it><sub><it>j</it></sub>: <it>&#955;</it><sub><it>j </it></sub>> <it>&#955;</it>.</p>
            <p>Consider the case where the first strain is the platform one, in other words, the hybridization can be disturbed only for the second strain. Then we use only one alternative hypothesis, such as <it>K</it><sub><it>j</it></sub>: <it>&#955;</it><sub><it>j </it></sub>> <it>&#955;</it>, because <it>&#956;</it><sub>2<it>j </it></sub>can become much smaller than expected.</p>
            <p>Let us prepare the hypothesis <it>H</it><sub>0</sub>: <it>&#955; </it>= 0. If the null hypothesis <it>H</it><sub>0 </sub>is rejected, then the corresponding gene is judged to be differently expressed. For Figures <figr fid="F1">1(a)</figr> and <figr fid="F1">1(b)</figr>, we expect to accept and reject the null hypothesis <it>H</it><sub>0</sub>: <it>&#955; </it>= 0, respectively.</p>
         </sec>
         <sec>
            <st>
               <p>Random effects</p>
            </st>
            <p>We also incorporate random effects into the model to address various types of dispersion, including the noise dispersion. Such a device is often adopted to reduce the number of parameters when the number of replicates is small. This device also makes the robust procedure easily applicable, as described later. We assume that the difference in the means, <it>&#955;</it><sub><it>j </it></sub>= <it>&#956;</it><sub>1<it>j </it></sub>- <it>&#956;</it><sub>2<it>j</it></sub>, has a normal distribution with mean <it>&#955; </it>and variance <it>&#964;</it><sup>2 </sup>because <it>&#955;</it><sub><it>j</it></sub>'s can be regarded to be dispersed around the common difference <it>&#955;</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Treatment of outlier</p>
            </st>
            <p>The following is a simple review about the adverse effects of an outlier. Let <it>y</it><sub>1</sub>,..., <it>y</it><sub>10 </sub>be the observations. Let <it>y</it><sub>1 </sub>= &#8943; = <it>y</it><sub>9 </sub>= 0 and <it>y</it><sub>10 </sub>= 50. Consider the estimation of the mean parameter <it>&#956; </it>= E [<it>y</it>]. The sample mean, a standard estimate of the mean parameter, is 5. This estimate may be inappropriate because we generally regard <it>y</it><sub>10 </sub>= 50 as an outlier and expect <it>&#956; </it>= 0 from the other observations. To carefully treat outliers, we often adopt the robust parameter estimation.</p>
            <p>The signal intensities from the SFP probes can be regarded as outliers when they show different behaviors from other signal intensities, as seen in Figure <figr fid="F1">1</figr>. Thus we need to carefully examine the RNA data and then adopt a robust procedure against the outliers, as described later.</p>
            <p>The parameter estimation of the probe effect (<it>&#955;</it><sub><it>j </it></sub>- <it>&#955;</it>) is illustrated in Figure <figr fid="F2">2</figr>. The probe effect is expected to be close to zero if the probe is not an SFP. Except for the 10th and 11th SFP probes in Figure <figr fid="F1">1(b)</figr>, the robust estimates were balanced on both sides of zero, whereas most of the standard estimates (maximum likelihood estimates) were greater than zero.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Estimate of the probe effect (<it>&#955;</it><sub><it>j </it></sub>- <it>&#955;</it>)</p>
               </caption>
               <text>
                  <p><b>Estimate of the probe effect (<it>&#955;</it><sub><it>j </it></sub>- <it>&#955;</it>)</b>. The <it>x</it>-axis indicates the 11 probes. The <it>y</it>-axis denotes the estimate of the probe effect from Figure 1(b). The dashed and solid lines correspond to the robust estimate and the standard estimate (maximum likelihood estimate), respectively.</p>
               </text>
               <graphic file="1471-2105-10-131-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Parameter estimation</p>
            </st>
            <p>Let <inline-formula><graphic file="1471-2105-10-131-i5.gif"/></inline-formula> and <inline-formula><graphic file="1471-2105-10-131-i6.gif"/></inline-formula>. Let <inline-formula><graphic file="1471-2105-10-131-i7.gif"/></inline-formula>. Here we assume that the hypothesis <it>H </it>holds. Then, <inline-formula><graphic file="1471-2105-10-131-i8.gif"/></inline-formula>, where <it>&#954;</it><sup>2 </sup>= <it>&#964;</it><sup>2 </sup>+ 2<it>&#963;</it><sup>2</sup>/<it>K</it>. Let us consider the parameter estimation based on <it>z</it><sub><it>j</it></sub>'s.</p>
            <p>Let &#632;(<it>z</it>; <it><b>&#952;</b></it>) be the normal density function with the parameter <it><b>&#952; </b></it>= (<it>&#955;</it>, <it>&#954;</it><sup>2</sup>). The robust parameter estimate <inline-formula><graphic file="1471-2105-10-131-i9.gif"/></inline-formula> can be obtained by the minimizer of</p>
            <p>
               <display-formula>
                  <graphic file="1471-2105-10-131-i10.gif"/>
               </display-formula>
            </p>
            <p>This type of robust parameter estimation was investigated by Windham <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, Basu <it>et al. </it><abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, Jones <it>et al. </it><abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, and Fujisawa and Eguchi <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
            <p>The positive tuning parameter <it>&#947; </it>controls the trade-off between efficiency and robustness. As <it>&#947; </it>goes to zero, the <inline-formula><graphic file="1471-2105-10-131-i11.gif"/></inline-formula> limits to the maximum likelihood estimator, which is efficient but not robust against outliers. When <it>&#947; </it>= 1, the <inline-formula><graphic file="1471-2105-10-131-i11.gif"/></inline-formula> is similar to the <it>L</it><sub>2 </sub>estimator, which is known as a strong robust estimator <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. As the tuning parameter <it>&#947; </it>is smaller or larger, the robustness will become weaker or stronger, respectively, whereas the efficiency will increase or decrease, respectively. We used <it>&#947; </it>= 0.5 for the analysis of the mRNA data from various experiences. We will also discuss the choice of <it>&#947; </it>later.</p>
            <p>The normal density function <it>&#632;</it>(<it>z</it>; <it><b>&#952;</b></it>) belongs to an exponential family. For this reason, we can construct a convenient and iterative algorithm to obtain the robust parameter estimate (Appendix A1 of the additional file <supplr sid="S1">1</supplr>). By virtue of the standard theory of M-estimation, the distribution of the robust parameter estimator can be approximated to a normal distribution (Appendix A2 of additional file <supplr sid="S1">1</supplr>). The above robust parameter estimation shows strong robustness even when the ratio of the outlier is not small. This is suitable for the analysis of the mRNA data because the probe set may contain a number of SFP probes. For the detailed properties of the above robust parameter estimation, see Fujisawa and Eguchi <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>Additional Information</b>. It provides supplementary information about the detailed explanation of data and complicated mathematical derivations.</p>
               </text>
               <file name="1471-2105-10-131-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>It should be noted that the variance parameter <it>&#954;</it><sup>2 </sup>= <it>&#964;</it><sup>2 </sup>+ 2<it>&#963;</it><sup>2</sup>/<it>K </it>includes two types of variance parameter, <it>&#964;</it><sup>2 </sup>and <it>&#963;</it><sup>2</sup>. The estimate <inline-formula><graphic file="1471-2105-10-131-i12.gif"/></inline-formula> is sometimes underestimated for the analysis of the mRNA data when <it>&#963;</it><sup>2 </sup>is relatively large and the number of replicates is small. To overcome this difficulty, we modified the estimate <inline-formula><graphic file="1471-2105-10-131-i12.gif"/></inline-formula> as follows. We first estimated the variance parameter <it>&#963;</it><sup>2 </sup>by a standard unbiased estimate <inline-formula><graphic file="1471-2105-10-131-i13.gif"/></inline-formula> and then replaced the estimate <inline-formula><graphic file="1471-2105-10-131-i12.gif"/></inline-formula> by <inline-formula><graphic file="1471-2105-10-131-i14.gif"/></inline-formula> if <inline-formula><graphic file="1471-2105-10-131-i15.gif"/></inline-formula>, because <it>&#954;</it><sup>2 </sup>&#8805; 2<it>&#963;</it><sup>2</sup>/<it>K</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Testing procedure</p>
            </st>
            <p>Consider the testing problem for the null hypothesis <it>H</it>: <it>&#955;</it><sub>1 </sub>= &#8943; = <it>&#955;</it><sub><it>J </it></sub>= <it>&#955; </it>against the alternative hypothesis <it>K</it><sub><it>j</it></sub>: <it>&#955;</it><sub><it>j </it></sub>> <it>&#955;</it>. If we can make an appropriate estimate <inline-formula><graphic file="1471-2105-10-131-i16.gif"/></inline-formula> of <it>&#955;</it><sub><it>j</it></sub>, then we can propose the Wald-type test statistic:</p>
            <p>
               <display-formula>
                  <graphic file="1471-2105-10-131-i17.gif"/>
               </display-formula>
            </p>
            <p>where <inline-formula><graphic file="1471-2105-10-131-i18.gif"/></inline-formula> is an appropriate estimate of the standard deviation of <inline-formula><graphic file="1471-2105-10-131-i19.gif"/></inline-formula> (Appendix A3 of additional file <supplr sid="S1">1</supplr>). We simply estimated the parameter <it>&#955;</it><sub><it>j </it></sub>by <inline-formula><graphic file="1471-2105-10-131-i20.gif"/></inline-formula>. The distribution of the test statistic <it>T</it><sub><it>j </it></sub>can be approximated to the standard normal distribution. Let <it>z</it><sub><it>&#945; </it></sub>be the upper 100&#945; % point of the standard normal distribution. If <it>T</it><sub><it>j </it></sub>> <it>z</it><sub><it>&#945;</it></sub>, then we will accept the alternative hypothesis <it>K</it><sub><it>j </it></sub>at significance level &#945; and judge the <it>j</it>th probe as an SFP. By a similar way, we can also treat the alternative hypothesis <it>K</it><sub><it>j</it></sub>: <it>&#955;</it><sub><it>j </it></sub>&lt;<it>&#955; </it>and two-sided alternatives.</p>
            <p>Consider the testing problem for the null hypothesis <it>H</it><sub>0</sub>: <it>&#955; </it>= 0. We can propose the Wald-type test statistic:</p>
            <p>
               <display-formula>
                  <graphic file="1471-2105-10-131-i21.gif"/>
               </display-formula>
            </p>
            <p>where <inline-formula><graphic file="1471-2105-10-131-i22.gif"/></inline-formula> is an appropriate estimate of the standard deviation of <inline-formula><graphic file="1471-2105-10-131-i23.gif"/></inline-formula> (Appendix A4 of the additional file <supplr sid="S1">1</supplr>). If |<it>T</it><sub>0</sub>| > <it>z</it><sub><it>&#945;</it>/2</sub>, then we will reject the null hypothesis <it>H</it><sub>0 </sub>at significance level <it>&#945; </it>and judge the corresponding gene to be differently expressed.</p>
         </sec>
         <sec>
            <st>
               <p>mRNA data</p>
            </st>
            <p>The Affymetrix GeneChip Rice Genome Array consisted of 57,381 probe sets containing 631,066 probes. Signal intensities of two fully sequenced rice cultivars,  <it>japonica </it>rice "Nipponbare" <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> and <it>indica </it>one "93-11" <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, were observed by hybridizing their mRNA to the rice array. mRNA data were obtained for five biological replicates from 2 cm young panicles of both Nipponbare and 93-11.</p>
            <p>The Affymetrix GeneChip Mouse Genome 430 2.0 Array consisted of 45,101 probe sets containing 496,468 probes. Signal intensities of two inbred strains, C57BL/6J (referred to below as B6) and MSM/Ms (<it>Mus musculus molossinus</it>), were observed by hybridizing their mRNA to the mouse array. mRNA data were obtained for two biological replicates from the liver of both B6 and MSM/Ms.</p>
            <p>For a more detailed experimental environment and sequence analysis, see the additional file <supplr sid="S1">1</supplr>. All microarray data from this study are available from the Center for Information Biology gene EXpression (CIBEX) database <url>http://cibex.nig.ac.jp/index.jsp</url> under accession numbers CBX50 and CBX54.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>SFP detection in barley data</p>
            </st>
            <p>The barley data were analyzed by Rostoks <it>et al. </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. To detect SFP probes, they adopted a standard testing procedure based on a standard interaction model with SAM <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. There were 2,601 probes whose target sequences were confirmed in the two analyzed varieties: Morex and Golden Promise. They consisted of 2,200 non-polymorphic probes and 401 polymorphic probes among which 178 and 223 probes were polymorphic to Morex and Golden Promise sequences, respectively. There were six types of tissue and all except one were analyzed using three replicates (<url>http://naturalvariation.org/barley</url>).</p>
            <p>We considered both alternative hypotheses, <it>K</it><sub><it>j</it></sub>: <it>&#955;</it><sub><it>j </it></sub>> <it>&#955; </it>and <it>K</it><sub><it>j</it></sub>: <it>&#955;</it><sub><it>j </it></sub>&lt;<it>&#955;</it>, because a probe could be an SFP for both strains. The sensitivity was calculated by the ratio of the number of probes correctly judged as SFPs to the number of SFP probes. The false positive rate (FPR) was calculated by the ratio of the number of probes incorrectly judged as SFPs to the number of probes judged as SFPs. The sensitivities and FPRs of various methods are given in Table <tblr tid="T1">1</tblr>.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Sensitivity and FPR of SFP detection in barley data.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="center">
                        <p>Tissue</p>
                     </c>
                     <c cspan="4" ca="center">
                        <p>COL</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Method</p>
                     </c>
                     <c ca="center">
                        <p>SNEP</p>
                     </c>
                     <c ca="center">
                        <p>LRT</p>
                     </c>
                     <c ca="center">
                        <p>Ros<sup><it>a</it></sup></p>
                     </c>
                     <c ca="center">
                        <p>Gre<sup><it>b</it></sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Sensitivity</p>
                     </c>
                     <c ca="center">
                        <p>0.579</p>
                     </c>
                     <c ca="center">
                        <p>0.569</p>
                     </c>
                     <c ca="center">
                        <p>0.52</p>
                     </c>
                     <c ca="center">
                        <p>0.506</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>FPR</p>
                     </c>
                     <c ca="center">
                        <p>0.259</p>
                     </c>
                     <c ca="center">
                        <p>0.299</p>
                     </c>
                     <c ca="center">
                        <p>0.35</p>
                     </c>
                     <c ca="center">
                        <p>0.420</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Tissue</p>
                     </c>
                     <c cspan="4" ca="center">
                        <p>CRO</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Method</p>
                     </c>
                     <c ca="center">
                        <p>SNEP</p>
                     </c>
                     <c ca="center">
                        <p>LRT</p>
                     </c>
                     <c ca="center">
                        <p>Ros<sup><it>a</it></sup></p>
                     </c>
                     <c ca="center">
                        <p>Gre<sup><it>b</it></sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Sensitivity</p>
                     </c>
                     <c ca="center">
                        <p>0.673</p>
                     </c>
                     <c ca="center">
                        <p>0.636</p>
                     </c>
                     <c ca="center">
                        <p>0.58</p>
                     </c>
                     <c ca="center">
                        <p>0.608</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>FPR</p>
                     </c>
                     <c ca="center">
                        <p>0.173</p>
                     </c>
                     <c ca="center">
                        <p>0.338</p>
                     </c>
                     <c ca="center">
                        <p>0.34</p>
                     </c>
                     <c ca="center">
                        <p>0.440</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Tissue</p>
                     </c>
                     <c cspan="4" ca="center">
                        <p>GEM</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Method</p>
                     </c>
                     <c ca="center">
                        <p>SNEP</p>
                     </c>
                     <c ca="center">
                        <p>LRT</p>
                     </c>
                     <c ca="center">
                        <p>Ros<sup><it>a</it></sup></p>
                     </c>
                     <c ca="center">
                        <p>Gre<sup><it>b</it></sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Sensitivity</p>
                     </c>
                     <c ca="center">
                        <p>0.691</p>
                     </c>
                     <c ca="center">
                        <p>0.618</p>
                     </c>
                     <c ca="center">
                        <p>0.63</p>
                     </c>
                     <c ca="center">
                        <p>0.534</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>FPR</p>
                     </c>
                     <c ca="center">
                        <p>0.153</p>
                     </c>
                     <c ca="center">
                        <p>0.218</p>
                     </c>
                     <c ca="center">
                        <p>0.34</p>
                     </c>
                     <c ca="center">
                        <p>0.314</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Tissue</p>
                     </c>
                     <c cspan="4" ca="center">
                        <p>LEA</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Method</p>
                     </c>
                     <c ca="center">
                        <p>SNEP</p>
                     </c>
                     <c ca="center">
                        <p>LRT</p>
                     </c>
                     <c ca="center">
                        <p>Ros<sup><it>a</it></sup></p>
                     </c>
                     <c ca="center">
                        <p>Gre<sup><it>b</it></sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Sensitivity</p>
                     </c>
                     <c ca="center">
                        <p>0.574</p>
                     </c>
                     <c ca="center">
                        <p>0.531</p>
                     </c>
                     <c ca="center">
                        <p>0.51</p>
                     </c>
                     <c ca="center">
                        <p>0.524</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>FPR</p>
                     </c>
                     <c ca="center">
                        <p>0.151</p>
                     </c>
                     <c ca="center">
                        <p>0.273</p>
                     </c>
                     <c ca="center">
                        <p>0.34</p>
                     </c>
                     <c ca="center">
                        <p>0.440</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Tissue</p>
                     </c>
                     <c cspan="4" ca="center">
                        <p>RAD</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Method</p>
                     </c>
                     <c ca="center">
                        <p>SNEP</p>
                     </c>
                     <c ca="center">
                        <p>LRT</p>
                     </c>
                     <c ca="center">
                        <p>Ros<sup><it>a</it></sup></p>
                     </c>
                     <c ca="center">
                        <p>Gre<sup><it>b</it></sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Sensitivity</p>
                     </c>
                     <c ca="center">
                        <p>0.656</p>
                     </c>
                     <c ca="center">
                        <p>0.623</p>
                     </c>
                     <c ca="center">
                        <p>0.62</p>
                     </c>
                     <c ca="center">
                        <p>0.504</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>FPR</p>
                     </c>
                     <c ca="center">
                        <p>0.137</p>
                     </c>
                     <c ca="center">
                        <p>0.264</p>
                     </c>
                     <c ca="center">
                        <p>0.34</p>
                     </c>
                     <c ca="center">
                        <p>0.276</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The barley data were obtained from the supplementary data at <url>http://naturalvariation.org/barley</url><abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. COL: coleoptile. CRO: seedling crown. GEM: embryo from germinating seed. LEA: seedling leaf. RAD: radicle (seminal root). The number of replicates was three (<it>K </it>= 3) for each tissue.</p>
                  <p>Ros<sup><it>a</it></sup>: Method of Rostoks <it>et al. </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. The corresponding sensitivity and FPR were extracted from Rostoks <it>et al. </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
                  <p>Gre<sup><it>b</it></sup>: Method of Greenhall <it>et al. </it><abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Approximately 10% probes of 2,601 candidate probes were not used when calculating the sensitivity and FPR, because these probes did not meet the authors' selection criteria.</p>
               </tblfn>
            </tbl>
            <p>SNEP was applied to the barley data with three replicates (<it>K </it>= 3) without normalization. The significance levels were set at 10<sup>-3</sup>/2 and 10<sup>-2</sup>/2 for the first four and RAD samples, respectively, which allowed us to easily compare SNEP with the method employed by Rostoks <it>et al. </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. SNEP markedly outperformed their method. For example, for the CRO samples, the sensitivity and FPR of SNEP were approximately 9% and 17% superior to those obtained using their method, respectively.</p>
            <p>We also applied the likelihood ratio test (LRT) based on a standard interaction model without normalization. LRT is a standard testing procedure that is similar to the basic test statistic used by Rostoks <it>et al. </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. The significance levels for LRT were set at 10<sup>-5</sup>/2 and 10<sup>-3</sup>/2 for the first four and RAD samples, respectively, which allowed us to easily compare LRT with the other methods. SNEP markedly outperformed LRT, whereas LRT almost outperformed the methods of Rostoks <it>et al. </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
            <p>We also analyzed the same barley data with the method employed by Greenhall <it>et al. </it><abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Both of the one-sided alternative hypotheses were considered and the significance level was set at 10<sup>-4</sup>/2 by using the standard normal approximation. Both SNEP and LRT markedly outperformed their method.</p>
            <p>The method of Rostoks <it>et al. </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp> and LRT were based on a similar standard testing procedure, but the former was inferior to the latter. The major difference between two methods was that the former adopted the normalization and SAM. It might seem that the normalization markedly affected the performance of the method. A prerequisite for most normalization is that the mRNA affinities to the microarray are the same for all of the replicates. Normalization enables the total amount of hybridized mRNA to be equalized for all of the replicates. However, in different strains (or species), nucleotide polymorphisms affect the mRNA affinities to the microarray. For cross-strain (or cross-species) microarrays, normalization is not sufficient to equalize the affinities. Thus, we did not include normalization in SNEP.</p>
            <p>Receiver operating characteristic (ROC) curves are shown in Figure <figr fid="F3">3</figr>. One method is said to outperform another when the associated ROC curve lies above the ROC curve of the comparator method. SNEP markedly outperformed LRT. In particular, when the FPR was approximately 0.2, the difference between the results obtained by SNEP and LRT tended to become larger as the FPR became smaller. We also tried to use <it>&#947; </it>= 0.2 and <it>&#947; </it>= 0.8 instead of <it>&#947; </it>= 0.5, which was the default value for SNEP. The results obtained with <it>&#947; </it>= 0.2 was worse than those obtained with the other values. It was also pointed out in Fujisawa and Eguchi <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> that the case <it>&#947; </it>= 0.2 would not suffice when the ratio of outlier was not small. We could not clearly determine which was better, <it>&#947; </it>= 0.5 or <it>&#947; </it>= 0.8. Because the objective function for robust parameter estimation tends to be flat for a large value of <it>&#947;</it>, the iterative algorithm for robust parameter estimation did not work well for synthetic data sets with a large value of <it>&#947; </it>(data not shown). Therefore, we used <it>&#947; </it>= 0.5 as the default value for SNEP.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>ROC curve for SFP detection</p>
               </caption>
               <text>
                  <p><b>ROC curve for SFP detection</b>. The <it>x</it>- and <it>y</it>-axes are the FPR and sensitivity, respectively. The black, red, green and blue lines are based on LRT, SNEP(<it>&#947; </it>= 0.2), SNEP(<it>&#947; </it>= 0.5) and SNEP(<it>&#947; </it>= 0.8), respectively, using appropriate significance levels, which range from 10<sup>-16 </sup>to 1.</p>
               </text>
               <graphic file="1471-2105-10-131-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>SFP detection in rice data</p>
            </st>
            <p>SNEP was also applied to the rice data. Two points made this analysis different from that performed with the barley data. For barley, the genome sequence was only partly known, whereas we obtained the whole genome sequences of both Nipponbare and 93-11. We thus could study the performance of the methods in more detail. The rice arrays were designed mostly with <it>japonica </it>transcripts and therefore Nipponbare was regarded as the platform strain.</p>
            <p>We prepared 'canonical rice data' and then we applied SNEP and LRT to the canonical rice data. The canonical rice data consisted of signal intensities for the probe sets in which all 11 probe sequences were perfectly matched as a single copy in the Nipponbare genome and were matched as a single copy in the 93-11 genome. Note that the term 'single copy' means there is only one similar sequence in a genome. For the canonical rice data, the SFP probes only interacted with the 93-11 sequences. For this reason, we used only one alternative hypothesis to detect SFP probes for 93-11.</p>
            <p>We first examined the effects of the degree of signal intensity by the median of the log-intensities in a probe set, called the 'median-intensity', because we thought that low signal intensity level might not represent enough hybridization. SNEP was applied to the canonical rice data at significance level 10<sup>-3</sup>. We constructed four classes (&lt;2, 2&#8211;2.5, 2.5&#8211;3, and 3&#8804;) of the median-intensity and categorized the probe sets into each class. The sensitivity and FPR of SNEP were calculated for each class when the number of sequence-verified SFP probes in a probe set, called the 'SFP number', was one (Table <tblr tid="T2">2</tblr>). It was clear that the sensitivity and FPR were much worse when the median-intensity was low. Moreover, when the median-intensities for both strains were more than 2.5, the sensitivity and FPR were stable at high and low levels, respectively. Thus, we say that the gene is sufficiently expressed when the median-intensity is more than 2.5.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Sensitivity and FPR of SFP detection for various signal intensities in the canonical rice data in which the SFP number was one.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="center">
                        <p>
                           <inline-formula>
                              <graphic file="1471-2105-10-131-i24.gif"/>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="right">
                        <p>&lt; 2</p>
                     </c>
                     <c ca="right">
                        <p>2 &#8211; 2.5</p>
                     </c>
                     <c ca="right">
                        <p>2.5 &#8211; 3</p>
                     </c>
                     <c ca="right">
                        <p>3 &#8804;</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4" ca="center">
                        <p># of probe sets</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>&lt; 2</p>
                     </c>
                     <c ca="right">
                        <p>186</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2 &#8211; 2.5</p>
                     </c>
                     <c ca="right">
                        <p>322</p>
                     </c>
                     <c ca="right">
                        <p>1184</p>
                     </c>
                     <c ca="right">
                        <p>18</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2.5 &#8211; 3</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="right">
                        <p>136</p>
                     </c>
                     <c ca="right">
                        <p>452</p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3 &#8804;</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>5</p>
                     </c>
                     <c ca="right">
                        <p>115</p>
                     </c>
                     <c ca="right">
                        <p>422</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4" ca="center">
                        <p># of SFP probes judged by SNEP</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>&lt; 2</p>
                     </c>
                     <c ca="right">
                        <p>10</p>
                     </c>
                     <c ca="right">
                        <p>3</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2 &#8211; 2.5</p>
                     </c>
                     <c ca="right">
                        <p>70</p>
                     </c>
                     <c ca="right">
                        <p>338</p>
                     </c>
                     <c ca="right">
                        <p>24</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2.5 &#8211; 3</p>
                     </c>
                     <c ca="right">
                        <p>8</p>
                     </c>
                     <c ca="right">
                        <p>99</p>
                     </c>
                     <c ca="right">
                        <p>415</p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3 &#8804;</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>5</p>
                     </c>
                     <c ca="right">
                        <p>111</p>
                     </c>
                     <c ca="right">
                        <p>415</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4" ca="center">
                        <p>Sensitivity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>&lt; 2</p>
                     </c>
                     <c ca="right">
                        <p>0.022</p>
                     </c>
                     <c ca="right">
                        <p>0.333</p>
                     </c>
                     <c ca="right">
                        <p>NA</p>
                     </c>
                     <c ca="right">
                        <p>NA</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2 &#8211; 2.5</p>
                     </c>
                     <c ca="right">
                        <p>0.034</p>
                     </c>
                     <c ca="right">
                        <p>0.164</p>
                     </c>
                     <c ca="right">
                        <p>0.722</p>
                     </c>
                     <c ca="right">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2.5 &#8211; 3</p>
                     </c>
                     <c ca="right">
                        <p>0.167</p>
                     </c>
                     <c ca="right">
                        <p>0.529</p>
                     </c>
                     <c ca="right">
                        <p>0.750</p>
                     </c>
                     <c ca="right">
                        <p>0.667</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3 &#8804;</p>
                     </c>
                     <c ca="right">
                        <p>NA</p>
                     </c>
                     <c ca="right">
                        <p>0.200</p>
                     </c>
                     <c ca="right">
                        <p>0.843</p>
                     </c>
                     <c ca="right">
                        <p>0.777</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4" ca="center">
                        <p>FPR</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>&lt; 2</p>
                     </c>
                     <c ca="right">
                        <p>0.600</p>
                     </c>
                     <c ca="right">
                        <p>0.333</p>
                     </c>
                     <c ca="right">
                        <p>NA</p>
                     </c>
                     <c ca="right">
                        <p>NA</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2 &#8211; 2.5</p>
                     </c>
                     <c ca="right">
                        <p>0.843</p>
                     </c>
                     <c ca="right">
                        <p>0.426</p>
                     </c>
                     <c ca="right">
                        <p>0.458</p>
                     </c>
                     <c ca="right">
                        <p>NA</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2.5 &#8211; 3</p>
                     </c>
                     <c ca="right">
                        <p>0.875</p>
                     </c>
                     <c ca="right">
                        <p>0.273</p>
                     </c>
                     <c ca="right">
                        <p>0.183</p>
                     </c>
                     <c ca="right">
                        <p>0.333</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3 &#8804;</p>
                     </c>
                     <c ca="right">
                        <p>NA</p>
                     </c>
                     <c ca="right">
                        <p>0.800</p>
                     </c>
                     <c ca="right">
                        <p>0.126</p>
                     </c>
                     <c ca="right">
                        <p>0.210</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><inline-formula><graphic file="1471-2105-10-131-i25.gif"/></inline-formula>. <inline-formula><graphic file="1471-2105-10-131-i26.gif"/></inline-formula>: median-intensities for Nipponbare and 93-11. NA: not available.</p>
               </tblfn>
            </tbl>
            <p>SNEP and LRT were applied to the canonical rice data in which the genes were sufficiently expressed. The significance levels for SNEP and LRT were set at 10<sup>-3 </sup>and 10<sup>-4</sup>, respectively, which allowed us to easily compare the two methods. The sensitivity and FPR are given in Table <tblr tid="T3">3</tblr>. We omitted the extreme case in which all 11 probes were SFPs, because there were much more extreme cases compared with the other cases. The sensitivity and FPR became smaller as the SFP number became larger, as expected. In contrast to the analysis of the barley data, SNEP only slightly outperformed LRT. This would be because the disadvantages associated with LRT are not so remarkable in general when using only one of the two one-sided alternative hypotheses.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Sensitivity and FPR of SFP detection for various SFP numbers in the canonical rice data in which the genes were sufficiently expressed.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="center">
                        <p>SFP number</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1&#8211;3</p>
                     </c>
                     <c ca="center">
                        <p>1&#8211;5</p>
                     </c>
                     <c ca="center">
                        <p>1&#8211;10</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p># of probe sets</p>
                     </c>
                     <c ca="center">
                        <p>998</p>
                     </c>
                     <c ca="center">
                        <p>1699</p>
                     </c>
                     <c ca="center">
                        <p>1831</p>
                     </c>
                     <c ca="center">
                        <p>1901</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p># of SFP probes</p>
                     </c>
                     <c ca="center">
                        <p>998</p>
                     </c>
                     <c ca="center">
                        <p>2602</p>
                     </c>
                     <c ca="center">
                        <p>3177</p>
                     </c>
                     <c ca="center">
                        <p>3689</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4" ca="center">
                        <p>SNEP</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Sensitivity</p>
                     </c>
                     <c ca="center">
                        <p>0.772</p>
                     </c>
                     <c ca="center">
                        <p>0.747</p>
                     </c>
                     <c ca="center">
                        <p>0.717</p>
                     </c>
                     <c ca="center">
                        <p>0.653</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>FPR</p>
                     </c>
                     <c ca="center">
                        <p>0.189</p>
                     </c>
                     <c ca="center">
                        <p>0.112</p>
                     </c>
                     <c ca="center">
                        <p>0.101</p>
                     </c>
                     <c ca="center">
                        <p>0.097</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4" ca="center">
                        <p>LRT</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Sensitivity</p>
                     </c>
                     <c ca="center">
                        <p>0.800</p>
                     </c>
                     <c ca="center">
                        <p>0.748</p>
                     </c>
                     <c ca="center">
                        <p>0.711</p>
                     </c>
                     <c ca="center">
                        <p>0.666</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>FPR</p>
                     </c>
                     <c ca="center">
                        <p>0.222</p>
                     </c>
                     <c ca="center">
                        <p>0.132</p>
                     </c>
                     <c ca="center">
                        <p>0.119</p>
                     </c>
                     <c ca="center">
                        <p>0.113</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>We also examined an alternative way of selecting genes. We used Affymetrix GeneChip Operating Software (GCOS), which has been often used to determine whether or not a probe is sufficiently expressed. We calculated the sensitivity and FPR of SNEP for detecting SFP probes in the probe sets in which all 11 probes were judged to be 'Present' at significance level 0.01(default) by GCOS. When the SFP number was one, the sensitivity and FPR became approximately 13% and 3% worse than those shown in Table <tblr tid="T3">3</tblr>.</p>
         </sec>
         <sec>
            <st>
               <p>SFP detection in mouse data</p>
            </st>
            <p>SNEP was also applied to the mouse data. The mouse genome might be more complex than the rice genome, because the mouse genome is roughly six times larger, but mice contain less than half the number of genes identified in rice <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. Because the mouse array was designed for B6 transcripts, we used only one alternative hypothesis to detect SFP probes for MSM/Ms. The overall nucleotide substitution rate between these two strains was as high as 0.0096 <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. The genome sequence of MSM/Ms has been extensively studied and 187,560 probe target sequences from MSM/Ms were available at <url>http://molossinus.lab.nig.ac.jp/msmdb/</url>. We used 17,043 probe sets in which all 11 probe target sequences were known in both B6 and MSM/Ms.</p>
            <p>SNEP was applied to the mouse data in which the median-intensity was more than 2.5 and the SFP number was one. There were 710 objective probe sets. The significance level was set at 10<sup>-3</sup>. The sensitivity and FPR of SNEP were 0.524 and 0.316, respectively, which were inferior to the values obtained in a similar analysis of rice data (0.772 and 0.189). However, because the mouse genome is more complicated, it seemed that the performance of SNEP was still good.</p>
         </sec>
         <sec>
            <st>
               <p>Effects of signal intensity level for SFP detection</p>
            </st>
            <p>In the analysis of mouse data, we increased the threshold value from 2.5 to 3 in order to avoid the effects of cross-hybridization. There were 165 objective probe sets. The sensitivity and FPR were improved to 0.733 and 0.243, respectively. We also used the threshold value 2.5 to examine the barley data. We first focus on the analysis of the COL samples. The number of objective probes was reduced from 2,601 to 1,937. The sensitivity and FPR did not markedly change in comparison to those obtained in the analysis of the rice data. We further increased the threshold value from 2.5 to 3. The number of objective probes was reduced to 838. The sensitivity and FPR were much improved from 0.579 and 0.259 to 0.810 and 0.156, respectively. For other types of tissue, similar improvements were observed. Thus, the way of selecting genes will be an important issue to stabilize the SFP detection.</p>
         </sec>
         <sec>
            <st>
               <p>Detecting differently expressed genes in rice data</p>
            </st>
            <p>In contrast to SFP detection, it is difficult to clearly investigate the performance of the method for detecting differently expressed genes, due to the paucity of data regarding which genes are differently expressed. Instead, we examined whether SNEP was robust against the adverse effects of an SFP probe for detecting differently expressed genes. We compared the robust test statistic <it>T</it><sub>0 </sub>with the standard <it>t</it>-statistic based on the Tukey's biweight estimate. We adopted the Tukey's biweight estimate instead of directly using the raw data, because this estimate has been commonly used for detecting differently expressed genes.</p>
            <p>We first illustrate the robustness of the two test statistics against the adverse effects of an SFP probe by analyzing the data in Figure <figr fid="F1">1(a)</figr>, which suggests the hypothesis that the gene is not differently expressed. The <it>T</it><sub>0</sub>-value was 1.53 and the <it>p</it>-value was 0.126. This result was consistent with our hypothesis. However, the <it>t</it>-value was 3.64 and the <it>p</it>-value was less than 10<sup>-3</sup>. This result was not consistent with our hypothesis. The Tukey's biweight method tends to weaken the adverse effects of an outlier, but it is not always designed to weaken the adverse effects of an SFP probe because it is based on only one strain. In fact, the signal intensities on the 9th probe may not be outliers when we focus only on each replicate for 93-11 and neglect the other replicates. In such a case, the Tukey's biweight method produces a smaller estimate of gene expression level for 93-11, which results in a larger <it>t</it>-statistic. These would be the reason why the <it>t</it>-value was larger than expected. SNEP can weaken the adverse effects of an SFP probe because it examines two strains simultaneously, as described already.</p>
            <p>Figure <figr fid="F4">4</figr> shows the global robustness of the two test statistics by comparing two cases. One case was based on the canonical rice data in which the genes were sufficiently expressed and the SFP number ranged from 1 to 5. The other case was based on the modified data in which the signal intensities from the SFP probes were deleted. The former case might be affected by the SFP probes, because the data included the signal intensities from the SFP probes. If an SFP probe produced an adverse effect, the test statistic would tend to be larger in the former case than in the latter case, as illustrated above. As seen in Figure <figr fid="F4">4</figr>, the <it>t</it>-statistic tended to be larger in the former case, whereas the <it>T</it><sub>0</sub>-statistic did not. We showed that the robust test statistic <it>T</it><sub>0 </sub>was much more robust against the adverse effects of an SFP probe than the <it>t</it>-statistic based on Tukey's biweight estimate. We also found that the absolute value of the <it>T</it><sub>0</sub>-statistic tended to be slightly smaller in the former case than in the latter case. This may occur because the signal intensities from the SFP probes are not always outliers. In such a case, the sample size of meaningful probes tends to become large and then the denominator of <it>T</it><sub>0 </sub>tends to become small.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Distribution of the test statistics</p>
               </caption>
               <text>
                  <p><b>Distribution of the test statistics</b>. The gray histograms are based on the canonical rice data in which the genes were sufficiently expressed and the SFP number ranged from 1 to 5. The unshaded histograms are based on the modified data in which the signal intensities from the SFP probes were deleted.</p>
               </text>
               <graphic file="1471-2105-10-131-4"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion and discussion</p>
         </st>
         <p>We have developed 'SNEP' to simultaneously detect nucleotide and expression polymorphisms. We expected that the signal intensity ratios for two strains were almost the same on all the probes in a probe set when no SFPs were present. We furthermore considered that the SFP probe could be regarded as an outlier because the SFP probe might not satisfy this expectation. To effectively use these ideas, we adopted a statistical model and a robust procedure.</p>
         <p>SNEP was applied to data from barley (large genome that has not been extensively sequenced), rice (small genome that has been extensively sequenced) and mice (large genome that has been extensively sequenced). When a great deal of sequence information was available, one of the two strains (or varieties) was regarded as a platform strain. SNEP worked well regardless of genome size. SNEP outperformed the standard testing procedure, the method of Rostoks <it>et al. </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp> and the method of Greenhall <it>et al. </it><abbrgrp><abbr bid="B14">14</abbr></abbrgrp> for detecting SFP probes in the barley data. SNEP also performed well for detecting SFP probes in the rice and mouse data. SNEP was more powerful for detecting SFP probes than the standard testing procedure when no platform strain was present, in other words, when both alternative hypotheses were necessary. SNEP was carried out without normalization and the effect of normalization was also investigated. SNEP was more robust against the adverse effect of an SFP probe for detecting differently expressed genes than the standard <it>t</it>-statistic based on the Tukey's biweight estimates.</p>
         <p>It is worth noting that there may be more than two SFP probes in a given probe set, which typically consisted of 11 probes. In this case, the ratio of the outlier is not small generally, making it difficult to appropriately obtain statistical results. To overcome this difficulty, SNEP uses a divergence-based procedure. This procedure is robust even in the cases where the ratio of the outlier is not small, and provides some convenient properties. Cui <it>et al. </it><abbrgrp><abbr bid="B11">11</abbr></abbrgrp> also adopted a robust procedure based on projection pursuit using the median, but the projection pursuit was computationally heavy and furthermore the median might suffer from a heavy bias because the median had no redescending weight <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>.</p>
         <p>Differently expressed genes can be detected by a simple <it>t</it>-test based on estimated gene expression levels. The gene expression levels are typically estimated by the Tukey's biweight method. This method tends to weaken the adverse effects of an outlier, but it is not always designed to weaken the adverse effects of an SFP probe because it is based on only one strain. However, SNEP can weaken the adverse effects of an SFP probe because it addresses two strains simultaneously. Some studies showed that SNEP was more robust against the adverse effects of an SFP probe for detecting differently expressed genes than a simple <it>t</it>-test based on the Tukey's biweight estimates.</p>
         <p>New DNA sequencing methods, which can produce hundreds of millions of DNA sequence reads during a single run, are superior to microarray technology in both sequence variation detection and gene expression level estimation <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. However, the cost of a single run using a "next generation sequencer" is still four to five times higher than that with Affymetrix GeneChip. Array technologies are a cost-effective option for studies of biodiversity. SNEP offers a reliable tool for detection of both nucleotide sequence and expression level variations in small or large genomes during array analysis.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>HF proposed the idea of the method, performed the statistical analysis, made the software, and drafted the manuscript. YH performed the rice experiments and statistical analysis. YH discussed the rice experimental design, analyzed the rice probe sequences, and helped to draft the manuscript. TT performed the mouse experiments and helped to draft the manuscript. SE improved the method. TM analyzed the rice probe sequences. TS helped to perform the statistical analysis and make the software. TS conceived of the study and participated in the mouse experimental design and coordination. NK conceived of the study and participated in the rice experimental design and coordination. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Yukiko Yamazaki at NIG for helping with the rice sequence analysis and Toshinobu Ebata and Yuji Kohara at NIG for helping with the mouse SNP analysis. We also thank five reviewers for their constructive comments. This work was supported by the Bio-diversity Research Project of the Transdisciplinary Research Integration Center, Research Organization of Information and Systems, by ISM Project Research, and by Grant-in-Aid for Scientific Research from the Ministry of Education, Culture, Sports, Science and Technology.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Expression monitoring by hybridization to high-density oligonucleotide arrays</p>
            </title>
            <aug>
               <au>
                  <snm>Lockhart</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Byrne</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Follettie</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Gallo</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Chee</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Mittmann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kobayashi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Horton</snm>
                  <fnm>H</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>1996</pubdate>
            <volume>14</volume>
            <fpage>1675</fpage>
            <lpage>1680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1296-1675</pubid>
                  <pubid idtype="pmpid" link="fulltext">9634850</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>A systematic map of genetic variation in <it>Plasmodium falciparum</it></p>
            </title>
            <aug>
               <au>
                  <snm>Kidgell</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Volkman</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Daily</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Borevitz</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Plouffe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Le Roch</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sarr</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Ndir</snm>
                  <fnm>O</fnm>
               </au>
               <etal/>
            </aug>
            <source>PLoS Path</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>e57</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1371/journal.ppat.0020057</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Direct allelic variation scanning of the yeast genome</p>
            </title>
            <aug>
               <au>
                  <snm>Winzeler</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Richards</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Conway</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Goldstein</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Kalman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>McCullough</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>McCusker</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Stevens</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Wodicka</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lockhart</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1998</pubdate>
            <volume>281</volume>
            <fpage>1194</fpage>
            <lpage>1197</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.281.5380.1194</pubid>
                  <pubid idtype="pmpid" link="fulltext">9712584</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Genomic islands of speciation in <it>Anopheles gambiae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Turner</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Hahn</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Nuzhdin</snm>
                  <fnm>SV</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e285</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pbio.0030285</pubid>
                  <pubid idtype="pmpid" link="fulltext">16076241</pubid>
                  <pubid idtype="pmcid">1182689</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Large-scale identification of single-feature polymorphisms in complex genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Borevitz</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Plouffe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Weigel</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Berry</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Winzeler</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Chory</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>513</fpage>
            <lpage>523</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.541303</pubid>
                  <pubid idtype="pmpid" link="fulltext">12618383</pubid>
                  <pubid idtype="pmcid">430246</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Genome-wide patterns of single-feature polymorphism in <it>Arabidopsis thaliana</it></p>
            </title>
            <aug>
               <au>
                  <snm>Borevitz</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Hazen</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Michael</snm>
                  <fnm>TP</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Baxter</snm>
                  <fnm>IR</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>TT</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Werner</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Nordborg</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Salt</snm>
                  <fnm>DE</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>12057</fpage>
            <lpage>12062</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0705323104</pubid>
                  <pubid idtype="pmpid" link="fulltext">17626786</pubid>
                  <pubid idtype="pmcid">1914337</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Single feature polymorphism discovery in rice</p>
            </title>
            <aug>
               <au>
                  <snm>Kumar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Qiu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Joshi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Valliyodan</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nguyen</snm>
                  <fnm>HT</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <fpage>e284</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pone.0000284</pubid>
                  <pubid idtype="pmpid" link="fulltext">17372626</pubid>
                  <pubid idtype="pmcid">1808426</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Simultaneous genotyping, gene-expression measurement, and detection of allele-specific expression with oligonucleotide arrays</p>
            </title>
            <aug>
               <au>
                  <snm>Ronald</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Akey</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Whittle</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>EN</fnm>
               </au>
               <au>
                  <snm>Yvert</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>284</fpage>
            <lpage>291</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.2850605</pubid>
                  <pubid idtype="pmpid" link="fulltext">15687292</pubid>
                  <pubid idtype="pmcid">546530</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>High-density haplotyping with microarray-based expression and single feature polymorphism markers in <it>Arabidopsis</it></p>
            </title>
            <aug>
               <au>
                  <snm>West</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>van Leeuwen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kozik</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kliebenstein</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Doerge</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>St Clair</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Michelmore</snm>
                  <fnm>RW</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>787</fpage>
            <lpage>795</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.5011206</pubid>
                  <pubid idtype="pmpid" link="fulltext">16702412</pubid>
                  <pubid idtype="pmcid">1473188</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Single-feature polymorphism discovery in the barley transcriptome</p>
            </title>
            <aug>
               <au>
                  <snm>Rostoks</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Borevitz</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Hedley</snm>
                  <fnm>PE</fnm>
               </au>
               <au>
                  <snm>Russell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mudie</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cardle</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Waugh</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>R54</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2005-6-6-r54</pubid>
                  <pubid idtype="pmpid" link="fulltext">15960806</pubid>
                  <pubid idtype="pmcid">1175974</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Detecting single-feature polymorphisms using oligonucleotide arrays and robustified projection pursuit</p>
            </title>
            <aug>
               <au>
                  <snm>Cui</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Asghar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Condamine</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Svensson</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Wanamaker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stein</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Roose</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Close</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>3852</fpage>
            <lpage>3858</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti640</pubid>
                  <pubid idtype="pmpid" link="fulltext">16118260</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>SFP Genotyping from Affymetrix arrays is robust but largely detects <it>cis</it>-acting expression regulators</p>
            </title>
            <aug>
               <au>
                  <snm>Luo</snm>
                  <fnm>ZW</fnm>
               </au>
               <au>
                  <snm>Potokina</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Druka</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wise</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Waugh</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kearsey</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2007</pubdate>
            <volume>176</volume>
            <fpage>789</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1534/genetics.106.067843</pubid>
                  <pubid idtype="pmpid" link="fulltext">17409081</pubid>
                  <pubid idtype="pmcid">1894608</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Genetic diversity contribution to errors in short oligonucleotide microarray analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Kirst</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Caldo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Casati</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Tanimoto</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Walbot</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Wise</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Buckler</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Plant Biotechnol J</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <fpage>489</fpage>
            <lpage>498</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17309725</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Detecting genetic variation in microarray expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Greenhall</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Zapala</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Caceres</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Libiger</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Barlow</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schork</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Lockhart</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>1228</fpage>
            <lpage>1235</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.6307307</pubid>
                  <pubid idtype="pmpid" link="fulltext">17609390</pubid>
                  <pubid idtype="pmcid">1933513</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Mixed-model reanalysis of primate data suggests tissue and species biases in oligonucleotide-based gene expression profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Hsieh</snm>
                  <fnm>WP</fnm>
               </au>
               <au>
                  <snm>Chu</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Wolfinger</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2003</pubdate>
            <volume>165</volume>
            <fpage>747</fpage>
            <lpage>757</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462792</pubid>
                  <pubid idtype="pmpid" link="fulltext">14573485</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Sequence polymorphisms cause many false <it>cis </it>eQTLs</p>
            </title>
            <aug>
               <au>
                  <snm>Alberts</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Terpstra</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Breitling</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nap</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Jansen</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <fpage>e622</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pone.0000622</pubid>
                  <pubid idtype="pmpid" link="fulltext">17637838</pubid>
                  <pubid idtype="pmcid">1906859</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Evaluation of Target Preparation Methods for Single-Feature Polymorphism Detection in Large Complex Plant Genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Gore</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bradbury</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hogers</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kirst</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Verstege</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>van Oeveren</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Peleman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Buckler</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>van Eijk</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Crop Science</source>
            <pubdate>2007</pubdate>
            <volume>47</volume>
            <fpage>S135</fpage>
            <lpage>S148</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2135/cropsci2007.02.0085tpg</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Mapping translocation breakpoints using a wheat microarray</p>
            </title>
            <aug>
               <au>
                  <snm>Bhat</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Lukaszewski</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cui</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Svensson</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Wanamaker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Waines</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Close</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>2936</fpage>
            <lpage>2943</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkm148</pubid>
                  <pubid idtype="pmpid" link="fulltext">17439961</pubid>
                  <pubid idtype="pmcid">1888831</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Significance analysis of microarrays applied to the ionizing radiation response</p>
            </title>
            <aug>
               <au>
                  <snm>Tusher</snm>
                  <fnm>VG</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chu</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>5116</fpage>
            <lpage>5121</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.091062498</pubid>
                  <pubid idtype="pmpid" link="fulltext">11309499</pubid>
                  <pubid idtype="pmcid">33173</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Robustifying model fitting</p>
            </title>
            <aug>
               <au>
                  <snm>Windham</snm>
                  <fnm>MP</fnm>
               </au>
            </aug>
            <source>J Roy Statist Soc Ser B</source>
            <pubdate>1995</pubdate>
            <volume>57</volume>
            <fpage>599</fpage>
            <lpage>609</lpage>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Robust and efficient estimation by minimising a density power divergence</p>
            </title>
            <aug>
               <au>
                  <snm>Basu</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>IR</fnm>
               </au>
               <au>
                  <snm>Hjort</snm>
                  <fnm>NL</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>Biometrika</source>
            <pubdate>1998</pubdate>
            <volume>85</volume>
            <fpage>549</fpage>
            <lpage>559</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/biomet/85.3.549</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>A comparison of related density-based minimum divergence estimators</p>
            </title>
            <aug>
               <au>
                  <snm>Jones</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Hjort</snm>
                  <fnm>NL</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>IR</fnm>
               </au>
               <au>
                  <snm>Basu</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Biometrika</source>
            <pubdate>2001</pubdate>
            <volume>88</volume>
            <fpage>865</fpage>
            <lpage>873</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/biomet/88.3.865</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Robust parameter estimation with a small bias against heavy contamination</p>
            </title>
            <aug>
               <au>
                  <snm>Fujisawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Eguchi</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Multivariate Anal</source>
            <pubdate>2008</pubdate>
            <volume>99</volume>
            <fpage>2053</fpage>
            <lpage>2081</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.jmva.2008.02.004</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Parametric statistical modeling by minimum integrated square error</p>
            </title>
            <aug>
               <au>
                  <snm>Scott</snm>
                  <fnm>DW</fnm>
               </au>
            </aug>
            <source>Technometrics</source>
            <pubdate>2001</pubdate>
            <volume>43</volume>
            <fpage>274</fpage>
            <lpage>285</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1198/004017001316975880</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>The map-based sequence of the rice genome</p>
            </title>
            <aug>
               <au>
                  <cnm>International Rice Genome Sequencing Project</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>436</volume>
            <fpage>793</fpage>
            <lpage>800</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03895</pubid>
                  <pubid idtype="pmpid" link="fulltext">16100779</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>The genomes of <it>Oryza sativa</it>: A history of duplications</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ni</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>C</fnm>
               </au>
               <etal/>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e38</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pbio.0030038</pubid>
                  <pubid idtype="pmpid" link="fulltext">15685292</pubid>
                  <pubid idtype="pmcid">546038</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Initial sequencing and comparative analysis of the mouse genome</p>
            </title>
            <aug>
               <au>
                  <cnm>Mouse Genome Sequencing Consortium</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>420</volume>
            <fpage>520</fpage>
            <lpage>562</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01262</pubid>
                  <pubid idtype="pmpid" link="fulltext">12466850</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Contribution of Asian mouse subspecies <it>Mus musculus molossinus </it>to genomic constitution of strain C57BL/6J, as defined by BAC-end sequence-SNP analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Abe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Noguchi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tagawa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yuzuriha</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Toyoda</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kojima</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ezawa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Saitou</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sakaki</snm>
                  <fnm>Y</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>2439</fpage>
            <lpage>2447</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.2899304</pubid>
                  <pubid idtype="pmpid" link="fulltext">15574823</pubid>
                  <pubid idtype="pmcid">534668</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <aug>
               <au>
                  <snm>Hampel</snm>
                  <fnm>FR</fnm>
               </au>
               <au>
                  <snm>Ronchetti</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Rousseeuw</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Stahel</snm>
                  <fnm>WA</fnm>
               </au>
            </aug>
            <source>Robust statistics: The approach based on influence functions</source>
            <publisher>Wiley</publisher>
            <pubdate>1986</pubdate>
         </bibl>
         <bibl id="B30">
            <title>
               <p>RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays</p>
            </title>
            <aug>
               <au>
                  <snm>Marioni</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Mason</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Mane</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Stephens</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gilad</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2008</pubdate>
            <volume>18</volume>
            <fpage>1509</fpage>
            <lpage>1517</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.079558.108</pubid>
                  <pubid idtype="pmpid" link="fulltext">18550803</pubid>
                  <pubid idtype="pmcid">2527709</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Stem cell transcriptome profiling via massive-scale mRNA sequencing</p>
            </title>
            <aug>
               <au>
                  <snm>Cloonan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Forrest</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Kolle</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gardiner</snm>
                  <fnm>BB</fnm>
               </au>
               <au>
                  <snm>Faulkner</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Steptoe</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Wani</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bethel</snm>
                  <fnm>G</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Meth</source>
            <pubdate>2008</pubdate>
            <volume>5</volume>
            <fpage>613</fpage>
            <lpage>619</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1038/nmeth.1223</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
