<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-9-113</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>Human Pol II promoter recognition based on primary sequences and free energy of dinucleotides</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Yang</snm>
               <fnm>Jian-Yi</fnm>
               <insr iid="I1"/>
               <email>yangjianyiapple@163.com</email>
            </au>
            <au id="A2">
               <snm>Zhou</snm>
               <fnm>Yu</fnm>
               <insr iid="I1"/>
               <email>zynova@hotmail.com</email>
            </au>
            <au id="A3" ca="yes">
               <snm>Yu</snm>
               <fnm>Zu-Guo</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>yuzg1970@yahoo.com</email>
            </au>
            <au id="A4">
               <snm>Anh</snm>
               <fnm>Vo</fnm>
               <insr iid="I2"/>
               <email>v.anh@qut.edu.au</email>
            </au>
            <au id="A5">
               <snm>Zhou</snm>
               <fnm>Li-Qian</fnm>
               <insr iid="I1"/>
               <email>zhoulq@xtu.edu.cn</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>School of Mathematics and Computational Science, Xiangtan University, Hunan 411105, China</p>
            </ins>
            <ins id="I2">
               <p>School of Mathematical Sciences, Queensland University of Technology, GPO Box 2434, Brisbane, Q 4001, Australia</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>113</fpage>
         <url>http://www.biomedcentral.com/1471-2105/9/113</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18294399</pubid>
               <pubid idtype="doi">10.1186/1471-2105-9-113</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>13</day>
               <month>8</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>24</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>24</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Yang et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The promoter region plays an important role in determining where the transcription of a particular gene is initiated. Computational prediction of eukaryotic Pol II promoter sequences is one of the most significant problems in sequence analysis. Existing promoter prediction methods are still far from satisfactory.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We attempt to distinguish human Pol II promoter sequences from non-promoter sequences, which are made up of exon and intron sequences. Four methods are used: two kinds of multifractal analysis performed on the numeric sequences obtained from the dinucleotide free energy, Z curve analysis and a global descriptor of the promoter/non-promoter primary sequences. A total of 141 parameters are extracted from these methods and categorized into seven groups (methods). They are used to generate parameter spaces, and each promoter/non-promoter sequence is then represented by a point in the corresponding space. All 120 possible combinations of the seven methods are tested. Based on Fisher's linear discriminant algorithm, with a relatively small number of parameters (96 and 117), we obtain satisfactory discriminant accuracies. In particular, with 117 parameters, the accuracies for the training and test sets reach 90.43% and 89.79%, respectively. A comparison with five other existing methods indicates that our methods perform better. Using the global descriptor method (36 parameters), 17 of the 18 experimentally verified promoter sequences of human chromosome 22 are correctly identified.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The high accuracies achieved suggest that the methods of this paper are useful for understanding the difficult problem of promoter prediction.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The promoter region plays an essential role in determining where the transcription of a particular gene is initiated. Hence, promoter recognition, the computational task of finding the promoter regions on a DNA sequence, is an important problem <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. The accumulation of a huge amount of genome sequence data in recent years has made the annotation process increasingly complicated for higher eukaryotes <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. The RNA polymerase II (Pol II) promoter is a key region that regulates differential transcription of protein coding genes. Computational analysis of Pol II promoters may contribute to improved gene identification and to prediction of the expression context of genes <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. There is a need for prediction techniques that can rapidly and accurately evaluate sequences for the presence of promoter sequences <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p>
         <p>Existing promoter prediction methods are still far from satisfactory <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Many current eukaryotic promoter prediction methods are unreliable, with poor specificity or poor sensitivity <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Many methods predict promoter sequences based on the regulatory sequence elements (RSEs) they contain. However, the RSEs are short and not fully conserved in promoter sequences, so similar sequence elements are likely to be found elsewhere in genomes, outside the promoter regions. That is why most promoter prediction methods end up predicting many false positives <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Fickett and Hatzigeorgiou <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> evaluated the different promoter prediction methods on genomic DNA and suggested that it would be worth attempting nonlinear recognition methods, such as neural nets or quadratic discriminant analysis. Following this direction, Gangal and Sharma <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> applied time series descriptors and machine learning methods to human Pol II promoter prediction and obtained a higher accuracy than other methods; Kanhere and Bansal <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> presented a novel prokaryotic promoter prediction method based on DNA stability, showing that changes in DNA stability provide a much better clue than the usual sequence motifs.</p>
         <p>In this paper, we attempt to distinguish human Pol II promoter sequences from non-promoter sequences, which consist of exon and intron sequences. The aim of the present paper is similar to that of Ref. <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, but the non-promoter sequences in Ref. <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> are made up of coding sequences (CDSs) and intron sequences, whereas we extract non-promoter sequences from an existing database, the Exon/Intron database. We first convert the promoter/non-promoter sequences into numeric sequences according to the 10 unified free energy parameters <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, which have been used to measure the stability of DNA <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. A measure representation is then introduced for the numeric sequences, and multifractal analysis of this measure yields the first 5 parameters. An analogous multifractal analysis <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> applied to the numeric sequences yields another 4 parameters. The Z curve method, which has been used in recent years with some success <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>, yields 96 parameters for the promoter/non-promoter primary sequences. The protein-chain descriptor method was first proposed by Dubchak <it>et al</it>. <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> to predict protein folding classes. Here we propose a global descriptor for the promoter/non-promoter sequences, which yields 36 parameters for a global description of the primary sequences. Overall, a total of 141 parameters (5 + 4 + 96 + 36) are extracted from these four methods and categorized into seven groups (methods). Fisher's linear discriminant algorithm shows that the global descriptor method is the most effective when used separately. All 120 possible combinations of these seven methods are then enumerated and tested to find better results with a relatively small number of parameters. Numerical results show that the methods with 96 and 117 parameters produce satisfactory results. Compared with five other existing tools, the methods proposed here achieve higher sensitivity, specificity, accuracy and correlation coefficient, which suggests that they are useful for understanding the human Pol II promoter prediction problem. The global descriptor method, with only 36 parameters, successfully identifies 17 of the 18 experimentally verified promoter sequences of human chromosome 22 <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Testing</p>
            </st>
            <p>We use two data sets downloaded from two databases. The first set consists of the human Pol II promoter sequences from Release 90 of the Eukaryotic Promoter Database (EPD) <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The EPD is an annotated non-redundant collection of eukaryotic Pol II promoters, experimentally defined by a transcription start site (TSS) <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. It is a useful resource for the Pol II promoter prediction problem and has been widely used to test different prediction tools <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. A total of 1871 human Pol II promoter sequences, with a window of 499 bp upstream and 100 bp downstream of the TSS (the same window as that used in Ref. <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>), are obtained from EPD. Sequences containing 'N' are manually filtered out, which leaves a total of 1856 sequences. The second set consists of non-promoter sequences of the human genome. For this data set, we use the Exon/Intron Database (EID), which incorporates information on the exon/intron structure of eukaryotic genes <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> (<abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, hs35p1.EID.tar.gz). First, the exon/intron sequences that contain 'n' or are shorter than 600 bp are filtered out. Then, we randomly select 1000 intron sequences from the file hs35p1.intrEID and 500 exon sequences from the file hs35p1.exEID. A fragment of length 600 is then selected randomly from each exon/intron sequence with length larger than 600. As the intron sequences are represented by lower-case letters in the file hs35p1.intrEID, we transform them into upper-case letters to be consistent with the promoter and exon sequences.</p>
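            <p>The filtering and fragment selection described above can be sketched as follows. This is a minimal illustration only, not the authors' code: the function name <it>clean_and_sample</it> and its arguments are hypothetical, sequences are assumed to be already loaded as plain strings, and sequences of exactly 600 bp are assumed to be kept whole, which the text does not specify.</p>

```python
import random

WINDOW = 600  # fragment length used for the non-promoter sequences


def clean_and_sample(sequences, window=WINDOW, rng=None):
    """Drop sequences that contain 'n'/'N' or are shorter than `window`,
    then cut one random fragment of length `window` from each remaining
    sequence and return the fragments in upper case."""
    rng = rng or random.Random()
    fragments = []
    for seq in sequences:
        # Skip ambiguous or too-short sequences.
        if "n" in seq.lower() or window > len(seq):
            continue
        # randint has inclusive bounds, so every valid start is possible.
        start = rng.randint(0, len(seq) - window)
        fragments.append(seq[start:start + window].upper())
    return fragments


# Toy input: one usable sequence, one containing 'n', one too short.
toy = ["acgt" * 200, "acgn" * 200, "acgt" * 10]
fragments = clean_and_sample(toy, rng=random.Random(0))
print(len(fragments), len(fragments[0]))
```

            <p>Passing a seeded <it>random.Random</it> instance makes the fragment selection repeatable, which helps when the same non-promoter set has to be regenerated.</p>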
            <p>The four methods described in the Methods section yield a total of 141 parameters. We first test their individual contributions to the promoter/non-promoter discrimination problem, and then combine some of them to see whether better results can be achieved.</p>
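            <p>The figure of 120 combinations quoted for the seven methods is consistent with counting every combination of two or more methods: there are 2^7 - 1 = 127 non-empty subsets, of which 7 are single methods. That reading is an assumption, since the text does not state how the count was obtained. The short sketch below enumerates the combinations under this assumption, labelling the methods 1 to 7 as in the row numbering of Table 1.</p>

```python
from itertools import combinations

methods = range(1, 8)  # the seven parameter groups, labelled 1..7

# Every subset that combines at least two of the seven groups.
combos = [c for k in range(2, 8) for c in combinations(methods, k)]

print(len(combos))          # 120
print((3, 6, 7) in combos)  # True: the "3+6+7" combination
```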
            <p>To compare the various methods, a common benchmark is needed. We use Fisher's linear discriminant algorithm <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp> to calculate the discriminant accuracies. We randomly divide all promoter and non-promoter sequences into two sets: 90% of the promoter/non-promoter sequences form the training set, and the remaining 10% form the test set.</p>
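            <p>A random 90%/10% division of this kind might look like the sketch below. The helper <it>split_train_test</it> is hypothetical, and applying it to the promoter and non-promoter sets separately is an assumption; the text only states that the sequences are divided randomly.</p>

```python
import random


def split_train_test(items, train_fraction=0.9, seed=None):
    """Shuffle `items` and return (training set, test set) as two lists."""
    rng = random.Random(seed)
    shuffled = list(items)
    rng.shuffle(shuffled)
    cut = round(train_fraction * len(shuffled))
    return shuffled[:cut], shuffled[cut:]


# Integer indices stand in for the 1856 promoter and
# 1000 + 500 = 1500 non-promoter sequences.
prom_train, prom_test = split_train_test(range(1856), seed=1)
non_train, non_test = split_train_test(range(1500), seed=1)
print(len(prom_train), len(prom_test))  # 1670 186
print(len(non_train), len(non_test))    # 1350 150
```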
            <p>Fisher's discriminant algorithm is used to find a classifier in the parameter space for a training set. The given training set <it>H </it>= {<b>x</b><sub><b>1</b></sub>, <b>x</b><sub><b>2</b></sub>, &#8943;, <b>x</b><sub><b>n</b></sub>} is partitioned into <it>n</it><sub>1 </sub>&#8804; <it>n </it>training vectors in a subset <it>H</it><sub>1 </sub>and <it>n</it><sub>2 </sub>&#8804; <it>n </it>training vectors in a subset <it>H</it><sub>2</sub>, where <it>n</it><sub>1 </sub>+ <it>n</it><sub>2 </sub>= <it>n </it>and each <b>x</b><sub><b>i </b></sub>is a <it>&#954;</it>-dimensional vector, represented by one point in the <it>&#954;</it>-dimensional parameter space. Then <it>H </it>= <it>H</it><sub>1 </sub>&#8746; <it>H</it><sub>2</sub>. We need to find a parameter vector <b>w </b>= (<it>w</it><sub>1</sub>, <it>w</it><sub>2</sub>, &#8943;, <it>w</it><sub><it>&#954;</it></sub>)<sup><it>T </it></sup>for the <it>&#954;</it>-dimensional space such that <inline-formula><m:math name="1471-2105-9-113-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mrow><m:mo>{</m:mo><m:msub><m:mi>y</m:mi><m:mi>i</m:mi></m:msub><m:mo>=</m:mo><m:mi>w</m:mi><m:msub><m:mi>x</m:mi><m:mi>i</m:mi></m:msub><m:mo>}</m:mo></m:mrow><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaei4EaSNaemyEaK3aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpieqacqWF3bWDcqWF4baEdaWgaaWcbaGae8xAaKgabeaakiabc2ha9naaDaaaleaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGUbGBaaaaaa@3C47@</m:annotation></m:semantics></m:math></inline-formula> can be classified into two classes in the space of real numbers. If we denote</p>
            <p>
               <display-formula id="M1">
                  <m:math name="1471-2105-9-113-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>m</m:mi>
                                          <m:mi>j</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mfrac>
                                          <m:mn>1</m:mn>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>n</m:mi>
                                                <m:mi>j</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mfrac>
                                       <m:mstyle displaystyle="true">
                                          <m:munder>
                                             <m:mo>&#8721;</m:mo>
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mi>x</m:mi>
                                                   <m:mi>i</m:mi>
                                                </m:msub>
                                                <m:mo>&#8712;</m:mo>
                                                <m:msub>
                                                   <m:mi>H</m:mi>
                                                   <m:mi>j</m:mi>
                                                </m:msub>
                                             </m:mrow>
                                          </m:munder>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>x</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mstyle>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>j</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>2</m:mn>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeGaaaqaaGqabiab=1gaTnaaBaaaleaacqWGQbGAaeqaaOGaeyypa0tcfa4aaSaaaeaacqaIXaqmaeaacqWGUbGBdaWgaaqaaiabdQgaQbqabaaaaOWaaabuaeaacqWF4baEdaWgaaWcbaGae8xAaKgabeaaaeaacqWF4baEdaWgaaadbaGae8xAaKgabeaaliabgIGiolabdIeainaaBaaameaacqWGQbGAaeqaaaWcbeqdcqGHris5aaGcbaGaemOAaOMaeyypa0JaeGymaeJaeiilaWIaeGOmaiJaeiilaWcaaaaa@4700@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>
               <display-formula id="M2">
                  <m:math name="1471-2105-9-113-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>S</m:mi>
                                          <m:mi>j</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mstyle displaystyle="true">
                                          <m:munder>
                                             <m:mo>&#8721;</m:mo>
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mi>x</m:mi>
                                                   <m:mi>i</m:mi>
                                                </m:msub>
                                                <m:mo>&#8712;</m:mo>
                                                <m:msub>
                                                   <m:mi>H</m:mi>
                                                   <m:mi>j</m:mi>
                                                </m:msub>
                                             </m:mrow>
                                          </m:munder>
                                          <m:mrow>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>x</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>m</m:mi>
                                                <m:mi>j</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:msup>
                                                <m:mrow>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:msub>
                                                      <m:mi>x</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:msub>
                                                      <m:mi>m</m:mi>
                                                      <m:mi>j</m:mi>
                                                   </m:msub>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                                <m:mi>T</m:mi>
                                             </m:msup>
                                          </m:mrow>
                                       </m:mstyle>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>j</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>2</m:mn>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeGaaaqaaGqabiab=nfatnaaBaaaleaacqWGQbGAaeqaaOGaeyypa0ZaaabuaeaacqGGOaakcqWF4baEdaWgaaWcbaGae8xAaKgabeaakiabgkHiTiab=1gaTnaaBaaaleaacqWFQbGAaeqaaOGaeiykaKIaeiikaGIae8hEaG3aaSbaaSqaaiab=LgaPbqabaGccqGHsislcqWFTbqBdaWgaaWcbaGae8NAaOgabeaakiabcMcaPmaaCaaaleqabaGaemivaqfaaaqaaiab=Hha4naaBaaameaacqWFPbqAaeqaaSGaeyicI4SaemisaG0aaSbaaWqaaiabdQgaQbqabaaaleqaniabggHiLdGccqGGSaalaeaacqWGQbGAcqGH9aqpcqaIXaqmcqGGSaalcqaIYaGmcqGGSaalaaaaaa@52B5@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>
               <display-formula id="M3"><b>S</b><sub><it>w </it></sub>= <b>S</b><sub>1 </sub>+ <b>S</b><sub>2</sub>,</display-formula>
            </p>
            <p>then the parameter vector <b>w </b>is estimated as <inline-formula><m:math name="1471-2105-9-113-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>S</m:mi><m:mi>w</m:mi><m:mrow><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:msubsup><m:mo stretchy="false">(</m:mo><m:msub><m:mi>m</m:mi><m:mn>1</m:mn></m:msub><m:mo>&#8722;</m:mo><m:msub><m:mi>m</m:mi><m:mn>2</m:mn></m:msub><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacbeGae83uam1aa0baaSqaaiabdEha3bqaaiabgkHiTiabigdaXaaakiabcIcaOiab=1gaTnaaBaaaleaacqaIXaqmaeqaaOGaeyOeI0Iae8xBa02aaSbaaSqaaiabikdaYaqabaGccqGGPaqkaaa@383E@</m:annotation></m:semantics></m:math></inline-formula><abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. As a result, Fisher's discriminant rule becomes: "assign <b>x </b>to <it>H</it><sub>1 </sub>if <inline-formula><m:math name="1471-2105-9-113-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>Z</m:mi><m:mo stretchy="false">(</m:mo><m:mi>x</m:mi><m:mo stretchy="false">)</m:mo><m:mo>=</m:mo><m:msup><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mi>m</m:mi><m:mn>1</m:mn></m:msub><m:mo>&#8722;</m:mo><m:msub><m:mi>m</m:mi><m:mn>2</m:mn></m:msub><m:mo stretchy="false">)</m:mo></m:mrow><m:mi>T</m:mi></m:msup><m:msubsup><m:mi>S</m:mi><m:mi>w</m:mi><m:mrow><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:msubsup><m:mo stretchy="false">[</m:mo><m:mi>x</m:mi><m:mo>&#8722;</m:mo><m:mfrac><m:mn>1</m:mn><m:mn>2</m:mn></m:mfrac><m:mo stretchy="false">(</m:mo><m:msub><m:mi>m</m:mi><m:mn>1</m:mn></m:msub><m:mo>+</m:mo><m:msub><m:mi>m</m:mi><m:mn>2</m:mn></m:msub><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">]</m:mo><m:mo>></m:mo><m:mn>0</m:mn></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacbeGae8NwaOLaeiikaGIae8hEaGNaeiykaKIaeyypa0JaeiikaGIae8xBa02aaSbaaSqaaiabigdaXaqabaGccqGHsislcqWFTbqBdaWgaaWcbaGaeGOmaidabeaakiabcMcaPmaaCaaaleqabaGaemivaqfaaOGae83uam1aa0baaSqaaiabdEha3bqaaiabgkHiTiabigdaXaaakiabcUfaBjab=Hha4jabgkHiTKqbaoaalaaabaGaeGymaedabaGaeGOmaidaaOGaeiikaGIae8xBa02aaSbaaSqaaiabigdaXaqabaGccqGHRaWkcqWFTbqBdaWgaaWcbaGaeGOmaidabeaakiabcMcaPiabc2faDjabg6da+iabicdaWaaa@500E@</m:annotation></m:semantics></m:math></inline-formula> and to <it>H</it><sub>2 </sub>otherwise" <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>.</p>
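            <p>As an illustration of Eqs. (1) to (3) and the discriminant rule, the following sketch computes the class means, the within-class scatter matrix and the score Z(x) for a toy two-dimensional data set. It is a plain-Python sketch under stated assumptions, not the implementation used for the reported results: all function names are hypothetical, the linear system is solved by a small Gaussian elimination routine instead of an explicit matrix inverse, and the within-class scatter matrix is assumed to be non-singular.</p>

```python
def mean(vectors):
    """Mean vector m_j of a class, Eq. (1)."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def scatter(vectors, m):
    """Scatter matrix S_j of a class around its mean m, Eq. (2)."""
    k = len(m)
    s = [[0.0] * k for _ in range(k)]
    for x in vectors:
        d = [xi - mi for xi, mi in zip(x, m)]
        for a in range(k):
            for b in range(k):
                s[a][b] += d[a] * d[b]
    return s


def solve(a, b):
    """Solve a w = b by Gaussian elimination with partial pivoting.
    Assumes `a` is non-singular."""
    k = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, k):
            f = aug[r][col] / aug[col][col]
            for c in range(col, k + 1):
                aug[r][c] -= f * aug[col][c]
    w = [0.0] * k
    for r in reversed(range(k)):
        tail = sum(aug[r][c] * w[c] for c in range(r + 1, k))
        w[r] = (aug[r][k] - tail) / aug[r][r]
    return w


def fisher_fit(h1, h2):
    """Return w = S_w^{-1}(m1 - m2) and the midpoint (m1 + m2) / 2."""
    m1, m2 = mean(h1), mean(h2)
    s1, s2 = scatter(h1, m1), scatter(h2, m2)
    sw = [[p + q for p, q in zip(r1, r2)] for r1, r2 in zip(s1, s2)]  # Eq. (3)
    w = solve(sw, [p - q for p, q in zip(m1, m2)])
    mid = [(p + q) / 2 for p, q in zip(m1, m2)]
    return w, mid


def fisher_z(x, w, mid):
    """Score Z(x); a positive value assigns x to H1, otherwise to H2."""
    return sum(wi * (xi - ci) for wi, xi, ci in zip(w, x, mid))


# Toy data: two well-separated classes in a 2-dimensional parameter space.
h1 = [(2, 2), (3, 2), (2, 3), (3, 3)]
h2 = [(0, 0), (1, 0), (0, 1), (1, 1)]
w, mid = fisher_fit(h1, h2)
print(w)                         # [1.0, 1.0]
print(fisher_z((3, 3), w, mid))  # 3.0, positive, so H1
print(fisher_z((0, 1), w, mid))  # -2.0, not positive, so H2
```

            <p>Because the within-class scatter matrix is symmetric, the score computed from <b>w</b> in this way equals the expression for Z(<b>x</b>) in the rule above.</p>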
            <p>The discriminant accuracies for resubstitution analysis are defined as</p>
            <p>
               <display-formula id="M4">
                  <m:math name="1471-2105-9-113-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>p</m:mi>
                              <m:mi>c</m:mi>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mtext>The&#160;number&#160;of&#160;all&#160;correct&#160;promoter&#160;discriminations</m:mtext>
                              </m:mrow>
                              <m:mrow>
                                 <m:mtext>The&#160;number&#160;of&#160;promoter&#160;sequences&#160;in&#160;the&#160;training&#160;set</m:mtext>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiCaa3aaSbaaSqaaiabdogaJbqabaGccqGH9aqpjuaGdaWcaaqaaiabbsfaujabbIgaOjabbwgaLjabbccaGiabb6gaUjabbwha1jabb2gaTjabbkgaIjabbwgaLjabbkhaYjabbccaGiabb+gaVjabbAgaMjabbccaGiabbggaHjabbYgaSjabbYgaSjabbccaGiabbogaJjabb+gaVjabbkhaYjabbkhaYjabbwgaLjabbogaJjabbsha0jabbccaGiabbchaWjabbkhaYjabb+gaVjabb2gaTjabb+gaVjabbsha0jabbwgaLjabbkhaYjabbccaGiabbsgaKjabbMgaPjabbohaZjabbogaJjabbkhaYjabbMgaPjabb2gaTjabbMgaPjabb6gaUjabbggaHjabbsha0jabbMgaPjabb+gaVjabb6gaUjabbohaZbqaaiabbsfaujabbIgaOjabbwgaLjabbccaGiabb6gaUjabbwha1jabb2gaTjabbkgaIjabbwgaLjabbkhaYjabbccaGiabb+gaVjabbAgaMjabbccaGiabbchaWjabbkhaYjabb+gaVjabb2gaTjabb+gaVjabbsha0jabbwgaLjabbkhaYjabbccaGiabbohaZjabbwgaLjabbghaXjabbwha1jabbwgaLjabb6gaUjabbogaJjabbwgaLjabbohaZjabbccaGiabbMgaPjabb6gaUjabbccaGiabbsha0jabbIgaOjabbwgaLjabbccaGiabbsha0jabbkhaYjabbggaHjabbMgaPjabb6gaUjabbMgaPjabb6gaUjabbEgaNjabbccaGiabbohaZjabbwgaLjabbsha0baacqGGSaalaaa@B4C7@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>
               <display-formula id="M5">
                  <m:math name="1471-2105-9-113-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>p</m:mi>
                              <m:mrow>
                                 <m:mi>n</m:mi>
                                 <m:mi>c</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mtext>The&#160;number&#160;of&#160;all&#160;correct&#160;non-promoter&#160;discriminations</m:mtext>
                              </m:mrow>
                              <m:mrow>
                                 <m:mtext>The&#160;number&#160;of&#160;non-promoter&#160;sequences&#160;in&#160;the&#160;training&#160;set</m:mtext>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiCaa3aaSbaaSqaaiabd6gaUjabdogaJbqabaGccqGH9aqpjuaGdaWcaaqaaiabbsfaujabbIgaOjabbwgaLjabbccaGiabb6gaUjabbwha1jabb2gaTjabbkgaIjabbwgaLjabbkhaYjabbccaGiabb+gaVjabbAgaMjabbccaGiabbggaHjabbYgaSjabbYgaSjabbccaGiabbogaJjabb+gaVjabbkhaYjabbkhaYjabbwgaLjabbogaJjabbsha0jabbccaGiabb6gaUjabb+gaVjabb6gaUjabb2caTiabbchaWjabbkhaYjabb+gaVjabb2gaTjabb+gaVjabbsha0jabbwgaLjabbkhaYjabbccaGiabbsgaKjabbMgaPjabbohaZjabbogaJjabbkhaYjabbMgaPjabb2gaTjabbMgaPjabb6gaUjabbggaHjabbsha0jabbMgaPjabb+gaVjabb6gaUjabbohaZbqaaiabbsfaujabbIgaOjabbwgaLjabbccaGiabb6gaUjabbwha1jabb2gaTjabbkgaIjabbwgaLjabbkhaYjabbccaGiabb+gaVjabbAgaMjabbccaGiabb6gaUjabb+gaVjabb6gaUjabb2caTiabbchaWjabbkhaYjabb+gaVjabb2gaTjabb+gaVjabbsha0jabbwgaLjabbkhaYjabbccaGiabbohaZjabbwgaLjabbghaXjabbwha1jabbwgaLjabb6gaUjabbogaJjabbwgaLjabbohaZjabbccaGiabbMgaPjabb6gaUjabbccaGiabbsha0jabbIgaOjabbwgaLjabbccaGiabbsha0jabbkhaYjabbggaHjabbMgaPjabb6gaUjabbMgaPjabb6gaUjabbEgaNjabbccaGiabbohaZjabbwgaLjabbsha0baacqGGUaGlaaa@C048@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>For the test analysis, the discriminant accuracies <it>q</it><sub><it>c </it></sub>and <it>q</it><sub><it>nc </it></sub>are defined similarly by changing "training set" to "test set" in Eqs. (4) and (5), respectively.</p>
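            <p>As an illustration of how these per-class accuracies can be computed, the following minimal Python sketch counts, for one class, the fraction of its sequences that the classifier assigned to that class. The function name <it>per_class_accuracy</it>, the label strings and the toy label lists are assumptions made for illustration; they are not taken from the data used in this article.</p>

```python
def per_class_accuracy(true_labels, predicted_labels, target):
    """Percentage of sequences whose true class is `target` that were
    also predicted as `target` (the form of Eqs. (4) and (5))."""
    # Keep only the predictions made for sequences that truly belong to `target`.
    in_class = [pred for true, pred in zip(true_labels, predicted_labels)
                if true == target]
    if not in_class:
        raise ValueError("no sequences of class %r in the set" % target)
    correct = sum(1 for pred in in_class if pred == target)
    return 100.0 * correct / len(in_class)


# Toy example (hypothetical labels): 4 promoters and 5 non-promoters.
true_labels = ["promoter"] * 4 + ["non-promoter"] * 5
predicted_labels = [
    "promoter", "promoter", "promoter", "non-promoter",       # 3 of 4 correct
    "non-promoter", "non-promoter", "non-promoter",
    "non-promoter", "promoter",                                # 4 of 5 correct
]

p_c = per_class_accuracy(true_labels, predicted_labels, "promoter")
p_nc = per_class_accuracy(true_labels, predicted_labels, "non-promoter")
print(p_c, p_nc)
```

            <p>Applying the same function to the labels of the training set gives values of the form <it>p</it><sub><it>c </it></sub>and <it>p</it><sub><it>nc</it></sub>, and applying it to the test set gives <it>q</it><sub><it>c </it></sub>and <it>q</it><sub><it>nc</it></sub>.</p>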
            <p>We first divide the data into training and test sets randomly, then we use the above algorithm to calculate the discriminant accuracies for different methods. The results are listed in Table <tblr tid="T1">1</tblr>.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>The discriminant accuracies for various methods with Fisher's discriminant. The method marked "3+6+7" in the 8<sup>th </sup>row is the combination of the methods listed in the 3<sup>rd</sup>, 6<sup>th </sup>and 7<sup>th </sup>rows. The method marked in the 9<sup>th </sup>row is defined similarly.</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="center">
                        <p>Order</p>
                     </c>
                     <c ca="center">
                        <p><it>p</it><sub><it>c</it></sub>(%)</p>
                     </c>
                     <c ca="center">
                        <p><it>p</it><sub><it>nc</it></sub>(%)</p>
                     </c>
                     <c ca="center">
                        <p><it>q</it><sub><it>c</it></sub>(%)</p>
                     </c>
                     <c ca="center">
                        <p><it>q</it><sub><it>nc</it></sub>(%)</p>
                     </c>
                     <c ca="center">
                        <p>Method</p>
                     </c>
                     <c ca="center">
                        <p>No. of parameters</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>73.05</p>
                     </c>
                     <c ca="center">
                        <p>85.63</p>
                     </c>
                     <c ca="center">
                        <p>74.73</p>
                     </c>
                     <c ca="center">
                        <p>83.33</p>
                     </c>
                     <c ca="center">
                        <p><it>MFA</it>+<it>AMFA</it></p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>79.16</p>
                     </c>
                     <c ca="center">
                        <p>75.78</p>
                     </c>
                     <c ca="center">
                        <p>76.88</p>
                     </c>
                     <c ca="center">
                        <p>62.67</p>
                     </c>
                     <c ca="center">
                        <p><it>ZC </it>Eq.(19)</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>78.86</p>
                     </c>
                     <c ca="center">
                        <p>88.00</p>
                     </c>
                     <c ca="center">
                        <p>79.03</p>
                     </c>
                     <c ca="center">
                        <p>85.33</p>
                     </c>
                     <c ca="center">
                        <p><it>ZC </it>Eq.(21), <it>k </it>= 12</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>78.62</p>
                     </c>
                     <c ca="center">
                        <p>89.33</p>
                     </c>
                     <c ca="center">
                        <p>79.57</p>
                     </c>
                     <c ca="center">
                        <p>89.33</p>
                     </c>
                     <c ca="center">
                        <p><it>ZC </it>Eq.(21), <it>k </it>= 23</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>80.30</p>
                     </c>
                     <c ca="center">
                        <p>90.74</p>
                     </c>
                     <c ca="center">
                        <p>80.65</p>
                     </c>
                     <c ca="center">
                        <p>90.00</p>
                     </c>
                     <c ca="center">
                        <p><it>ZC </it>Eqs.(20, 22)</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>6</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>85.75</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>88.30</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>86.02</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>91.33</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>GD</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>36</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>81.92</p>
                     </c>
                     <c ca="center">
                        <p>91.48</p>
                     </c>
                     <c ca="center">
                        <p>81.72</p>
                     </c>
                     <c ca="center">
                        <p>89.33</p>
                     </c>
                     <c ca="center">
                        <p><it>ZC </it>Eq.(23)</p>
                     </c>
                     <c ca="center">
                        <p>48</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>8</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>86.11</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>93.48</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>86.02</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>90.67</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>3+6+7</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>96</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>9</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>86.89</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>93.11</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>86.02</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>92.67</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>1+3+4+6+7</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>117</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>87.31</p>
                     </c>
                     <c ca="center">
                        <p>93.19</p>
                     </c>
                     <c ca="center">
                        <p>86.02</p>
                     </c>
                     <c ca="center">
                        <p>92.00</p>
                     </c>
                     <c ca="center">
                        <p>All methods</p>
                     </c>
                     <c ca="center">
                        <p>141</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>Firstly, seven groups of parameters are derived from the four methods: (i) 9 parameters from fractal methods (<it>MFA </it>and <it>AMFA</it>); (ii) 9 parameters from <it>ZC </it>representing the codon-position-dependent frequencies of mononucleotides; (iii) 12 parameters from <it>ZC </it>representing the frequencies of phase-specific dinucleotides (codon positions 1&#8211;2); (iv) 12 parameters from <it>ZC </it>representing the frequencies of phase-specific dinucleotides (codon positions 2&#8211;3); (v) 15 parameters for the phase-independent mononucleotides and dinucleotides from <it>ZC</it>; (vi) 36 parameters from <it>GD</it>; (vii) 48 parameters for the frequencies of phase-independent tri-nucleotides from <it>ZC</it>. From Table <tblr tid="T1">1</tblr>, it is seen that the results from the multifractal analyses seem to be better than those from <it>ZC </it>with an equal number of parameters, namely 9. We have successfully applied multifractal analyses in the clustering of large protein structures <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B25">25</abbr></abbrgrp> and the distinction of coding and non-coding sequences in complete genomes <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, where the lengths of the protein sequences and of the coding and non-coding sequences are larger than 300. It is well known that promoter sequences are highly diverse, which makes it notoriously difficult to generate patterns and rules for promoter prediction. It is expected that multifractal analyses can reveal some useful information about promoter sequences. The results from the frequencies of phase-specific dinucleotides at codon positions 2&#8211;3 in <it>ZC </it>indicate a better performance than those at codon positions 1&#8211;2. In addition, the accuracies from <it>ZC </it>with the frequencies of phase-independent mononucleotides and dinucleotides are improved, but the number of parameters is increased to 15. 
The <it>GD </it>method, shown in boldface in Table <tblr tid="T1">1</tblr> and denoted as <b>M1</b>, turns out to be especially useful, as its accuracies are all larger than 85%. By comparison, the results from the 48 parameters in <it>ZC </it>are not as good even though the number of parameters is larger.</p>
            <p>Secondly, we test whether the results can be improved by increasing the number of parameters. It is not feasible to test all the subsets of the 141 parameters, but we can test all the combinations of the above seven methods (120 altogether). In our test, the accuracies do not simply increase as the number of parameters becomes larger, which indicates that there might be some redundancy/correlation among the 141 parameters. For example, the accuracies with the 141 parameters are similar to those with only 117 parameters, suggesting that the information from the mononucleotides and phase-independent dinucleotides in <it>ZC </it>is contained in the other methods. Therefore, not all of these parameters are really needed. Nevertheless, in some circumstances the results do improve when the number of parameters is increased. In particular, among the 120 combinations, the results are relatively satisfactory in the cases of 96 and 117 parameters, which are shown in boldface in Table <tblr tid="T1">1</tblr>. We denote them by <b>M2 </b>and <b>M3</b>, respectively. In order to see whether multifractal analysis brings out useful information, we remove the 9 parameters of <it>MFA </it>and <it>AMFA </it>from <b>M3 </b>and test the resulting combination. The <it>p</it><sub><it>c</it></sub>, <it>p</it><sub><it>nc</it></sub>, <it>q</it><sub><it>c</it></sub>, and <it>q</it><sub><it>nc </it></sub>calculated from this combination are 86.05%, 92.67%, 86.02% and 92.00%, respectively. They are similar to those from <b>M3 </b>(86.89%, 93.11%, 86.02% and 92.67%), which indicates that multifractal analysis does not significantly improve the performance in <b>M3</b>.</p>
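            <p>The bookkeeping behind these counts can be checked with a short Python sketch. It assumes that the 120 combinations are the subsets containing two or more of the seven parameter groups (2<sup>7 </sup>&#8722; 1 &#8722; 7 = 120), which is one reading consistent with the stated total; the names <it>GROUP_SIZES</it>, <it>multi_group_combinations</it> and <it>n_parameters</it> are hypothetical.</p>

```python
from itertools import combinations

# Number of parameters in each group, keyed by the "Order" value of
# rows 1-7 in Table 1 (groups (i)-(vii) in the text).
GROUP_SIZES = {1: 9, 2: 9, 3: 12, 4: 12, 5: 15, 6: 36, 7: 48}


def multi_group_combinations(groups):
    """Return every combination of two or more groups as a tuple of keys."""
    keys = sorted(groups)
    return [combo
            for size in range(2, len(keys) + 1)
            for combo in combinations(keys, size)]


def n_parameters(combo, groups=GROUP_SIZES):
    """Total number of parameters used by a combination of groups."""
    return sum(groups[g] for g in combo)


combos = multi_group_combinations(GROUP_SIZES)
print(len(combos))                      # number of multi-group combinations
print(n_parameters((3, 6, 7)))          # combination marked "3+6+7"
print(n_parameters((1, 3, 4, 6, 7)))    # combination marked "1+3+4+6+7"
print(n_parameters(tuple(GROUP_SIZES))) # all seven groups together
```

            <p>Under this assumption the sketch gives 120 combinations and parameter totals of 96, 117 and 141, matching the last column of Table <tblr tid="T1">1</tblr>. Removing group 1 from "1+3+4+6+7" leaves 108 parameters.</p>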
            <p>In order to evaluate the correct prediction rate and reliability of a predictive method, the sensitivity (<it>S</it><sub><it>n</it></sub>), specificity (<it>S</it><sub><it>p</it></sub>), accuracy (<it>A</it><sub><it>c</it></sub>) and correlation coefficient (<it>CC</it>) are also used <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>:</p>
            <p>
               <display-formula id="M6"><it>S</it><sub><it>n </it></sub>= <it>TP</it>/(<it>TP </it>+ <it>FN</it>),</display-formula>
            </p>
            <p>
               <display-formula id="M7"><it>S</it><sub><it>p </it></sub>= <it>TP</it>/(<it>TP </it>+ <it>FP</it>),</display-formula>
            </p>
            <p>
               <display-formula id="M8"><it>A</it><sub><it>c </it></sub>= (<it>S</it><sub><it>n</it></sub><it>+ S</it><sub><it>p</it></sub>)/2,</display-formula>
            </p>
            <p>
               <display-formula id="M9">
                  <m:math name="1471-2105-9-113-i8" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>C</m:mi>
                           <m:mi>C</m:mi>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>T</m:mi>
                                 <m:mi>P</m:mi>
                                 <m:mo>&#215;</m:mo>
                                 <m:mi>T</m:mi>
                                 <m:mi>N</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>F</m:mi>
                                 <m:mi>P</m:mi>
                                 <m:mo>&#215;</m:mo>
                                 <m:mi>F</m:mi>
                                 <m:mi>N</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:msqrt>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>T</m:mi>
                                       <m:mi>P</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:mi>F</m:mi>
                                       <m:mi>P</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>&#215;</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>T</m:mi>
                                       <m:mi>N</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:mi>F</m:mi>
                                       <m:mi>N</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>&#215;</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>T</m:mi>
                                       <m:mi>P</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:mi>F</m:mi>
                                       <m:mi>N</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>&#215;</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>T</m:mi>
                                       <m:mi>N</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:mi>F</m:mi>
                                       <m:mi>P</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:msqrt>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4qamKaem4qamKaeyypa0tcfa4aaSaaaeaacqGGOaakcqWGubavcqWGqbaucqGHxdaTcqWGubavcqWGobGtcqGGPaqkcqGHsislcqGGOaakcqWGgbGrcqWGqbaucqGHxdaTcqWGgbGrcqWGobGtcqGGPaqkaeaadaGcaaqaaiabcIcaOiabdsfaujabdcfaqjabgUcaRiabdAeagjabdcfaqjabcMcaPiabgEna0kabcIcaOiabdsfaujabd6eaojabgUcaRiabdAeagjabd6eaojabcMcaPiabgEna0kabcIcaOiabdsfaujabdcfaqjabgUcaRiabdAeagjabd6eaojabcMcaPiabgEna0kabcIcaOiabdsfaujabd6eaojabgUcaRiabdAeagjabdcfaqjabcMcaPaqabaaaaOGaeiilaWcaaa@6569@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>TP </it>denotes the number of correctly recognized promoter sequences, <it>FN </it>the number of promoter sequences recognized as non-promoter sequences, <it>FP </it>the number of non-promoter sequences recognized as promoter sequences, and <it>TN </it>the number of correctly recognized non-promoter sequences.</p>
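            <p>The four measures can be computed directly from the confusion counts, as in the following Python sketch. The function name <it>prediction_metrics</it> is hypothetical, and the counts in the example are an assumption chosen for illustration rather than values reported in this article.</p>

```python
import math


def prediction_metrics(tp, fn, fp, tn):
    """Sensitivity, specificity, accuracy and correlation coefficient
    from confusion counts, following Eqs. (6)-(9)."""
    sn = tp / (tp + fn)                 # Eq. (6): sensitivity
    sp = tp / (tp + fp)                 # Eq. (7): specificity as defined here
    ac = (sn + sp) / 2                  # Eq. (8): mean of Sn and Sp
    # Eq. (9): correlation coefficient. The denominator is zero when any
    # row or column of the confusion matrix is empty; that case is not
    # handled in this sketch.
    denom = math.sqrt((tp + fp) * (tn + fn) * (tp + fn) * (tn + fp))
    cc = (tp * tn - fp * fn) / denom
    return {"Sn": sn, "Sp": sp, "Ac": ac, "CC": cc}


# Hypothetical counts: 186 promoters (139 recognized) and
# 150 non-promoters (125 recognized).
m = prediction_metrics(tp=139, fn=47, fp=25, tn=125)
for name in ("Sn", "Sp", "Ac"):
    print(name, round(100 * m[name], 2))
print("CC", round(m["CC"], 2))
```

            <p>Note that <it>S</it><sub><it>p </it></sub>as defined in Eq. (7) is the fraction of predicted promoters that are true promoters, so it depends on <it>FP </it>rather than on <it>TN</it>.</p>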
            <p>Using Fisher's discriminant algorithm, we calculate the four quantities defined above. The results are listed in Table <tblr tid="T2">2</tblr>, whose rows correspond to those of Table <tblr tid="T1">1</tblr> through the "Order" column.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>The accuracies of the prediction for promoter sequences by Fisher's discriminant algorithm. The <it>S</it><sub><it>n</it></sub>, <it>S</it><sub><it>p</it></sub>, <it>A</it><sub><it>c </it></sub>and <it>CC </it>are the results for the training set and <it>S'</it><sub><it>n</it></sub>, <it>S'</it><sub><it>p</it></sub>, <it>A'</it><sub><it>c</it></sub> and <it>CC' </it>are the results for the test set. Each row corresponds to the row of Table 1 with the same "Order" value.</p>
               </caption>
               <tblbdy cols="9">
                  <r>
                     <c ca="center">
                        <p>Order</p>
                     </c>
                     <c ca="center">
                        <p><it>S</it><sub><it>n</it></sub>(%)</p>
                     </c>
                     <c ca="center">
                        <p><it>S</it><sub><it>p</it></sub>(%)</p>
                     </c>
                     <c ca="center">
                        <p><it>A</it><sub><it>c</it></sub>(%)</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>CC</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p><inline-formula><m:math name="1471-2105-9-113-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>S</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>n</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4uamLbauaadaWgaaWcbaGaemOBa4gabeaaaaa@2E9F@</m:annotation></m:semantics></m:math></inline-formula>(%)</p>
                     </c>
                     <c ca="center">
                        <p><inline-formula><m:math name="1471-2105-9-113-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>S</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>p</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4uamLbauaadaWgaaWcbaGaemiCaahabeaaaaa@2EA3@</m:annotation></m:semantics></m:math></inline-formula>(%)</p>
                     </c>
                     <c ca="center">
                        <p><inline-formula><m:math name="1471-2105-9-113-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>A</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>c</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmyqaeKbauaadaWgaaWcbaGaem4yamgabeaaaaa@2E65@</m:annotation></m:semantics></m:math></inline-formula>(%)</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>CC'</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>73.05</p>
                     </c>
                     <c ca="center">
                        <p>86.28</p>
                     </c>
                     <c ca="center">
                        <p>79.67</p>
                     </c>
                     <c ca="center">
                        <p>0.58</p>
                     </c>
                     <c ca="center">
                        <p>74.73</p>
                     </c>
                     <c ca="center">
                        <p>84.76</p>
                     </c>
                     <c ca="center">
                        <p>79.74</p>
                     </c>
                     <c ca="center">
                        <p>0.58</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>79.16</p>
                     </c>
                     <c ca="center">
                        <p>80.17</p>
                     </c>
                     <c ca="center">
                        <p>79.67</p>
                     </c>
                     <c ca="center">
                        <p>0.55</p>
                     </c>
                     <c ca="center">
                        <p>76.88</p>
                     </c>
                     <c ca="center">
                        <p>71.86</p>
                     </c>
                     <c ca="center">
                        <p>74.37</p>
                     </c>
                     <c ca="center">
                        <p>0.40</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>78.86</p>
                     </c>
                     <c ca="center">
                        <p>89.05</p>
                     </c>
                     <c ca="center">
                        <p>83.95</p>
                     </c>
                     <c ca="center">
                        <p>0.66</p>
                     </c>
                     <c ca="center">
                        <p>79.03</p>
                     </c>
                     <c ca="center">
                        <p>86.98</p>
                     </c>
                     <c ca="center">
                        <p>83.01</p>
                     </c>
                     <c ca="center">
                        <p>0.64</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>78.62</p>
                     </c>
                     <c ca="center">
                        <p>90.12</p>
                     </c>
                     <c ca="center">
                        <p>84.37</p>
                     </c>
                     <c ca="center">
                        <p>0.68</p>
                     </c>
                     <c ca="center">
                        <p>79.57</p>
                     </c>
                     <c ca="center">
                        <p>90.24</p>
                     </c>
                     <c ca="center">
                        <p>84.91</p>
                     </c>
                     <c ca="center">
                        <p>0.69</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>80.30</p>
                     </c>
                     <c ca="center">
                        <p>91.47</p>
                     </c>
                     <c ca="center">
                        <p>85.89</p>
                     </c>
                     <c ca="center">
                        <p>0.71</p>
                     </c>
                     <c ca="center">
                        <p>80.65</p>
                     </c>
                     <c ca="center">
                        <p>90.91</p>
                     </c>
                     <c ca="center">
                        <p>85.78</p>
                     </c>
                     <c ca="center">
                        <p>0.70</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>6</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>85.75</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>90.06</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>87.91</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.74</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>86.02</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>92.49</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>89.25</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.77</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>81.92</p>
                     </c>
                     <c ca="center">
                        <p>92.25</p>
                     </c>
                     <c ca="center">
                        <p>87.08</p>
                     </c>
                     <c ca="center">
                        <p>0.73</p>
                     </c>
                     <c ca="center">
                        <p>81.72</p>
                     </c>
                     <c ca="center">
                        <p>90.48</p>
                     </c>
                     <c ca="center">
                        <p>86.10</p>
                     </c>
                     <c ca="center">
                        <p>0.71</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>8</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>86.11</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>94.23</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>90.17</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.79</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>86.02</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>91.95</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>88.99</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.76</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>9</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>86.89</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>93.98</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>90.43</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.80</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>86.02</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>93.57</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>89.79</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.78</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>87.31</p>
                     </c>
                     <c ca="center">
                        <p>94.06</p>
                     </c>
                     <c ca="center">
                        <p>90.68</p>
                     </c>
                     <c ca="center">
                        <p>0.80</p>
                     </c>
                     <c ca="center">
                        <p>86.02</p>
                     </c>
                     <c ca="center">
                        <p>93.02</p>
                     </c>
                     <c ca="center">
                        <p>89.52</p>
                     </c>
                     <c ca="center">
                        <p>0.78</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>Overall, Tables <tblr tid="T1">1</tblr> and <tblr tid="T2">2</tblr> show that, when the methods are used independently, <b>M1 </b>performs best. The combined methods <b>M2 </b>and <b>M3 </b>improve the results, but the number of parameters in <b>M3 </b>is too high. Taking this into account, the preferred method is <b>M1 </b>or <b>M2</b>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>It is natural to ask whether the method of this paper performs better than existing methods. As in Ref. <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, we compare the present method with five promoter prediction tools that are available on-line, namely Neural Network Promoter Prediction (NNPP version 2.2) <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, Soft Berry (TSSW) <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, Dragon Promoter Finder version 1.5 (DPF) <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B29">29</abbr></abbrgrp>, Promoter 2.0 <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B30">30</abbr></abbrgrp> and Promoter Scan version 1.7 <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B31">31</abbr></abbrgrp>. To keep the workload reasonable, we run the comparison on only 10% of the promoter and non-promoter sequences used in Section 4 (186 promoter and 150 non-promoter sequences). The results are listed in Table <tblr tid="T3">3</tblr>. They indicate that our method performs better than the other tools on this test set.</p>
         <tbl id="T3">
            <title>
               <p>Table 3</p>
            </title>
            <caption>
               <p>The promoter prediction accuracies of five existing tools and of our method on the test data set of 186 promoter sequences and 150 non-promoter sequences.</p>
            </caption>
            <tblbdy cols="5">
               <r>
                  <c ca="center">
                     <p>Tool</p>
                  </c>
                  <c ca="center">
                     <p><it>S</it><sub><it>n</it></sub>(%)</p>
                  </c>
                  <c ca="center">
                     <p><it>S</it><sub><it>p</it></sub>(%)</p>
                  </c>
                  <c ca="center">
                     <p><it>A</it><sub><it>c</it></sub>(%)</p>
                  </c>
                  <c ca="center">
                     <p>
                        <it>CC</it>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="5">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>NNPP(threshold 0.8)</p>
                  </c>
                  <c ca="center">
                     <p>69.89</p>
                  </c>
                  <c ca="center">
                     <p>60.75</p>
                  </c>
                  <c ca="center">
                     <p>65.32</p>
                  </c>
                  <c ca="center">
                     <p>0.14</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>Soft Berry(TSSW)</p>
                  </c>
                  <c ca="center">
                     <p>67.74</p>
                  </c>
                  <c ca="center">
                     <p>81.29</p>
                  </c>
                  <c ca="center">
                     <p>74.52</p>
                  </c>
                  <c ca="center">
                     <p>0.48</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>Promoter Scan version 1.7</p>
                  </c>
                  <c ca="center">
                     <p>67.20</p>
                  </c>
                  <c ca="center">
                     <p>88.65</p>
                  </c>
                  <c ca="center">
                     <p>77.93</p>
                  </c>
                  <c ca="center">
                     <p>0.57</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>Dragon Promoter Finder version 1.5</p>
                  </c>
                  <c ca="center">
                     <p>30.65</p>
                  </c>
                  <c ca="center">
                     <p>65.52</p>
                  </c>
                  <c ca="center">
                     <p>48.08</p>
                  </c>
                  <c ca="center">
                     <p>0.12</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>Promoter 2.0 Prediction Server</p>
                  </c>
                  <c ca="center">
                     <p>52.15</p>
                  </c>
                  <c ca="center">
                     <p>91.51</p>
                  </c>
                  <c ca="center">
                     <p>71.83</p>
                  </c>
                  <c ca="center">
                     <p>0.49</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>Our method (<b>M3</b>)</p>
                  </c>
                  <c ca="center">
                     <p>86.02</p>
                  </c>
                  <c ca="center">
                     <p>93.57</p>
                  </c>
                  <c ca="center">
                     <p>89.79</p>
                  </c>
                  <c ca="center">
                     <p>0.78</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
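         <p>The four measures reported in Table 3 can be sketched in a few lines of code. This is an illustration under stated assumptions, not the authors' implementation: the formulas below are inferred from the tabulated values (sensitivity as TP/(TP+FN), specificity as TP/(TP+FP), accuracy as the mean of the two, and the correlation coefficient in its usual four-count form), and the function name and the counts in the usage note are hypothetical.</p>

```python
import math


def prediction_metrics(tp, fn, fp, tn):
    """Return Sn, Sp, Ac (in percent) and CC from the four confusion counts.

    Assumed definitions, inferred from the values in Table 3:
      Sn = TP / (TP + FN)
      Sp = TP / (TP + FP)
      Ac = (Sn + Sp) / 2
      CC = (TP*TN - FP*FN) / sqrt((TP+FP)(TN+FN)(TP+FN)(TN+FP))
    """
    sn = tp / (tp + fn)
    sp = tp / (tp + fp)
    ac = (sn + sp) / 2
    denom = math.sqrt((tp + fp) * (tn + fn) * (tp + fn) * (tn + fp))
    # Guard against an empty row or column of the confusion table.
    cc = (tp * tn - fp * fn) / denom if denom > 0 else 0.0
    return {"Sn": 100 * sn, "Sp": 100 * sp, "Ac": 100 * ac, "CC": cc}


# Hypothetical counts for 186 promoters and 150 non-promoters.
result = prediction_metrics(tp=160, fn=26, fp=11, tn=139)
print({name: round(value, 2) for name, value in result.items()})
```

         <p>The counts 160, 26, 11 and 139 are not reported in the text; they are chosen here only because, under the assumed definitions, they give values that round to the <b>M3 </b>row of Table 3.</p>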
         <p>However, using 90% of the promoter sequences as a training set and only 10% as a test set may not provide a fair comparison against these methods. A more realistic assessment uses 50% of the promoter sequences as a training set and the other 50% as a test set. We therefore apply this ratio of training and test sets in Fisher's algorithm to see whether the results from our method remain satisfactory. The results of <b>M1</b>, <b>M2 </b>and <b>M3 </b>are listed in Table <tblr tid="T4">4</tblr>. With the smaller training set, the accuracy <it>A</it><sub><it>c </it></sub>for the test set is, surprisingly, better than before, suggesting that our method is robust.</p>
         <tbl id="T4">
            <title>
               <p>Table 4</p>
            </title>
            <caption>
               <p>The accuracies for M1, M2 and M3 with 50% of the sequences as the training set and the remaining 50% as the test set in Fisher's discriminant algorithm.</p>
            </caption>
            <tblbdy cols="9">
               <r>
                  <c ca="center">
                     <p>Method</p>
                  </c>
                  <c ca="center">
                     <p><it>S</it><sub><it>n</it></sub>(%)</p>
                  </c>
                  <c ca="center">
                     <p><it>S</it><sub><it>p</it></sub>(%)</p>
                  </c>
                  <c ca="center">
                     <p><it>A</it><sub><it>c</it></sub>(%)</p>
                  </c>
                  <c ca="center">
                     <p>
                        <it>CC</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p><inline-formula><m:math name="1471-2105-9-113-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>S</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>n</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4uamLbauaadaWgaaWcbaGaemOBa4gabeaaaaa@2E9F@</m:annotation></m:semantics></m:math></inline-formula>(%)</p>
                  </c>
                  <c ca="center">
                     <p><inline-formula><m:math name="1471-2105-9-113-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>S</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>p</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4uamLbauaadaWgaaWcbaGaemiCaahabeaaaaa@2EA3@</m:annotation></m:semantics></m:math></inline-formula>(%)</p>
                  </c>
                  <c ca="center">
                     <p><inline-formula><m:math name="1471-2105-9-113-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>A</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>c</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmyqaeKbauaadaWgaaWcbaGaem4yamgabeaaaaa@2E65@</m:annotation></m:semantics></m:math></inline-formula>(%)</p>
                  </c>
                  <c ca="center">
                     <p>
                        <it>CC'</it>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="9">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>
                        <b>M1</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>81.67</p>
                  </c>
                  <c ca="center">
                     <p>89.53</p>
                  </c>
                  <c ca="center">
                     <p>87.60</p>
                  </c>
                  <c ca="center">
                     <p>0.73</p>
                  </c>
                  <c ca="center">
                     <p>91.49</p>
                  </c>
                  <c ca="center">
                     <p>85.50</p>
                  </c>
                  <c ca="center">
                     <p>88.49</p>
                  </c>
                  <c ca="center">
                     <p>0.73</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>
                        <b>M2</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>87.28</p>
                  </c>
                  <c ca="center">
                     <p>93.32</p>
                  </c>
                  <c ca="center">
                     <p>90.30</p>
                  </c>
                  <c ca="center">
                     <p>0.79</p>
                  </c>
                  <c ca="center">
                     <p>90.41</p>
                  </c>
                  <c ca="center">
                     <p>89.07</p>
                  </c>
                  <c ca="center">
                     <p>89.74</p>
                  </c>
                  <c ca="center">
                     <p>0.77</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>
                        <b>M3</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>88.25</p>
                  </c>
                  <c ca="center">
                     <p>93.17</p>
                  </c>
                  <c ca="center">
                     <p>90.71</p>
                  </c>
                  <c ca="center">
                     <p>0.80</p>
                  </c>
                  <c ca="center">
                     <p>90.52</p>
                  </c>
                  <c ca="center">
                     <p>89.74</p>
                  </c>
                  <c ca="center">
                     <p>90.13</p>
                  </c>
                  <c ca="center">
                     <p>0.78</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <p>Gangal and Sharma <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> used time series descriptors with a support vector machine (SVM) to distinguish promoter sequences from non-promoter sequences, and reported an accuracy of more than 85%. It would be interesting to see whether their method also works well on our test data set, but their tool Prometheus is not currently available, so the two methods cannot be compared on the same data set. Nevertheless, by using 80% of the data to train and the other 20% to test our method, which is the ratio used by Gangal and Sharma <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, we can make a rough comparison with the results they reported (<it>S</it><sub><it>n </it></sub>= 86% and <it>S</it><sub><it>p </it></sub>= 88%). Our results are listed in Table <tblr tid="T5">5</tblr>, which shows that our test-set values for <b>M3 </b>(<it>S</it><sub><it>n </it></sub>= 87.10% and <it>S</it><sub><it>p </it></sub>= 91.78%) are slightly better.</p>
         <tbl id="T5">
            <title>
               <p>Table 5</p>
            </title>
            <caption>
               <p>The accuracies for M1, M2 and M3 with 80% of the sequences as the training set and the remaining 20% as the test set in Fisher's discriminant algorithm.</p>
            </caption>
            <tblbdy cols="9">
               <r>
                  <c ca="center">
                     <p>Method</p>
                  </c>
                  <c ca="center">
                     <p><it>S</it><sub><it>n</it></sub>(%)</p>
                  </c>
                  <c ca="center">
                     <p><it>S</it><sub><it>p</it></sub>(%)</p>
                  </c>
                  <c ca="center">
                     <p><it>A</it><sub><it>c</it></sub>(%)</p>
                  </c>
                  <c ca="center">
                     <p>
                        <it>CC</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p><inline-formula><m:math name="1471-2105-9-113-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>S</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>n</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4uamLbauaadaWgaaWcbaGaemOBa4gabeaaaaa@2E9F@</m:annotation></m:semantics></m:math></inline-formula>(%)</p>
                  </c>
                  <c ca="center">
                     <p><inline-formula><m:math name="1471-2105-9-113-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>S</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>p</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4uamLbauaadaWgaaWcbaGaemiCaahabeaaaaa@2EA3@</m:annotation></m:semantics></m:math></inline-formula>(%)</p>
                  </c>
                  <c ca="center">
                     <p><inline-formula><m:math name="1471-2105-9-113-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>A</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>c</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmyqaeKbauaadaWgaaWcbaGaem4yamgabeaaaaa@2E65@</m:annotation></m:semantics></m:math></inline-formula>(%)</p>
                  </c>
                  <c ca="center">
                     <p>
                        <it>CC'</it>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="9">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>
                        <b>M1</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>85.78</p>
                  </c>
                  <c ca="center">
                     <p>89.65</p>
                  </c>
                  <c ca="center">
                     <p>87.71</p>
                  </c>
                  <c ca="center">
                     <p>0.73</p>
                  </c>
                  <c ca="center">
                     <p>87.10</p>
                  </c>
                  <c ca="center">
                     <p>88.28</p>
                  </c>
                  <c ca="center">
                     <p>87.69</p>
                  </c>
                  <c ca="center">
                     <p>0.73</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>
                        <b>M2</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>86.39</p>
                  </c>
                  <c ca="center">
                     <p>93.71</p>
                  </c>
                  <c ca="center">
                     <p>90.05</p>
                  </c>
                  <c ca="center">
                     <p>0.79</p>
                  </c>
                  <c ca="center">
                     <p>87.90</p>
                  </c>
                  <c ca="center">
                     <p>91.09</p>
                  </c>
                  <c ca="center">
                     <p>89.49</p>
                  </c>
                  <c ca="center">
                     <p>0.77</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>
                        <b>M3</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>86.86</p>
                  </c>
                  <c ca="center">
                     <p>93.88</p>
                  </c>
                  <c ca="center">
                     <p>90.37</p>
                  </c>
                  <c ca="center">
                     <p>0.79</p>
                  </c>
                  <c ca="center">
                     <p>87.10</p>
                  </c>
                  <c ca="center">
                     <p>91.78</p>
                  </c>
                  <c ca="center">
                     <p>89.44</p>
                  </c>
                  <c ca="center">
                     <p>0.77</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <p>Finally, it is important to test our method on real human DNA sequences. For example, a sliding window technique with a window size of 600 bp and a step size of 10 bp can be used to recognize promoter sequences in human DNA sequences, similar to the technique adopted by Gao and Zhang <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> to recognize exons. However, because promoter sequences are not clearly marked in the human DNA sequences, we cannot use this approach to test our method. Instead, as in Refs. <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B33">33</abbr></abbrgrp>, we use human chromosome 22, in which 20 promoters are experimentally verified <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. The sequences and their accession numbers are given in Table <tblr tid="T1">1</tblr> of Ref. <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. As AB016655 and D86746 are not clearly annotated, we do not use them in the test, which leaves 18 promoters. We use 50% of the promoter (from EPD) and non-promoter (from EID) sequences to train <b>M1</b>. The coefficients in Fisher's algorithm <b>w </b>= (<it>w</it><sub>1</sub>, <it>w</it><sub>2</sub>, &#8943;, <it>w</it><sub>36</sub>) are determined from the training set. A sequence is classified as a promoter if <b>Z</b>(<b>x</b>) &gt; <b>0</b> and as a non-promoter if <b>Z</b>(<b>x</b>) &lt; <b>0</b>. All of the promoter sequences except AF047576 are correctly identified (17 of 18). This suggests that the global descriptor <it>GD </it>(<b>M1</b>), with its small number of parameters (36), is a practical method.</p>
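         <p>The sliding window scan mentioned above (window size 600 bp, step size 10 bp) can be sketched as follows. This is a minimal illustration rather than the procedure used in this paper: the names <it>sliding_windows </it>and <it>scan </it>are hypothetical, and the <it>score </it>argument is a stand-in for a trained discriminant such as <b>Z</b>(<b>x</b>), whose feature extraction is not reproduced here.</p>

```python
from typing import Callable, Iterator, List, Tuple


def sliding_windows(dna: str, window: int = 600, step: int = 10) -> Iterator[Tuple[int, str]]:
    """Yield (start, subsequence) for every full-length window along dna."""
    for start in range(0, len(dna) - window + 1, step):
        yield start, dna[start:start + window]


def scan(dna: str, score: Callable[[str], float],
         window: int = 600, step: int = 10) -> List[int]:
    """Return the start offsets of windows whose score is positive.

    score is a hypothetical stand-in for a trained discriminant:
    a positive value is read as 'promoter', a negative one as 'non-promoter'.
    """
    return [start for start, seq in sliding_windows(dna, window, step)
            if score(seq) > 0]


# Toy usage with a placeholder scoring rule (not a real promoter model).
toy_score = lambda seq: 1.0 if seq.startswith("GG") else -1.0
print(scan("AAGGAAGG", toy_score, window=2, step=2))
```

         <p>A sequence shorter than the window yields no windows at all, so such inputs are silently skipped in this sketch.</p>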
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Promoter prediction is a difficult but important problem in gene finding, and it is critical for elucidating the regulation of gene expression <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. We use two kinds of multifractal analysis on the free energy sequences of promoter/non-promoter, Z curve analysis, and the global descriptor for the primary sequences of promoter/non-promoter. A total of 141 parameters are extracted from these four methods. These parameters are used in both independent and combined ways to distinguish promoter sequences from non-promoter sequences.</p>
         <p>Fisher's linear discriminant algorithm provides a quantitative assessment of the recognition methods. If we use these methods independently, the global descriptor of the promoter/non-promoter sequences is the best method based on Fisher's algorithm. Combinations of various methods show that the accuracies can be improved in some cases, but the improvements are not simply due to the increase in the number of parameters. With all 141 parameters together, the results are satisfactory. However, the number of parameters is too high in this case. The number can be reduced because there is some redundancy/correlation among these parameters. In the case of 117 parameters, similar results are achieved, with the discriminant accuracies <it>p</it><sub><it>c</it></sub>, <it>p</it><sub><it>nc</it></sub>, <it>q</it><sub><it>c</it></sub>, and <it>q</it><sub><it>nc </it></sub>reaching 86.89%, 93.11%, 86.02% and 92.67%, respectively. The related sensitivity <it>S</it><sub><it>n</it></sub>, specificity <it>S</it><sub><it>p</it></sub>, accuracy <it>A</it><sub><it>c </it></sub>and correlation coefficient <it>CC </it>for the test set reach 86.02%, 93.57%, 89.79% and 0.78, respectively. A smaller number of parameters (96) also produces relatively satisfactory results. The global descriptor method with only 36 parameters successfully identifies 17 of the 18 experimentally verified promoters in human chromosome 22 <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Recognition of promoter sequences with such satisfactory accuracy indicates that the methods are promising for human Pol II promoter prediction.</p>
         <p>The main aim of this work is to develop efficient algorithms that can discriminate between promoters and non-promoters in a given sequence. Another challenge to be addressed is the localization of promoters, rather than the simple classification considered in current methods <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Multifractal analysis, which is especially useful in many other fields <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>, seems to capture some information relevant to promoter recognition (see the first line in Table <tblr tid="T1">1</tblr>). But if we use method <b>M3</b>, multifractal analysis does not significantly improve the performance. The methods considered in this paper seem promising for enhancing the performance of biomolecular sequence analysis, and promoter prediction in particular. It is a challenge to predict promoter sequences directly from the real human genome. However, it would be helpful to first use the ENCODE pilot project data set, which spans about 1% of the human genome sequence <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Our future work aims to contribute towards this challenging problem.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Conversion of the original data</p>
            </st>
            <p>Some studies have suggested that various properties, such as stability, bendability and curvature, of the region immediately upstream of the TSS differ from those of the downstream region <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. The upstream region is less stable, more rigid and more curved than the downstream region. Kanhere and Bansal <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> predicted prokaryotic promoters based on this difference in DNA stability. We convert the original sequences into new numeric sequences according to the free energy of dinucleotides. A sliding window of size 2 nt is used and moved one base pair forward each time. The numeric sequences can be smoothed with a larger window size. For more details on the smoothing method, one can refer to Ref. <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. The free energy values corresponding to the 10 unique dinucleotides are taken from the unified parameters proposed in Ref. <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. They are: AA/TT = -1.00 <it>kcal/mol</it>, AT/TA = -0.88 <it>kcal/mol</it>, TA/AT = -0.58 <it>kcal/mol</it>, CA/GT = -1.45 <it>kcal/mol</it>, CT/GA = -1.44 <it>kcal/mol</it>, GT/CA = -1.28 <it>kcal/mol</it>, GA/CT = -1.30 <it>kcal/mol</it>, CG/GC = -2.17 <it>kcal/mol</it>, GC/CG = -2.24 <it>kcal/mol</it>, GG/CC = -1.84 <it>kcal/mol</it>. Each of the ten values is increased by 2.24 <it>kcal/mol </it>(the negative of the smallest free energy) so that all the values are larger than or equal to zero; this allows a measure to be constructed from the time series for the multifractal method in the following analysis. For example, the free energy sequence for one of the promoter sequences with a sliding window of size 2 nt is given in Figure <figr fid="F1">1</figr>.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>The free energy sequence of one promoter sequence</p>
               </caption>
               <text>
                  <p><b>The free energy sequence of one promoter sequence.</b> See text for a detailed description of how this numeric sequence is obtained.</p>
               </text>
               <graphic file="1471-2105-9-113-1"/>
            </fig>
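            <p>The conversion described above can be sketched as follows. This is a minimal illustration, not the authors' code: the names are hypothetical, the ten values are copied from the list in the text, and the expansion to all 16 dinucleotides assumes that each entry X/Y denotes a dinucleotide step whose reverse complement on the opposite strand shares the same free energy. No smoothing is applied.</p>

```python
# Free energies (kcal/mol) of the 10 unique dinucleotide steps, as listed in the text.
# Each key is the dinucleotide read 5' to 3' on one strand.
UNIQUE_STEPS = {
    "AA": -1.00, "AT": -0.88, "TA": -0.58, "CA": -1.45, "CT": -1.44,
    "GT": -1.28, "GA": -1.30, "CG": -2.17, "GC": -2.24, "GG": -1.84,
}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}
SHIFT = 2.24  # negative of the smallest free energy, so shifted values are non-negative


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


# Assumption: a step and its reverse complement have the same free energy.
FREE_ENERGY = {}
for step, value in UNIQUE_STEPS.items():
    FREE_ENERGY[step] = value
    FREE_ENERGY[reverse_complement(step)] = value


def free_energy_sequence(dna):
    """Slide a 2 nt window one base at a time and return the shifted free energies."""
    dna = dna.upper()
    return [round(FREE_ENERGY[dna[i:i + 2]] + SHIFT, 2) for i in range(len(dna) - 1)]


print(free_energy_sequence("AAGC"))
```

            <p>A sequence of length <it>N </it>gives <it>N </it>- 1 values. Ambiguous bases such as N are not handled in this sketch and raise a KeyError.</p>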
         </sec>
         <sec>
            <st>
               <p>Multifractal analysis (<it>MFA</it>)</p>
            </st>
            <p>Let <it>T</it><sub><it>t</it></sub>, <it>t </it>= 1, 2, &#8943;, <it>N</it>, be the numeric sequence of a promoter/non-promoter with length <it>N</it>. First, we define</p>
            <p>
               <display-formula id="M10">
                  <m:math name="1471-2105-9-113-i12" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>F</m:mi>
                                          <m:mi>t</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>T</m:mi>
                                                <m:mi>t</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mstyle displaystyle="true">
                                                <m:munderover>
                                                   <m:mo>&#8721;</m:mo>
                                                   <m:mrow>
                                                      <m:mi>j</m:mi>
                                                      <m:mo>=</m:mo>
                                                      <m:mn>1</m:mn>
                                                   </m:mrow>
                                                   <m:mi>N</m:mi>
                                                </m:munderover>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>T</m:mi>
                                                      <m:mi>j</m:mi>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:mstyle>
                                          </m:mrow>
                                       </m:mfrac>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>t</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>2</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mo>&#8943;</m:mo>
                                       <m:mo>,</m:mo>
                                       <m:mi>N</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeGaaaqaaiabdAeagnaaBaaaleaacqWG0baDaeqaaOGaeyypa0tcfa4aaSaaaeaacqWGubavdaWgaaqaaiabdsha0bqabaaabaWaaabCaeaacqWGubavdaWgaaqaaiabdQgaQbqabaaabaGaemOAaOMaeyypa0JaeGymaedabaGaemOta4eacqGHris5aaaakiabcYcaSaqaaiabcIcaOiabdsha0jabg2da9iabigdaXiabcYcaSiabikdaYiabcYcaSiabl+UimjabcYcaSiabd6eaojabcMcaPaaaaaa@494A@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>to be the frequency of <it>T</it><sub><it>t</it></sub>. It follows that <inline-formula><m:math name="1471-2105-9-113-i13" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>t</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>N</m:mi></m:munderover><m:mrow><m:msub><m:mi>F</m:mi><m:mi>t</m:mi></m:msub><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaabCaeaacqWGgbGrdaWgaaWcbaGaemiDaqhabeaakiabg2da9iabigdaXaWcbaGaemiDaqNaeyypa0JaeGymaedabaGaemOta4eaniabggHiLdaaaa@3753@</m:annotation></m:semantics></m:math></inline-formula>. We define a measure <it>&#956; </it>on the interval [0, 1) by</p>
            <p>
               <display-formula id="M11"><it>&#956;</it>(<it>dx</it>) = <it>Y</it>(<it>x</it>) <it>dx</it>,</display-formula>
            </p>
            <p>where</p>
            <p>
               <display-formula id="M12">
                  <m:math name="1471-2105-9-113-i14" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>Y</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>x</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>=</m:mo>
                                       <m:mi>N</m:mi>
                                       <m:mo>&#215;</m:mo>
                                       <m:msub>
                                          <m:mi>F</m:mi>
                                          <m:mi>t</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>T</m:mi>
                                                <m:mi>t</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mfrac>
                                                <m:mn>1</m:mn>
                                                <m:mi>N</m:mi>
                                             </m:mfrac>
                                             <m:mstyle displaystyle="true">
                                                <m:munderover>
                                                   <m:mo>&#8721;</m:mo>
                                                   <m:mrow>
                                                      <m:mi>j</m:mi>
                                                      <m:mo>=</m:mo>
                                                      <m:mn>1</m:mn>
                                                   </m:mrow>
                                                   <m:mi>N</m:mi>
                                                </m:munderover>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>T</m:mi>
                                                      <m:mi>j</m:mi>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:mstyle>
                                          </m:mrow>
                                       </m:mfrac>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>x</m:mi>
                                       <m:mo>&#8712;</m:mo>
                                       <m:mo stretchy="false">[</m:mo>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:mi>t</m:mi>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mn>1</m:mn>
                                          </m:mrow>
                                          <m:mi>N</m:mi>
                                       </m:mfrac>
                                       <m:mo>,</m:mo>
                                       <m:mfrac>
                                          <m:mi>t</m:mi>
                                          <m:mi>N</m:mi>
                                       </m:mfrac>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>.</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeGaaaqaaiabdMfazjabcIcaOiabdIha4jabcMcaPiabg2da9iabd6eaojabgEna0kabdAeagnaaBaaaleaacqWG0baDaeqaaOGaeyypa0tcfa4aaSaaaeaacqWGubavdaWgaaqaaiabdsha0bqabaaabaWaaSaaaeaacqaIXaqmaeaacqWGobGtaaWaaabCaeaacqWGubavdaWgaaqaaiabdQgaQbqabaaabaGaemOAaOMaeyypa0JaeGymaedabaGaemOta4eacqGHris5aaaakiabcYcaSaqaaiabdIha4jabgIGiolabcUfaBLqbaoaalaaabaGaemiDaqNaeyOeI0IaeGymaedabaGaemOta4eaaOGaeiilaWscfa4aaSaaaeaacqWG0baDaeaacqWGobGtaaGccqGGPaqkcqGGUaGlaaaaaa@578B@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>We denote the interval <inline-formula><m:math name="1471-2105-9-113-i15" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mo stretchy="false">[</m:mo><m:mfrac><m:mrow><m:mi>t</m:mi><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow><m:mi>N</m:mi></m:mfrac><m:mo>,</m:mo><m:mfrac><m:mi>t</m:mi><m:mi>N</m:mi></m:mfrac><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaei4waSvcfa4aaSaaaeaacqWG0baDcqGHsislcqaIXaqmaeaacqWGobGtaaGccqGGSaaljuaGdaWcaaqaaiabdsha0bqaaiabd6eaobaakiabcMcaPaaa@3724@</m:annotation></m:semantics></m:math></inline-formula> by <it>I</it><sub><it>t</it></sub>. It is easy to see that <it>&#956;</it>([0, 1)) = 1 and <it>&#956;</it>(<it>I</it><sub><it>t</it></sub>) = <it>F</it><sub><it>t</it></sub>. We call <it>&#956;</it>(<it>x</it>) the <it>measure representation </it><abbrgrp><abbr bid="B26">26</abbr><abbr bid="B41">41</abbr></abbrgrp> for the numeric sequence of a promoter/non-promoter.</p>
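            <p>As an illustration of the measure representation defined above, the following Python sketch normalises a non-negative numeric sequence so that the mass assigned to the <it>t</it>-th interval equals <it>F</it><sub><it>t</it></sub>. The function name <it>measure_representation</it> is hypothetical and the sketch is not taken from the original implementation.</p>

```python
def measure_representation(T):
    """Return F_t = T_t / sum_j T_j, the mass of the t-th interval [(t-1)/N, t/N)."""
    total = sum(T)
    if total == 0:
        # The measure is undefined when every value of the sequence is zero.
        raise ValueError("the numeric sequence must have a positive sum")
    return [value / total for value in T]
```

            <p>The returned masses sum to 1, in agreement with <it>&#956;</it>([0, 1)) = 1. The density <it>Y</it>(<it>x</it>) on the <it>t</it>-th interval is <it>N </it>times the corresponding mass.</p>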
            <p>The most common algorithms of multifractal analysis are the so-called <it>fixed-size box-counting algorithms </it><abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. In the one-dimensional case, for a given measure <it>&#956; </it>with support <it>E </it>&#8834; &#8477;, we consider the <it>partition sum</it></p>
            <p>
               <display-formula id="M13">
                  <m:math name="1471-2105-9-113-i16" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>Z</m:mi>
                                          <m:mi>&#949;</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>q</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>=</m:mo>
                                       <m:mstyle displaystyle="true">
                                          <m:munder>
                                             <m:mo>&#8721;</m:mo>
                                             <m:mrow>
                                                <m:mi>&#956;</m:mi>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:mi>B</m:mi>
                                                <m:mo stretchy="false">)</m:mo>
                                                <m:mo>&#8800;</m:mo>
                                                <m:mn>0</m:mn>
                                             </m:mrow>
                                          </m:munder>
                                          <m:mrow>
                                             <m:msup>
                                                <m:mrow>
                                                   <m:mo stretchy="false">[</m:mo>
                                                   <m:mi>&#956;</m:mi>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mi>B</m:mi>
                                                   <m:mo stretchy="false">)</m:mo>
                                                   <m:mo stretchy="false">]</m:mo>
                                                </m:mrow>
                                                <m:mi>q</m:mi>
                                             </m:msup>
                                          </m:mrow>
                                       </m:mstyle>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>q</m:mi>
                                       <m:mo>&#8712;</m:mo>
                                       <m:mi>&#8477;</m:mi>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeGaaaqaaiabdQfaAnaaBaaaleaaiiGacqWF1oqzaeqaaOGaeiikaGIaemyCaeNaeiykaKIaeyypa0ZaaabuaeaacqGGBbWwcqWF8oqBcqGGOaakcqWGcbGqcqGGPaqkcqGGDbqxdaahaaWcbeqaaiabdghaXbaaaeaacqWF8oqBcqGGOaakcqWGcbGqcqGGPaqkcqGHGjsUcqaIWaamaeqaniabggHiLdGccqGGSaalaeaacqWGXbqCcqGHiiIZtuuDJXwAK1uy0HMmaeHbfv3ySLgzG0uy0HgiuD3BaGabaiab+1risjabcYcaSaaaaaa@54A6@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where the sum runs over all different nonempty boxes <it>B </it>of a given side <it>&#949; </it>in a grid covering of the support <it>E</it>, that is,</p>
            <p>
               <display-formula id="M14"><it>B </it>= [<it>k&#949;</it>, (<it>k </it>+ 1)<it>&#949;</it>).</display-formula>
            </p>
            <p>The <it>mass exponent &#964; </it>(<it>q</it>) is defined <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp> as</p>
            <p>
               <display-formula id="M15">
                  <m:math name="1471-2105-9-113-i17" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>&#964;</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>q</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:munder>
                              <m:mrow>
                                 <m:mi>lim</m:mi>
                                 <m:mo>&#8289;</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>&#949;</m:mi>
                                 <m:mo>&#8594;</m:mo>
                                 <m:mn>0</m:mn>
                              </m:mrow>
                           </m:munder>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>ln</m:mi>
                                 <m:mo>&#8289;</m:mo>
                                 <m:msub>
                                    <m:mi>Z</m:mi>
                                    <m:mi>&#949;</m:mi>
                                 </m:msub>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>q</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>ln</m:mi>
                                 <m:mo>&#8289;</m:mo>
                                 <m:mi>&#949;</m:mi>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGae8hXdqNaeiikaGIaemyCaeNaeiykaKIaeyypa0ZaaCbeaeaacyGGSbaBcqGGPbqAcqGGTbqBaSqaaiab=v7aLjabgkziUkabicdaWaqabaqcfa4aaSaaaeaacyGGSbaBcqGGUbGBcqWGAbGwdaWgaaqaaiab=v7aLbqabaGaeiikaGIaemyCaeNaeiykaKcabaGagiiBaWMaeiOBa4Mae8xTdugaaaaa@48CD@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>and the generalized <it>fractal dimensions </it><abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp> of the measure are defined as</p>
            <p>
               <display-formula id="M16">
                  <m:math name="1471-2105-9-113-i18" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>D</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>q</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>=</m:mo>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:mi>&#964;</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>q</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>q</m:mi>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mn>1</m:mn>
                                          </m:mrow>
                                       </m:mfrac>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mtext>for</m:mtext>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>q</m:mi>
                                       <m:mo>&#8800;</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeWaaaqaaiabdseaejabcIcaOiabdghaXjabcMcaPiabg2da9KqbaoaalaaabaacciGae8hXdqNaeiikaGIaemyCaeNaeiykaKcabaGaemyCaeNaeyOeI0IaeGymaedaaOGaeiilaWcabaGaemOzayMaem4Ba8MaemOCaihabaGaemyCaeNaeyiyIKRaeGymaeJaeiilaWcaaaaa@4448@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>and</p>
            <p>
               <display-formula id="M17">
                  <m:math name="1471-2105-9-113-i19" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>D</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>q</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>=</m:mo>
                                       <m:munder>
                                          <m:mrow>
                                             <m:mi>lim</m:mi>
                                             <m:mo>&#8289;</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>&#949;</m:mi>
                                             <m:mo>&#8594;</m:mo>
                                             <m:mn>0</m:mn>
                                          </m:mrow>
                                       </m:munder>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>Z</m:mi>
                                                <m:mrow>
                                                   <m:mn>1</m:mn>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>&#949;</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>ln</m:mi>
                                             <m:mo>&#8289;</m:mo>
                                             <m:mi>&#949;</m:mi>
                                          </m:mrow>
                                       </m:mfrac>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mtext>for</m:mtext>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>q</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeWaaaqaaiabdseaejabcIcaOiabdghaXjabcMcaPiabg2da9maaxababaGagiiBaWMaeiyAaKMaeiyBa0galeaaiiGacqWF1oqzcqGHsgIRcqaIWaamaeqaaKqbaoaalaaabaGaemOwaO1aaSbaaeaacqaIXaqmcqGGSaalcqWF1oqzaeqaaaqaaiGbcYgaSjabc6gaUjab=v7aLbaakiabcYcaSaqaaiabdAgaMjabd+gaVjabdkhaYbqaaiabdghaXjabg2da9iabigdaXiabcYcaSaaaaaa@4D6E@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <inline-formula><m:math name="1471-2105-9-113-i20" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>Z</m:mi><m:mrow><m:mn>1</m:mn><m:mo>,</m:mo><m:mi>&#949;</m:mi></m:mrow></m:msub><m:mo>=</m:mo><m:mstyle displaystyle="true"><m:munder><m:mo>&#8721;</m:mo><m:mrow><m:mi>&#956;</m:mi><m:mo stretchy="false">(</m:mo><m:mi>B</m:mi><m:mo stretchy="false">)</m:mo><m:mo>&#8800;</m:mo><m:mn>0</m:mn></m:mrow></m:munder><m:mrow><m:mi>&#956;</m:mi><m:mo stretchy="false">(</m:mo><m:mi>B</m:mi><m:mo stretchy="false">)</m:mo><m:mi>ln</m:mi><m:mo>&#8289;</m:mo><m:mi>&#956;</m:mi><m:mo stretchy="false">(</m:mo><m:mi>B</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOwaO1aaSbaaSqaaiabigdaXiabcYcaSGGaciab=v7aLbqabaGccqGH9aqpdaaeqbqaaiab=X7aTjabcIcaOiabdkeacjabcMcaPiGbcYgaSjabc6gaUjab=X7aTjabcIcaOiabdkeacjabcMcaPaWcbaGae8hVd0MaeiikaGIaemOqaiKaeiykaKIaeyiyIKRaeGimaadabeqdcqGHris5aaaa@46B8@</m:annotation></m:semantics></m:math></inline-formula>. The generalized fractal dimensions are numerically estimated through a linear regression of ln <it>Z</it><sub><it>&#949; </it></sub>(<it>q</it>)/(<it>q </it>- 1) against ln <it>&#949; </it>for <it>q </it>&#8800; 1, and similarly through a linear regression of <it>Z</it><sub>1, <it>&#949; </it></sub>against ln <it>&#949; </it>for <it>q </it>= 1 <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B42">42</abbr><abbr bid="B45">45</abbr></abbrgrp>. <it>D</it>(1) is called the <it>information dimension </it>and <it>D</it>(2) the <it>correlation dimension </it><abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>.</p>
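            <p>The regression-based estimate of <it>D</it>(<it>q</it>) described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation used in this study: the function names are hypothetical, the input <it>F</it> is the list of interval masses <it>F</it><sub><it>t</it></sub>, each box is assumed to group a whole number of consecutive intervals (so that <it>&#949; </it>= <it>m</it>/<it>N</it>), and the default box sizes are arbitrary.</p>

```python
import math


def box_measures(F, m):
    """Masses of the grid boxes of side eps = m / N (m consecutive cells each)."""
    return [sum(F[k:k + m]) for k in range(0, len(F), m)]


def partition_sum(F, m, q):
    """Z_eps(q): sum of mu(B)**q over the boxes with non-zero mass."""
    return sum(mu ** q for mu in box_measures(F, m) if mu != 0)


def z1(F, m):
    """Z_{1,eps}: sum of mu(B) * ln mu(B) over the boxes with non-zero mass."""
    return sum(mu * math.log(mu) for mu in box_measures(F, m) if mu != 0)


def slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def generalized_dimension(F, q, box_cells=(1, 2, 4, 8, 16)):
    """Estimate D(q) as the regression slope against ln eps."""
    n = len(F)
    log_eps = [math.log(m / n) for m in box_cells]
    if q == 1:
        ys = [z1(F, m) for m in box_cells]
    else:
        ys = [math.log(partition_sum(F, m, q)) / (q - 1) for m in box_cells]
    return slope(log_eps, ys)
```

            <p>As a sanity check, a uniform measure has no multifractal structure, so the sketch should return a value close to 1 for every <it>q</it>.</p>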
            <p>The concept of <it>phase transitions </it>in multifractal spectra was introduced in the study of logistic maps, Julia sets, and other simple systems. Evidence of a phase transition was found in the multifractal spectrum of diffusion-limited aggregation <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. By following the thermodynamic formulation of multifractal measures, Canessa <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> derived an expression for the analogous specific heat as</p>
            <p>
               <display-formula id="M18">
                  <m:math name="1471-2105-9-113-i21" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>C</m:mi>
                              <m:mi>q</m:mi>
                           </m:msub>
                           <m:mo>&#8801;</m:mo>
                           <m:mo>&#8722;</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:msup>
                                    <m:mo>&#8706;</m:mo>
                                    <m:mn>2</m:mn>
                                 </m:msup>
                                 <m:mi>&#964;</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>q</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mo>&#8706;</m:mo>
                                 <m:msup>
                                    <m:mi>q</m:mi>
                                    <m:mn>2</m:mn>
                                 </m:msup>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>&#8776;</m:mo>
                           <m:mn>2</m:mn>
                           <m:mi>&#964;</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>q</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>&#8722;</m:mo>
                           <m:mi>&#964;</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>q</m:mi>
                           <m:mo>+</m:mo>
                           <m:mn>1</m:mn>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>&#8722;</m:mo>
                           <m:mi>&#964;</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>q</m:mi>
                           <m:mo>&#8722;</m:mo>
                           <m:mn>1</m:mn>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4qam0aaSbaaSqaaiabdghaXbqabaGccqGHHjIUcqGHsisljuaGdaWcaaqaaiabgkGi2oaaCaaabeqaaiabikdaYaaaiiGacqWFepaDcqGGOaakcqWGXbqCcqGGPaqkaeaacqGHciITcqWGXbqCdaahaaqabeaacqaIYaGmaaaaaOGaeyisISRaeGOmaiJae8hXdqNaeiikaGIaemyCaeNaeiykaKIaeyOeI0Iae8hXdqNaeiikaGIaemyCaeNaey4kaSIaeGymaeJaeiykaKIaeyOeI0Iae8hXdqNaeiikaGIaemyCaeNaeyOeI0IaeGymaeJaeiykaKIaeiOla4caaa@551E@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>He showed that the form of <it>C</it><sub><it>q </it></sub>resembles a classical phase transition at a critical point for financial time series.</p>
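            <p>As a minimal numerical sketch of the three-point approximation of <it>C</it><sub><it>q</it></sub> given above, the Python fragment below evaluates 2<it>&#964;</it>(<it>q</it>) &#8722; <it>&#964;</it>(<it>q</it> + 1) &#8722; <it>&#964;</it>(<it>q</it> &#8722; 1) for any callable <it>&#964;</it>. The function names and the binomial-cascade <it>&#964;</it>(<it>q</it>) used as a test input are illustrative assumptions and are not part of the method described here.</p>
            <p><![CDATA[
```python
import math


def binomial_tau(q, p=0.3):
    """tau(q) of a binomial cascade with weights p and 1 - p.

    Hypothetical test input only; it is not the measure used in the article.
    """
    return -math.log2(p ** q + (1.0 - p) ** q)


def specific_heat(tau, q):
    """Three-point approximation C_q ~ 2*tau(q) - tau(q + 1) - tau(q - 1)."""
    return 2.0 * tau(q) - tau(q + 1) - tau(q - 1)


if __name__ == "__main__":
    # Print the C_q curve on an integer grid chosen only for illustration.
    for q in range(-5, 6):
        print(q, specific_heat(binomial_tau, q))
```
]]></p>
            <p>For a uniform measure <it>&#964;</it>(<it>q</it>) is linear in <it>q</it>, so the sketch returns <it>C</it><sub><it>q</it></sub> = 0. Any non-zero value therefore reflects curvature of <it>&#964;</it>(<it>q</it>).</p>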
            <p>The singularities of a measure are characterized by the <it>Lipschitz-H&#246;lder exponent &#945;</it>(<it>q</it>) <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, which is related to <it>&#964; </it>(<it>q</it>) by</p>
            <p>
               <display-formula id="M19">
                  <m:math name="1471-2105-9-113-i22" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>&#945;</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>q</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mi>d</m:mi>
                              <m:mrow>
                                 <m:mi>d</m:mi>
                                 <m:mi>q</m:mi>
                              </m:mrow>
                           </m:mfrac>
                           <m:mi>&#964;</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>q</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGae8xSdeMaeiikaGIaemyCaeNaeiykaKIaeyypa0tcfa4aaSaaaeaacqWGKbazaeaacqWGKbazcqWGXbqCaaGccqWFepaDcqGGOaakcqWGXbqCcqGGPaqkcqGGUaGlaaa@3C60@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Substitution of Eq. (15) into Eq. (19) yields</p>
            <p>
               <display-formula id="M20">
                  <m:math name="1471-2105-9-113-i23" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>&#945;</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>q</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:munder>
                              <m:mrow>
                                 <m:mi>lim</m:mi>
                                 <m:mo>&#8289;</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>&#949;</m:mi>
                                 <m:mo>&#8594;</m:mo>
                                 <m:mn>0</m:mn>
                              </m:mrow>
                           </m:munder>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munder>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>&#956;</m:mi>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mi>B</m:mi>
                                          <m:mo stretchy="false">)</m:mo>
                                          <m:mo>&#8800;</m:mo>
                                          <m:mn>0</m:mn>
                                       </m:mrow>
                                    </m:munder>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">[</m:mo>
                                             <m:mi>&#956;</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>B</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">]</m:mo>
                                          </m:mrow>
                                          <m:mi>q</m:mi>
                                       </m:msup>
                                       <m:mi>ln</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mi>&#956;</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>B</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>Z</m:mi>
                                    <m:mi>&#949;</m:mi>
                                 </m:msub>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>q</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mi>ln</m:mi>
                                 <m:mo>&#8289;</m:mo>
                                 <m:mi>&#949;</m:mi>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGae8xSdeMaeiikaGIaemyCaeNaeiykaKIaeyypa0ZaaCbeaeaacyGGSbaBcqGGPbqAcqGGTbqBaSqaaiab=v7aLjabgkziUkabicdaWaqabaqcfa4aaSaaaeaadaaeqbqaaiabcUfaBjab=X7aTjabcIcaOiabdkeacjabcMcaPiabc2faDnaaCaaabeqaaiabdghaXbaacyGGSbaBcqGGUbGBcqWF8oqBcqGGOaakcqWGcbGqcqGGPaqkaeaacqWF8oqBcqGGOaakcqWGcbGqcqGGPaqkcqGHGjsUcqaIWaamaeqacqGHris5aaqaaiabdQfaAnaaBaaabaGae8xTdugabeaacqGGOaakcqWGXbqCcqGGPaqkcyGGSbaBcqGGUbGBcqWF1oqzaaGaeiOla4caaa@5FA8@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Again, the exponent <it>&#945;</it>(<it>q</it>) can be estimated as the slope of a linear regression of <inline-formula><m:math name="1471-2105-9-113-i24" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mo>{</m:mo><m:mstyle displaystyle="true"><m:munder><m:mo>&#8721;</m:mo><m:mrow><m:mi>&#956;</m:mi><m:mo stretchy="false">(</m:mo><m:mi>B</m:mi><m:mo stretchy="false">)</m:mo><m:mo>&#8800;</m:mo><m:mn>0</m:mn></m:mrow></m:munder><m:mrow><m:msup><m:mrow><m:mo stretchy="false">[</m:mo><m:mi>&#956;</m:mi><m:mo stretchy="false">(</m:mo><m:mi>B</m:mi><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">]</m:mo></m:mrow><m:mi>q</m:mi></m:msup><m:mi>ln</m:mi><m:mo>&#8289;</m:mo><m:mi>&#956;</m:mi><m:mo stretchy="false">(</m:mo><m:mi>B</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:mstyle><m:mo>}</m:mo><m:mo>/</m:mo><m:msub><m:mi>Z</m:mi><m:mi>&#949;</m:mi></m:msub><m:mo stretchy="false">(</m:mo><m:mi>q</m:mi><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaei4EaS3aaabuaeaacqGGBbWwiiGacqWF8oqBcqGGOaakcqWGcbGqcqGGPaqkcqGGDbqxdaahaaWcbeqaaiabdghaXbaakiGbcYgaSjabc6gaUjab=X7aTjabcIcaOiabdkeacjabcMcaPaWcbaGae8hVd0MaeiikaGIaemOqaiKaeiykaKIaeyiyIKRaeGimaadabeqdcqGHris5aOGaeiyFa0Naei4la8IaemOwaO1aaSbaaSqaaiab=v7aLbqabaGccqGGOaakcqWGXbqCcqGGPaqkaaa@4F11@</m:annotation></m:semantics></m:math></inline-formula> against ln <it>&#949;</it>. The multifractal spectrum <it>f </it>(<it>&#945;</it>) versus <it>&#945; </it>can be calculated according to a relationship known as the <it>Legendre transformation </it><abbrgrp><abbr bid="B44">44</abbr></abbrgrp>:</p>
            <p>
               <display-formula id="M21">
                  <m:math name="1471-2105-9-113-i25" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>f</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>&#945;</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:munder>
                              <m:mrow>
                                 <m:mi>min</m:mi>
                                 <m:mo>&#8289;</m:mo>
                              </m:mrow>
                              <m:mi>q</m:mi>
                           </m:munder>
                           <m:mo>{</m:mo>
                           <m:mi>q</m:mi>
                           <m:mi>&#945;</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>q</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>&#8722;</m:mo>
                           <m:mi>&#964;</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>q</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>}</m:mo>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOzayMaeiikaGccciGae8xSdeMaeiykaKIaeyypa0ZaaCbeaeaacyGGTbqBcqGGPbqAcqGGUbGBaSqaaiabdghaXbqabaGccqGG7bWEcqWGXbqCcqWFXoqycqGGOaakcqWGXbqCcqGGPaqkcqGHsislcqWFepaDcqGGOaakcqWGXbqCcqGGPaqkcqGG9bqFcqGGUaGlaaa@4774@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
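            <p>The regression estimate of <it>&#945;</it>(<it>q</it>) and the Legendre relation can be sketched in Python as follows. The sketch assumes that <it>&#964;</it>(<it>q</it>) is the slope of ln <it>Z</it><sub><it>&#949;</it></sub>(<it>q</it>) against ln <it>&#949;</it> (the form implied by Eq. (20)), uses the parametric form <it>f </it>= <it>q&#945;</it>(<it>q</it>) &#8722; <it>&#964;</it>(<it>q</it>), and takes box sizes that divide the sequence length. The binomial cascade is a hypothetical test measure, not the measure of Eq. (11), and all function names are illustrative.</p>
            <p><![CDATA[
```python
import math


def slope(xs, ys):
    """Ordinary least-squares slope of ys against xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def coarse_grain(mu, box):
    """Sum a fine-scale measure over consecutive boxes of `box` cells."""
    return [sum(mu[i:i + box]) for i in range(0, len(mu), box)]


def alpha_tau_f(mu, q, boxes):
    """Estimate alpha(q), tau(q) and f = q*alpha - tau by regression on ln(eps)."""
    total = float(sum(mu))
    log_eps, log_z, weighted = [], [], []
    for box in boxes:
        # Keep only boxes with non-zero mass, as in the sums over mu(B) != 0.
        masses = [m / total for m in coarse_grain(mu, box) if m > 0]
        z = sum(m ** q for m in masses)
        log_eps.append(math.log(box / len(mu)))
        log_z.append(math.log(z))
        weighted.append(sum(m ** q * math.log(m) for m in masses) / z)
    alpha = slope(log_eps, weighted)
    tau = slope(log_eps, log_z)
    return alpha, tau, q * alpha - tau


def binomial_cascade(p, levels):
    """Hypothetical test measure: split every cell into masses p and 1 - p."""
    mu = [1.0]
    for _ in range(levels):
        mu = [w * m for m in mu for w in (p, 1.0 - p)]
    return mu


if __name__ == "__main__":
    measure = binomial_cascade(0.3, 8)
    for q in (-2.0, 0.0, 1.0, 2.0):
        print(q, alpha_tau_f(measure, q, [1, 2, 4, 8, 16, 32]))
```
]]></p>
            <p>Sweeping <it>q</it> over a range and plotting the returned <it>f </it>against <it>&#945;</it> traces out the spectrum. For real sequences the choice of box sizes and of the <it>q</it> range affects the fit and should be checked against the linearity of the regression.</p>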
            <p>We first construct a measure for each numeric sequence according to Eq. (11), then analyze it with the multifractal method described above. The <it>D</it>(<it>q</it>), <it>C</it><sub><it>q</it></sub>, <it>&#945;</it>(<it>q</it>) and <it>f </it>(<it>&#945;</it>) curves for one promoter, one exon and one intron sequence are shown in Figure <figr fid="F2">2</figr>. We select five parameters from <it>MFA </it>to distinguish promoter from non-promoter sequences: <it>D</it>(2), <it>C</it><sub>1</sub>, <it>C</it><sub><it>max </it></sub>(the maximum value of <it>C</it><sub><it>q</it></sub>), &#916;<it>&#945; </it>= <it>&#945;</it><sub><it>max </it></sub>&#8722; <it>&#945;</it><sub><it>min </it></sub>and &#916;<it>f </it>= <it>f </it>(<it>&#945;</it><sub><it>max</it></sub>) &#8722; <it>f </it>(<it>&#945;</it><sub><it>min</it></sub>).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>The four kinds of fractal curves for the promoter, exon and intron sequences</p>
               </caption>
               <text>
                  <p><b>The four kinds of fractal curves for the promoter, exon and intron sequences.</b> The curves differ between the promoter and non-promoter (exon/intron) sequences, which suggests that values extracted from them can be used to distinguish promoter from non-promoter sequences.</p>
               </text>
               <graphic file="1471-2105-9-113-2"/>
            </fig>
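            <p>The extraction of the five <it>MFA </it>parameters can be sketched in Python as below. The sketch assumes the usual relation <it>D</it>(<it>q</it>) = <it>&#964;</it>(<it>q</it>)/(<it>q</it> &#8722; 1) for <it>q </it>&#8800; 1, the three-point approximation of <it>C</it><sub><it>q</it></sub>, and an integer grid of <it>q </it>values chosen purely for illustration; the range of <it>q </it>is not specified in this section. The binomial <it>&#964;</it>(<it>q</it>) and <it>&#945;</it>(<it>q</it>) are hypothetical test inputs, and all names are illustrative.</p>
            <p><![CDATA[
```python
import math


def binomial_tau(q, p=0.3):
    """tau(q) of a binomial cascade (hypothetical test input)."""
    return -math.log2(p ** q + (1.0 - p) ** q)


def binomial_alpha(q, p=0.3):
    """alpha(q) = d tau / d q for the same binomial cascade."""
    a, b = p ** q, (1.0 - p) ** q
    return -(a * math.log(p) + b * math.log(1.0 - p)) / ((a + b) * math.log(2.0))


def mfa_features(tau, alpha, q_values):
    """Return D(2), C_1, C_max, delta_alpha and delta_f from callables tau, alpha."""
    # Three-point approximation of C_q at every q on the grid.
    c = {q: 2.0 * tau(q) - tau(q + 1) - tau(q - 1) for q in q_values}
    # Parametric Legendre relation f = q * alpha(q) - tau(q).
    f = {q: q * alpha(q) - tau(q) for q in q_values}
    q_at_max = max(q_values, key=alpha)  # grid point where alpha is largest
    q_at_min = min(q_values, key=alpha)  # grid point where alpha is smallest
    return {
        "D2": tau(2) / (2 - 1),
        "C1": 2.0 * tau(1) - tau(2) - tau(0),
        "Cmax": max(c.values()),
        "delta_alpha": alpha(q_at_max) - alpha(q_at_min),
        "delta_f": f[q_at_max] - f[q_at_min],
    }


if __name__ == "__main__":
    print(mfa_features(binomial_tau, binomial_alpha, list(range(-5, 6))))
```
]]></p>
            <p>In practice <it>&#964;</it>(<it>q</it>) and <it>&#945;</it>(<it>q</it>) would come from the regressions described above rather than from closed-form expressions, and the resulting five values would form part of the feature vector for a sequence.</p>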
         </sec>
         <sec>
            <st>
               <p>Analogous multifractal analysis (<it>AMFA</it>)</p>
            </st>
            <p>Analogous multifractal analysis is similar to <it>multiaffinity analysis</it>, a method used in many fields. It was recently proposed in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. We denote a time series as <it>X</it>(<it>t</it>), <it>t </it>= 1, 2, &#8943;, <it>N</it>. First, the time series is integrated as</p>
            <p>
               <display-formula id="M22">
                  <m:math name="1471-2105-9-113-i26" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:msup>
                                             <m:mi>y</m:mi>
                                             <m:mo>&#8242;</m:mo>
                                          </m:msup>
                                          <m:mi>q</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>k</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>=</m:mo>
                                       <m:mstyle displaystyle="true">
                                          <m:munderover>
                                             <m:mo>&#8721;</m:mo>
                                             <m:mrow>
                                                <m:mi>t</m:mi>
                                                <m:mo>=</m:mo>
                                                <m:mn>1</m:mn>
                                             </m:mrow>
                                             <m:mi>k</m:mi>
                                          </m:munderover>
                                          <m:mrow>
                                             <m:msup>
                                                <m:mrow>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mi>X</m:mi>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mi>t</m:mi>
                                                   <m:mo stretchy="false">)</m:mo>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:msub>
                                                      <m:mi>X</m:mi>
                                                      <m:mrow>
                                                         <m:mi>a</m:mi>
                                                         <m:mi>v</m:mi>
                                                         <m:mi>e</m:mi>
                                                      </m:mrow>
                                                   </m:msub>
                                                   <m:mo stretchy="false">)</m:mo>
                                                </m:mrow>
                                                <m:mi>q</m:mi>
                                             </m:msup>
                                          </m:mrow>
                                       </m:mstyle>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>q</m:mi>
                                       <m:mo>&#8712;</m:mo>
                                       <m:msub>
                                          <m:mi>&#8484;</m:mi>
                                          <m:mo>+</m:mo>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>2</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mo>&#8943;</m:mo>
                                       <m:mo>,</m:mo>
                                       <m:mi>N</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeGaaaqaaiqbdMha5zaafaWaaSbaaSqaaiabdghaXbqabaGccqGGOaakcqWGRbWAcqGGPaqkcqGH9aqpdaaeWbqaaiabcIcaOiabdIfayjabcIcaOiabdsha0jabcMcaPiabgkHiTiabdIfaynaaBaaaleaacqWGHbqycqWG2bGDcqWGLbqzaeqaaOGaeiykaKYaaWbaaSqabeaacqWGXbqCaaaabaGaemiDaqNaeyypa0JaeGymaedabaGaem4AaSganiabggHiLdGccqGGSaalaeaacqGGOaakcqWGXbqCcqGHiiIZtuuDJXwAK1uy0HMmaeHbfv3ySLgzG0uy0HgiuD3BaGabaiab=rsiAnaaBaaaleaacqGHRaWkaeqaaOGaeiilaWIaem4AaSMaeyypa0JaeGymaeJaeiilaWIaeGOmaiJaeiilaWIaeS47IWKaeiilaWIaemOta4KaeiykaKcaaaaa@64C2@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>
               <display-formula id="M23">
                  <m:math name="1471-2105-9-113-i27" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>y</m:mi>
                                          <m:mi>q</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>k</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>=</m:mo>
                                       <m:mstyle displaystyle="true">
                                          <m:munderover>
                                             <m:mo>&#8721;</m:mo>
                                             <m:mrow>
                                                <m:mi>t</m:mi>
                                                <m:mo>=</m:mo>
                                                <m:mn>1</m:mn>
                                             </m:mrow>
                                             <m:mi>k</m:mi>
                                          </m:munderover>
                                          <m:mrow>
                                             <m:mo>|</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>t</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>X</m:mi>
                                                <m:mrow>
                                                   <m:mi>a</m:mi>
                                                   <m:mi>v</m:mi>
                                                   <m:mi>e</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                             <m:msup>
                                                <m:mo>|</m:mo>
                                                <m:mi>q</m:mi>
                                             </m:msup>
                                          </m:mrow>
                                       </m:mstyle>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>q</m:mi>
                                       <m:mo>&#8800;</m:mo>
                                       <m:mn>0</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mi>k</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>2</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mo>&#8943;</m:mo>
                                       <m:mo>,</m:mo>
                                       <m:mi>N</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeGaaaqaaiabdMha5naaBaaaleaacqWGXbqCaeqaaOGaeiikaGIaem4AaSMaeiykaKIaeyypa0ZaaabCaeaacqGG8baFcqWGybawcqGGOaakcqWG0baDcqGGPaqkcqGHsislcqWGybawdaWgaaWcbaGaemyyaeMaemODayNaemyzaugabeaakiabcYha8naaCaaaleqabaGaemyCaehaaaqaaiabdsha0jabg2da9iabigdaXaqaaiabdUgaRbqdcqGHris5aOGaeiilaWcabaGaeiikaGIaemyCaeNaeyiyIKRaeGimaaJaeiilaWIaem4AaSMaeyypa0JaeGymaeJaeiilaWIaeGOmaiJaeiilaWIaeS47IWKaeiilaWIaemOta4KaeiykaKcaaaaa@5B57@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>X</it><sub><it>ave </it></sub>is the average over the whole time period and <it>k </it>&#8712; [1, <it>N</it>]. Then two quantities <it>M</it><sub><it>q </it></sub>(<it>L</it>) and <inline-formula><m:math name="1471-2105-9-113-i28" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>M</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>q</m:mi></m:msub><m:mo stretchy="false">(</m:mo><m:mi>L</m:mi><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmyta0KbauaadaWgaaWcbaGaemyCaehabeaakiabcIcaOiabdYeamjabcMcaPaaa@3176@</m:annotation></m:semantics></m:math></inline-formula> are defined as</p>
            <p>
               <display-formula id="M24">
                  <m:math name="1471-2105-9-113-i29" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:msup>
                                             <m:mi>M</m:mi>
                                             <m:mo>&#8242;</m:mo>
                                          </m:msup>
                                          <m:mi>q</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>L</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>=</m:mo>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">[</m:mo>
                                             <m:msub>
                                                <m:mrow>
                                                   <m:mo>&#9001;</m:mo>
                                                   <m:mo>|</m:mo>
                                                   <m:msup>
                                                      <m:mi>y</m:mi>
                                                      <m:mo>&#8242;</m:mo>
                                                   </m:msup>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mi>j</m:mi>
                                                   <m:mo stretchy="false">)</m:mo>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:msup>
                                                      <m:mi>y</m:mi>
                                                      <m:mo>&#8242;</m:mo>
                                                   </m:msup>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mi>j</m:mi>
                                                   <m:mo>+</m:mo>
                                                   <m:mi>L</m:mi>
                                                   <m:mo stretchy="false">)</m:mo>
                                                   <m:mo>|</m:mo>
                                                   <m:mo>&#9002;</m:mo>
                                                </m:mrow>
                                                <m:mi>j</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">]</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mfrac>
                                                <m:mn>1</m:mn>
                                                <m:mi>q</m:mi>
                                             </m:mfrac>
                                          </m:mrow>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>q</m:mi>
                                       <m:mo>&#8712;</m:mo>
                                       <m:msub>
                                          <m:mi>&#8484;</m:mi>
                                          <m:mo>+</m:mo>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeGaaaqaaiqbd2eanzaafaWaaSbaaSqaaiabdghaXbqabaGccqGGOaakcqWGmbatcqGGPaqkcqGH9aqpcqGGBbWwcqGHPms4cqGG8baFcuWG5bqEgaqbaiabcIcaOiabdQgaQjabcMcaPiabgkHiTiqbdMha5zaafaGaeiikaGIaemOAaOMaey4kaSIaemitaWKaeiykaKIaeiiFaWNaeyOkJe=aaSbaaSqaaiabdQgaQbqabaGccqGGDbqxdaahaaWcbeqcfayaamaalaaabaGaeGymaedabaGaemyCaehaaaaakiabcYcaSaqaaiabcIcaOiabdghaXjabgIGioprr1ngBPrwtHrhAYaqeguuDJXwAKbstHrhAGq1DVbaceaGae8hjHO1aaSbaaSqaaiabgUcaRaqabaGccqGGPaqkaaaaaa@5E18@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>
               <display-formula id="M25">
                  <m:math name="1471-2105-9-113-i30" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>M</m:mi>
                                          <m:mi>q</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>L</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>=</m:mo>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">[</m:mo>
                                             <m:msub>
                                                <m:mrow>
                                                   <m:mo>&#9001;</m:mo>
                                                   <m:mo>|</m:mo>
                                                   <m:mi>y</m:mi>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mi>j</m:mi>
                                                   <m:mo stretchy="false">)</m:mo>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:mi>y</m:mi>
                                                   <m:mo stretchy="false">(</m:mo>
                                                   <m:mi>j</m:mi>
                                                   <m:mo>+</m:mo>
                                                   <m:mi>L</m:mi>
                                                   <m:mo stretchy="false">)</m:mo>
                                                   <m:mo>|</m:mo>
                                                   <m:mo>&#9002;</m:mo>
                                                </m:mrow>
                                                <m:mi>j</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">]</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mfrac>
                                                <m:mn>1</m:mn>
                                                <m:mi>q</m:mi>
                                             </m:mfrac>
                                          </m:mrow>
                                       </m:msup>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>q</m:mi>
                                       <m:mo>&#8800;</m:mo>
                                       <m:mn>0</m:mn>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeGaaaqaaiabd2eannaaBaaaleaacqWGXbqCaeqaaOGaeiikaGIaemitaWKaeiykaKIaeyypa0Jaei4waSLaeyykJeUaeiiFaWNaemyEaKNaeiikaGIaemOAaOMaeiykaKIaeyOeI0IaemyEaKNaeiikaGIaemOAaOMaey4kaSIaemitaWKaeiykaKIaeiiFaWNaeyOkJe=aaSbaaSqaaiabdQgaQbqabaGccqGGDbqxdaahaaWcbeqcfayaamaalaaabaGaeGymaedabaGaemyCaehaaaaakiabcYcaSaqaaiabcIcaOiabdghaXjabgcMi5kabicdaWiabcMcaPaaaaaa@5347@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where &#9001;&#9002;<sub><it>j </it></sub>denotes the average over <it>j </it>= 1, 2, &#8943;, <it>N </it>&#8211; <it>L</it>, and <it>L </it>typically varies from 1 to <it>N</it><sub>1</sub>, the range over which the linear fit is good. From the ln <it>L </it>vs ln <it>M</it><sub><it>q </it></sub>(<it>L</it>) and ln <it>L </it>vs ln <inline-formula><m:math name="1471-2105-9-113-i28" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>M</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>q</m:mi></m:msub><m:mo stretchy="false">(</m:mo><m:mi>L</m:mi><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmyta0KbauaadaWgaaWcbaGaemyCaehabeaakiabcIcaOiabdYeamjabcMcaPaaa@3176@</m:annotation></m:semantics></m:math></inline-formula> planes, one can determine the relations:</p>
            <p>
               <display-formula id="M26">
                  <m:math name="1471-2105-9-113-i31" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:msup>
                                             <m:mi>M</m:mi>
                                             <m:mo>&#8242;</m:mo>
                                          </m:msup>
                                          <m:mi>q</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>L</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>&#8733;</m:mo>
                                       <m:msup>
                                          <m:mi>L</m:mi>
                                          <m:mrow>
                                             <m:msup>
                                                <m:mi>h</m:mi>
                                                <m:mo>&#8242;</m:mo>
                                             </m:msup>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>q</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                       </m:msup>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>f</m:mi>
                                       <m:mi>o</m:mi>
                                       <m:mi>r</m:mi>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>q</m:mi>
                                       <m:mo>&#8712;</m:mo>
                                       <m:msub>
                                          <m:mi>&#8484;</m:mi>
                                          <m:mo>+</m:mo>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeWaaaqaaiqbd2eanzaafaWaaSbaaSqaaiabdghaXbqabaGccqGGOaakcqWGmbatcqGGPaqkcqGHDisTcqWGmbatdaahaaWcbeqaaiqbdIgaOzaafaGaeiikaGIaemyCaeNaeiykaKcaaaGcbaGaemOzayMaem4Ba8MaemOCaihabaGaemyCaeNaeyicI48efv3ySLgznfgDOjdaryqr1ngBPrginfgDObcv39gaiqaacqWFKeIwdaWgaaWcbaGaey4kaScabeaakiabcYcaSaaaaaa@4D03@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>
               <display-formula id="M27">
                  <m:math name="1471-2105-9-113-i32" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>M</m:mi>
                                          <m:mi>q</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>L</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>&#8733;</m:mo>
                                       <m:msup>
                                          <m:mi>L</m:mi>
                                          <m:mrow>
                                             <m:mi>h</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>q</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                       </m:msup>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>f</m:mi>
                                       <m:mi>o</m:mi>
                                       <m:mi>r</m:mi>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>q</m:mi>
                                       <m:mo>&#8800;</m:mo>
                                       <m:mn>0.</m:mn>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeWaaaqaaiabd2eannaaBaaaleaacqWGXbqCaeqaaOGaeiikaGIaemitaWKaeiykaKIaeyyhIuRaemitaW0aaWbaaSqabeaacqWGObaAcqGGOaakcqWGXbqCcqGGPaqkaaaakeaacqWGMbGzcqWGVbWBcqWGYbGCaeaacqWGXbqCcqGHGjsUcqaIWaamcqGGUaGlaaaaaa@4242@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Linear regressions of ln <inline-formula><m:math name="1471-2105-9-113-i28" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>M</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>q</m:mi></m:msub><m:mo stretchy="false">(</m:mo><m:mi>L</m:mi><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmyta0KbauaadaWgaaWcbaGaemyCaehabeaakiabcIcaOiabdYeamjabcMcaPaaa@3176@</m:annotation></m:semantics></m:math></inline-formula> and ln <it>M</it><sub><it>q </it></sub>(<it>L</it>) against ln <it>L </it>will yield the exponents <it>h' </it>(<it>q</it>) and <it>h</it>(<it>q</it>) respectively.</p>
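            <p>As a minimal sketch of how such a regression could be carried out, the following Python code estimates <it>h</it>(<it>q</it>) from a profile <it>y</it> supplied as a list of numbers. The function names (mq, slope, h_exponent) and the parameter max_lag, which stands in for <it>N</it><sub>1</sub>, are illustrative assumptions and not part of the method description. The sketch also assumes the conventional <it>q</it>-th moment form, in which each increment is raised to the power <it>q</it> inside the average.</p>
            <p>
```python
import math


def mq(y, lag, q):
    """q-th order fluctuation M_q(L) of the profile y at lag L (q != 0)."""
    # Increments |y(j) - y(j + L)| for every j where both ends exist.
    diffs = [abs(y[j] - y[j + lag]) for j in range(len(y) - lag)]
    if 0 > q and min(diffs) == 0:
        raise ValueError("negative q is undefined when an increment is zero")
    # Assumption: the q-th power is taken inside the average.
    mean_q = sum(d ** q for d in diffs) / len(diffs)
    return mean_q ** (1.0 / q)


def slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (v - my) for x, v in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return sxy / sxx


def h_exponent(y, q, max_lag):
    """Estimate h(q) as the slope of ln M_q(L) against ln L, L = 1..max_lag."""
    lags = range(1, max_lag + 1)
    log_l = [math.log(lag) for lag in lags]
    log_m = [math.log(mq(y, lag, q)) for lag in lags]
    return slope(log_l, log_m)
```
            </p>
            <p>Under these assumptions, a feature vector could be formed as [h_exponent(y, q, max_lag) for q in (-2, -1, 1, 2)], with max_lag chosen inside the range where the log-log plot is close to linear.</p>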
            <p>The exponent <it>h</it>(<it>q</it>) has a nonlinear dependence on <it>q</it>. When <it>q </it>= 1, the methods reduce to those reported in Refs. <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>, which were used by Yu <it>et al</it>. <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> to study length sequences from complete genomes. <it>M'</it>(<it>L</it>) may be assessed to determine long-range correlation <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. According to Ref. <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, the linear fit used to obtain the exponent <it>h</it>(1) is better than that used to obtain the exponent <it>h'</it>(1). Our numerical results show that the exponents <it>h</it>(<it>q</it>) are more robust than the exponents <it>h'</it>(<it>q</it>), so we suggest using the exponents <it>h</it>(<it>q</it>). We have used <it>h</it>(<it>q</it>) to cluster the structures of large proteins, where it turned out to be a useful method <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
            <p>Figure <figr fid="F3">3</figr> gives an example of applying AMFA to the free energy sequence of a promoter sequence. It shows a good linear relationship between ln <it>M</it>(<it>L</it>) and ln(<it>L</it>). For different values of <it>q</it>, we obtain the exponents <it>h</it>(<it>q</it>) from linear regressions of ln <it>M</it>(<it>L</it>) against ln (<it>L</it>) according to Eq. (27). The exponent spectrum <it>h</it>(<it>q</it>) of the promoter sequence is shown in the right panel of Figure <figr fid="F3">3</figr>. We extract four parameters from AMFA: <it>h</it>(-2), <it>h</it>(-1), <it>h</it>(1) and <it>h</it>(2).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>The relationship between ln <it>M</it>(<it>L</it>) and ln(<it>L</it>) using the free energy sequence of one promoter (Left); the <it>h</it>(<it>q</it>) spectrum for the same promoter calculated by AMFA (Right)</p>
               </caption>
               <text>
                  <p>
                     <b>The relationship between ln <it>M</it>(<it>L</it>) and ln(<it>L</it>) using the free energy sequence of one promoter (Left); the <it>h</it>(<it>q</it>) spectrum for the same promoter calculated by AMFA (Right).</b>
                  </p>
               </text>
               <graphic file="1471-2105-9-113-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Z curve (<it>ZC</it>)</p>
            </st>
            <p>The concept of the Z curve representation of a DNA sequence was first proposed by Zhang and Zhang <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, and was used to distinguish coding and noncoding DNA sequences <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp>. A new system based on <it>ZC</it>, Z CURVE 1.0, for finding protein-coding genes in bacterial and archaeal genomes has been proposed <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Recently, another new self-training system based on the <it>ZC </it>method, ZCURVE_V <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, for recognizing protein-coding genes in viral and phage genomes was reported.</p>
            <p>In this paper, we apply the <it>ZC </it>method to distinguish promoter and non-promoter sequences. For convenience, we give a brief description of the methods in Refs. <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> and <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. The frequencies of bases A, C, G and T at positions 1, 4, 7, &#8943;; 2, 5, 8, &#8943;; and 3, 6, 9, &#8943; of a promoter/non-promoter sequence are denoted by <it>a</it><sub>1</sub>, <it>c</it><sub>1</sub>, <it>g</it><sub>1</sub>, <it>t</it><sub>1</sub>; <it>a</it><sub>2</sub>, <it>c</it><sub>2</sub>, <it>g</it><sub>2</sub>, <it>t</it><sub>2</sub>; <it>a</it><sub>3</sub>, <it>c</it><sub>3</sub>, <it>g</it><sub>3</sub>, <it>t</it><sub>3</sub>, respectively. They are in fact the frequencies of bases at the first, second and third codon positions, and can be called <it>codon-position-dependent </it>frequencies of mononucleotides. Based on the <it>ZC </it><abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, <it>a</it><sub><it>i</it></sub>, <it>c</it><sub><it>i</it></sub>, <it>g</it><sub><it>i</it></sub>, <it>t</it><sub><it>i </it></sub>for each <it>i </it>can be used to construct three coordinates, denoted by <it>x</it><sub><it>i</it></sub>, <it>y</it><sub><it>i </it></sub>and <it>z</it><sub><it>i</it></sub>, according to the Z transform <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>:</p>
            <p>
               <display-formula id="M28">
                  <m:math name="1471-2105-9-113-i33" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mrow>
                              <m:mo>{</m:mo>
                              <m:mrow>
                                 <m:mtable columnalign="left">
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>x</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>a</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>g</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>c</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>t</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>y</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>a</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>c</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>g</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>t</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>z</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>a</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>t</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>g</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>c</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqaaeaafaqaaeWabaaabaGaemiEaG3aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpcqGGOaakcqWGHbqydaWgaaWcbaGaemyAaKgabeaakiabgUcaRiabdEgaNnaaBaaaleaacqWGPbqAaeqaaOGaeiykaKIaeyOeI0IaeiikaGIaem4yam2aaSbaaSqaaiabdMgaPbqabaGccqGHRaWkcqWG0baDdaWgaaWcbaGaemyAaKgabeaakiabcMcaPiabcYcaSaqaaiabdMha5naaBaaaleaacqWGPbqAaeqaaOGaeyypa0JaeiikaGIaemyyae2aaSbaaSqaaiabdMgaPbqabaGccqGHRaWkcqWGJbWydaWgaaWcbaGaemyAaKgabeaakiabcMcaPiabgkHiTiabcIcaOiabdEgaNnaaBaaaleaacqWGPbqAaeqaaOGaey4kaSIaemiDaq3aaSbaaSqaaiabdMgaPbqabaGccqGGPaqkcqGGSaalaeaacqWG6bGEdaWgaaWcbaGaemyAaKgabeaakiabg2da9iabcIcaOiabdggaHnaaBaaaleaacqWGPbqAaeqaaOGaey4kaSIaemiDaq3aaSbaaSqaaiabdMgaPbqabaGccqGGPaqkcqGHsislcqGGOaakcqWGNbWzdaWgaaWcbaGaemyAaKgabeaakiabgUcaRiabdogaJnaaBaaaleaacqWGPbqAaeqaaOGaeiykaKIaeiilaWcaaaGaay5Eaaaaaa@7150@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>x</it><sub><it>i</it></sub>, <it>y</it><sub><it>i</it></sub>, <it>z</it><sub><it>i </it></sub>&#8712; [-1, 1], <it>i </it>= 1, 2, 3.</p>
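            <p>The nine codon-position-dependent coordinates can be sketched in Python as follows. This is an illustration under stated assumptions rather than the authors' implementation: the function names are hypothetical, the three phases are counted from the first base of the sequence, and each frequency is normalised by the number of bases in its own phase so that every coordinate lies in [-1, 1]. Characters other than A, C, G and T are assumed to be absent.</p>
            <p>
```python
def phase_frequencies(seq, phase):
    """Frequencies of A, C, G, T at positions phase, phase+3, ... (0-based)."""
    bases = seq.upper()[phase::3]
    n = len(bases)  # assumes the sequence has at least one base in this phase
    return {b: bases.count(b) / n for b in "ACGT"}


def z_transform(f):
    """Map the four base frequencies in f to the (x, y, z) coordinates."""
    x = (f["A"] + f["G"]) - (f["C"] + f["T"])  # purine vs pyrimidine
    y = (f["A"] + f["C"]) - (f["G"] + f["T"])  # amino vs keto
    z = (f["A"] + f["T"]) - (f["G"] + f["C"])  # weak vs strong H-bond
    return x, y, z


def zcurve_9(seq):
    """Return [x1, y1, z1, x2, y2, z2, x3, y3, z3] for one sequence."""
    features = []
    for phase in range(3):
        features.extend(z_transform(phase_frequencies(seq, phase)))
    return features
```
            </p>
            <p>For example, zcurve_9("ACGACGACG") places only A in the first phase, only C in the second and only G in the third, so every coordinate is either 1 or -1.</p>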
            <p>We can use the above 9 parameters in the promoter/non-promoter problem. We can also consider the <it>codon-position-independent </it>frequencies of single bases, which results in the following three coordinates:</p>
            <p>
               <display-formula id="M29">
                  <m:math name="1471-2105-9-113-i34" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mrow>
                              <m:mo>{</m:mo>
                              <m:mrow>
                                 <m:mtable columnalign="left">
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mi>x</m:mi>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>a</m:mi>
                                             <m:mo>+</m:mo>
                                             <m:mi>g</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>c</m:mi>
                                             <m:mo>+</m:mo>
                                             <m:mi>t</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mi>y</m:mi>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>a</m:mi>
                                             <m:mo>+</m:mo>
                                             <m:mi>c</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>g</m:mi>
                                             <m:mo>+</m:mo>
                                             <m:mi>t</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mi>z</m:mi>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>a</m:mi>
                                             <m:mo>+</m:mo>
                                             <m:mi>t</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>g</m:mi>
                                             <m:mo>+</m:mo>
                                             <m:mi>c</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqaaeaafaqaaeWabaaabaGaemiEaGNaeyypa0JaeiikaGIaemyyaeMaey4kaSIaem4zaCMaeiykaKIaeyOeI0IaeiikaGIaem4yamMaey4kaSIaemiDaqNaeiykaKIaeiilaWcabaGaemyEaKNaeyypa0JaeiikaGIaemyyaeMaey4kaSIaem4yamMaeiykaKIaeyOeI0IaeiikaGIaem4zaCMaey4kaSIaemiDaqNaeiykaKIaeiilaWcabaGaemOEaONaeyypa0JaeiikaGIaemyyaeMaey4kaSIaemiDaqNaeiykaKIaeyOeI0IaeiikaGIaem4zaCMaey4kaSIaem4yamMaeiykaKIaeiilaWcaaaGaay5Eaaaaaa@59D1@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>x, y, z </it>&#8712; [-1, 1], and <it>a, c, g </it>and <it>t </it>are the frequencies of the bases A, C, G and T in a promoter/non-promoter sequence, respectively.</p>
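            <p>The codon-position-independent coordinates can be sketched in the same way. The function name zcurve_3 is hypothetical, and the sketch assumes that frequencies are normalised by the number of A, C, G and T characters in the sequence, ignoring any other symbol.</p>
            <p>
```python
from collections import Counter


def zcurve_3(seq):
    """Return (x, y, z) from the overall base frequencies of one sequence."""
    counts = Counter(seq.upper())
    # Assumption: only A, C, G and T contribute to the total.
    n = sum(counts[b] for b in "ACGT")
    a, c, g, t = (counts[b] / n for b in "ACGT")
    x = (a + g) - (c + t)
    y = (a + c) - (g + t)
    z = (a + t) - (g + c)
    return x, y, z
```
            </p>
            <p>A sequence with equal numbers of all four bases maps to the origin, while a sequence made only of purines has <it>x </it>= 1.</p>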
            <p>In addition to the codon-position-dependent mononucleotide frequencies, we also consider the frequencies of <it>phase-specific </it>dinucleotides. We denote the frequencies of the 16 dinucleotides AA, AC, &#8943;, and TT occurring at the codon positions 1&#8211;2 and 2&#8211;3 of a promoter or non-promoter sequence by <it>p</it><sub>12</sub>(<it>AA</it>), <it>p</it><sub>12</sub>(<it>AC</it>), &#8943;, <it>p</it><sub>12</sub>(<it>TT</it>); <it>p</it><sub>23</sub>(<it>AA</it>), <it>p</it><sub>23</sub>(<it>AC</it>), &#8943;, and <it>p</it><sub>23</sub>(<it>TT</it>), respectively. Using the Z transform <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, the following 24 coordinates can be defined:</p>
            <p>
               <display-formula id="M30">
                  <m:math name="1471-2105-9-113-i35" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mrow>
                              <m:mo>{</m:mo>
                              <m:mrow>
                                 <m:mtable columnalign="left">
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msubsup>
                                                <m:mi>x</m:mi>
                                                <m:mi>k</m:mi>
                                                <m:mi>X</m:mi>
                                             </m:msubsup>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>p</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>A</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>p</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>G</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>p</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>C</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>p</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>T</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msubsup>
                                                <m:mi>y</m:mi>
                                                <m:mi>k</m:mi>
                                                <m:mi>X</m:mi>
                                             </m:msubsup>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>p</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>A</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>p</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>C</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>p</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>G</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>p</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>T</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msubsup>
                                                <m:mi>z</m:mi>
                                                <m:mi>k</m:mi>
                                                <m:mi>X</m:mi>
                                             </m:msubsup>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>p</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>A</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>p</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>T</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mi>p</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>G</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>p</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>C</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqaaeaafaqaaeWabaaabaGaemiEaG3aa0baaSqaaiabdUgaRbqaaiabdIfaybaakiabg2da9iabcIcaOiabdchaWnaaBaaaleaacqWGRbWAaeqaaOGaeiikaGIaemiwaGLaemyqaeKaeiykaKIaey4kaSIaemiCaa3aaSbaaSqaaiabdUgaRbqabaGccqGGOaakcqWGybawcqWGhbWrcqGGPaqkcqGGPaqkcqGHsislcqGGOaakcqWGWbaCdaWgaaWcbaGaem4AaSgabeaakiabcIcaOiabdIfayjabdoeadjabcMcaPiabgUcaRiabdchaWnaaBaaaleaacqWGRbWAaeqaaOGaeiikaGIaemiwaGLaemivaqLaeiykaKIaeiykaKIaeiilaWcabaGaemyEaK3aa0baaSqaaiabdUgaRbqaaiabdIfaybaakiabg2da9iabcIcaOiabdchaWnaaBaaaleaacqWGRbWAaeqaaOGaeiikaGIaemiwaGLaemyqaeKaeiykaKIaey4kaSIaemiCaa3aaSbaaSqaaiabdUgaRbqabaGccqGGOaakcqWGybawcqWGdbWqcqGGPaqkcqGGPaqkcqGHsislcqGGOaakcqWGWbaCdaWgaaWcbaGaem4AaSgabeaakiabcIcaOiabdIfayjabdEeahjabcMcaPiabgUcaRiabdchaWnaaBaaaleaacqWGRbWAaeqaaOGaeiikaGIaemiwaGLaemivaqLaeiykaKIaeiykaKIaeiilaWcabaGaemOEaO3aa0baaSqaaiabdUgaRbqaaiabdIfaybaakiabg2da9iabcIcaOiabdchaWnaaBaaaleaacqWGRbWAaeqaaOGaeiikaGIaemiwaGLaemyqaeKaeiykaKIaey4kaSIaemiCaa3aaSbaaSqaaiabdUgaRbqabaGccqGGOaakcqWGybawcqWGubavcqGGPaqkcqGGPaqkcqGHsislcqGGOaakcqWGWbaCdaWgaaWcbaGaem4AaSgabeaakiabcIcaOiabdIfayjabdEeahjabcMcaPiabgUcaRiabdchaWnaaBaaaleaacqWGRbWAaeqaaOGaeiikaGIaemiwaGLaem4qamKaeiykaKIaeiykaKIaeiilaWcaaaGaay5Eaaaaaa@A62A@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <inline-formula><m:math name="1471-2105-9-113-i36" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>x</m:mi><m:mi>k</m:mi><m:mi>X</m:mi></m:msubsup><m:mo>,</m:mo><m:msubsup><m:mi>y</m:mi><m:mi>k</m:mi><m:mi>X</m:mi></m:msubsup><m:mo>,</m:mo><m:msubsup><m:mi>z</m:mi><m:mi>k</m:mi><m:mi>X</m:mi></m:msubsup><m:mo>&#8712;</m:mo><m:mo stretchy="false">[</m:mo><m:mo>&#8722;</m:mo><m:mn>1</m:mn><m:mo>,</m:mo><m:mn>1</m:mn><m:mo stretchy="false">]</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiEaG3aa0baaSqaaiabdUgaRbqaaiabdIfaybaakiabcYcaSiabdMha5naaDaaaleaacqWGRbWAaeaacqWGybawaaGccqGGSaalcqWG6bGEdaqhaaWcbaGaem4AaSgabaGaemiwaGfaaOGaeyicI4Saei4waSLaeyOeI0IaeGymaeJaeiilaWIaeGymaeJaeiyxa0faaa@4222@</m:annotation></m:semantics></m:math></inline-formula>, <it>p</it><sub><it>k</it></sub>(<it>XY</it>) = <it>n</it><sub><it>k</it></sub>(<it>XY</it>)/[<it>n</it><sub><it>k</it></sub>(<it>XA</it>) + <it>n</it><sub><it>k</it></sub>(<it>XC</it>) + <it>n</it><sub><it>k</it></sub>(<it>XG</it>) + <it>n</it><sub><it>k</it></sub>(<it>XT</it>)], and <it>n</it><sub><it>k</it></sub>(<it>XY</it>) is the number of times the dinucleotide XY occurs at the codon positions indexed by <it>k </it>(X, Y = A, C, G, T; <it>k </it>= 12, 23, i.e. codon positions 1&#8211;2 and 2&#8211;3).</p>
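            <p>As an illustration only, the 24 phase-specific coordinates above can be sketched in Python. The function names, the <it>frame </it>offset parameter, the skipping of ambiguous bases, and the convention of returning zeros when a first base never occurs are assumptions made for this sketch and are not part of the original method description.</p>

```python
from collections import Counter

BASES = "ACGT"  # fixed order A, C, G, T


def z_coords(counts, first_base):
    """Return (x, y, z) for the four dinucleotides starting with first_base."""
    n = [counts.get(first_base + b, 0) for b in BASES]
    total = sum(n)
    if total == 0:
        # Assumption: the frequencies are undefined here, so report zeros.
        return (0.0, 0.0, 0.0)
    a, c, g, t = (v / total for v in n)
    return ((a + g) - (c + t), (a + c) - (g + t), (a + t) - (g + c))


def phase_specific_features(seq, frame=0):
    """Return {(k, X): (x, y, z)} for k in ("12", "23") and X in BASES."""
    seq = seq.upper()
    counts = {"12": Counter(), "23": Counter()}
    # Visit complete codons only; `frame` is a hypothetical reading-frame offset.
    for i in range(frame, len(seq) - 2, 3):
        pairs = {"12": seq[i:i + 2], "23": seq[i + 1:i + 3]}
        for k, pair in pairs.items():
            if all(b in BASES for b in pair):  # skip ambiguous bases such as N
                counts[k][pair] += 1
    return {(k, X): z_coords(counts[k], X) for k in ("12", "23") for X in BASES}


features = phase_specific_features("ACGACGACG")
```

            <p>Here the toy sequence consists of three ACG codons, so only AC is counted at positions 1&#8211;2 and only CG at positions 2&#8211;3; the returned dictionary holds 8 triples, i.e. 24 coordinates.</p>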
            <p>We can also consider the frequencies of <it>phase-independent </it>dinucleotides alongside those of phase-specific dinucleotides. For this purpose, a sliding window of size 2 nt is moved forward one base at a time to count the occurrences of each dinucleotide. With this method, 12 new coordinates can be defined:</p>
            <p>
               <display-formula id="M31">
                  <m:math name="1471-2105-9-113-i37" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mrow>
                              <m:mo>{</m:mo>
                              <m:mrow>
                                 <m:mtable columnalign="left">
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msup>
                                                <m:mi>x</m:mi>
                                                <m:mi>X</m:mi>
                                             </m:msup>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>A</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>G</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>C</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>T</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msup>
                                                <m:mi>y</m:mi>
                                                <m:mi>X</m:mi>
                                             </m:msup>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>A</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>C</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>G</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>T</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msup>
                                                <m:mi>z</m:mi>
                                                <m:mi>X</m:mi>
                                             </m:msup>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>A</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>T</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>G</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>C</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqaaeaafaqaaeWabaaabaGaemiEaG3aaWbaaSqabeaacqWGybawaaGccqGH9aqpcqGGOaakcqWGWbaCcqGGOaakcqWGybawcqWGbbqqcqGGPaqkcqGHRaWkcqWGWbaCcqGGOaakcqWGybawcqWGhbWrcqGGPaqkcqGGPaqkcqGHsislcqGGOaakcqWGWbaCcqGGOaakcqWGybawcqWGdbWqcqGGPaqkcqGHRaWkcqWGWbaCcqGGOaakcqWGybawcqWGubavcqGGPaqkcqGGPaqkcqGGSaalaeaacqWG5bqEdaahaaWcbeqaaiabdIfaybaakiabg2da9iabcIcaOiabdchaWjabcIcaOiabdIfayjabdgeabjabcMcaPiabgUcaRiabdchaWjabcIcaOiabdIfayjabdoeadjabcMcaPiabcMcaPiabgkHiTiabcIcaOiabdchaWjabcIcaOiabdIfayjabdEeahjabcMcaPiabgUcaRiabdchaWjabcIcaOiabdIfayjabdsfaujabcMcaPiabcMcaPiabcYcaSaqaaiabdQha6naaCaaaleqabaGaemiwaGfaaOGaeyypa0JaeiikaGIaemiCaaNaeiikaGIaemiwaGLaemyqaeKaeiykaKIaey4kaSIaemiCaaNaeiikaGIaemiwaGLaemivaqLaeiykaKIaeiykaKIaeyOeI0IaeiikaGIaemiCaaNaeiikaGIaemiwaGLaem4raCKaeiykaKIaey4kaSIaemiCaaNaeiikaGIaemiwaGLaem4qamKaeiykaKIaeiykaKIaeiilaWcaaaGaay5Eaaaaaa@8F11@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>x</it><sup><it>X</it></sup>, <it>y</it><sup><it>X</it></sup>, <it>z</it><sup><it>X </it></sup>&#8712; [&#8722;1, 1], <it>p</it>(<it>XY</it>) = <it>n</it>(<it>XY</it>)/[<it>n</it>(<it>XA</it>) + <it>n</it>(<it>XC</it>) + <it>n</it>(<it>XG</it>) + <it>n</it>(<it>XT</it>)], <it>n</it>(<it>XY</it>) is the number of occurrences of the dinucleotide XY, and X, Y = A, C, G, T.</p>
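            <p>A minimal sketch of the 12 phase-independent coordinates, assuming a window of 2 nt advanced one base at a time, might look as follows. The function name and the handling of empty counts (zeros) and of ambiguous bases (skipped) are illustrative assumptions.</p>

```python
from collections import Counter

BASES = "ACGT"  # fixed order A, C, G, T


def phase_independent_dinucleotide_features(seq):
    """Return {X: (x, y, z)} from dinucleotide counts with a step of one base."""
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if all(b in BASES for b in pair):  # assumption: skip windows containing N etc.
            counts[pair] += 1
    features = {}
    for X in BASES:
        n = [counts.get(X + b, 0) for b in BASES]
        total = sum(n)
        if total == 0:
            # Assumption: report zeros when no dinucleotide starts with X.
            features[X] = (0.0, 0.0, 0.0)
            continue
        a, c, g, t = (v / total for v in n)
        features[X] = ((a + g) - (c + t), (a + c) - (g + t), (a + t) - (g + c))
    return features


features = phase_independent_dinucleotide_features("AAGT")
```

            <p>For the toy input AAGT the windows are AA, AG and GT, so <it>p</it>(<it>AA</it>) = <it>p</it>(<it>AG</it>) = 0.5 and <it>p</it>(<it>GT</it>) = 1.</p>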
            <p>Gao and Zhang <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> compared various algorithms for recognizing short coding sequences of human genes and defined 48 quantities based on the frequencies of <it>phase-dependent </it>trinucleotides. In Ref. <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>, they used a sliding window of size 3 nt, moved forward three bases at a time, to count the frequencies of the 64 trinucleotides. Here we instead move the 3-nt sliding window forward one base at a time. The 48 coordinates are defined as</p>
            <p>
               <display-formula id="M32">
                  <m:math name="1471-2105-9-113-i38" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mrow>
                              <m:mo>{</m:mo>
                              <m:mrow>
                                 <m:mtable columnalign="left">
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msup>
                                                <m:mi>x</m:mi>
                                                <m:mrow>
                                                   <m:mi>X</m:mi>
                                                   <m:mi>Y</m:mi>
                                                </m:mrow>
                                             </m:msup>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>Y</m:mi>
                                             <m:mi>A</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>Y</m:mi>
                                             <m:mi>G</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>Y</m:mi>
                                             <m:mi>C</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>Y</m:mi>
                                             <m:mi>T</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msup>
                                                <m:mi>y</m:mi>
                                                <m:mrow>
                                                   <m:mi>X</m:mi>
                                                   <m:mi>Y</m:mi>
                                                </m:mrow>
                                             </m:msup>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>Y</m:mi>
                                             <m:mi>A</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>Y</m:mi>
                                             <m:mi>C</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>Y</m:mi>
                                             <m:mi>G</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>Y</m:mi>
                                             <m:mi>T</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:msup>
                                                <m:mi>z</m:mi>
                                                <m:mrow>
                                                   <m:mi>X</m:mi>
                                                   <m:mi>Y</m:mi>
                                                </m:mrow>
                                             </m:msup>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>Y</m:mi>
                                             <m:mi>A</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>Y</m:mi>
                                             <m:mi>T</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>Y</m:mi>
                                             <m:mi>G</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>X</m:mi>
                                             <m:mi>Y</m:mi>
                                             <m:mi>C</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>,</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaWaaiqaaeaafaqaaeWabaaabaGaemiEaG3aaWbaaSqabeaacqWGybawcqWGzbqwaaGccqGH9aqpcqGGOaakcqWGWbaCcqGGOaakcqWGybawcqWGzbqwcqWGbbqqcqGGPaqkcqGHRaWkcqWGWbaCcqGGOaakcqWGybawcqWGzbqwcqWGhbWrcqGGPaqkcqGGPaqkcqGHsislcqGGOaakcqWGWbaCcqGGOaakcqWGybawcqWGzbqwcqWGdbWqcqGGPaqkcqGHRaWkcqWGWbaCcqGGOaakcqWGybawcqWGzbqwcqWGubavcqGGPaqkcqGGPaqkcqGGSaalaeaacqWG5bqEdaahaaWcbeqaaiabdIfayjabdMfazbaakiabg2da9iabcIcaOiabdchaWjabcIcaOiabdIfayjabdMfazjabdgeabjabcMcaPiabgUcaRiabdchaWjabcIcaOiabdIfayjabdMfazjabdoeadjabcMcaPiabcMcaPiabgkHiTiabcIcaOiabdchaWjabcIcaOiabdIfayjabdMfazjabdEeahjabcMcaPiabgUcaRiabdchaWjabcIcaOiabdIfayjabdMfazjabdsfaujabcMcaPiabcMcaPiabcYcaSaqaaiabdQha6naaCaaaleqabaGaemiwaGLaemywaKfaaOGaeyypa0JaeiikaGIaemiCaaNaeiikaGIaemiwaGLaemywaKLaemyqaeKaeiykaKIaey4kaSIaemiCaaNaeiikaGIaemiwaGLaemywaKLaemivaqLaeiykaKIaeiykaKIaeyOeI0IaeiikaGIaemiCaaNaeiikaGIaemiwaGLaemywaKLaem4raCKaeiykaKIaey4kaSIaemiCaaNaeiikaGIaemiwaGLaemywaKLaem4qamKaeiykaKIaeiykaKIaeiilaWcaaaGaay5Eaaaaaa@A186@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>x</it><sup><it>XY</it></sup>, <it>y</it><sup><it>XY</it></sup>, <it>z</it><sup><it>XY </it></sup>&#8712; [-1, 1], <it>p</it>(<it>XYZ</it>) = <it>n</it>(<it>XYZ</it>)/[<it>n</it>(<it>XYA</it>) + <it>n</it>(<it>XYC</it>) + <it>n</it>(<it>XYG</it>) + <it>n</it>(<it>XYT</it>)], <it>n</it>(<it>XYZ</it>) is the number of occurrences of the trinucleotide <it>XYZ</it>, and <it>X</it>, <it>Y</it>, <it>Z</it> = A, C, G, T. The present approach differs from that of Ref. <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> in how <it>n</it>(<it>XYZ</it>) is calculated; it can be regarded as a <it>phase-independent </it>method.</p>
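            <p>The calculation of these parameters can be illustrated with a minimal Python sketch. It rests on assumptions that are not stated in the text: <it>n</it>(<it>XYZ</it>) is counted over every overlapping window of three nucleotides (one possible reading of "phase-independent"), windows containing symbols other than A, C, G, T are skipped, and a dinucleotide prefix <it>XY</it> that never occurs is given the placeholder value (0, 0, 0). The function name <it>phase_independent_zcurve</it> is hypothetical.</p>

```python
from itertools import product

BASES = "ACGT"

def phase_independent_zcurve(seq):
    """Return {XY: (x, y, z)} for the 16 dinucleotide prefixes XY."""
    seq = seq.upper()
    # n(XYZ): count every overlapping trinucleotide (window step 1, assumed).
    counts = {"".join(t): 0 for t in product(BASES, repeat=3)}
    for i in range(len(seq) - 2):
        tri = seq[i:i + 3]
        if tri in counts:  # skip windows containing N or other symbols
            counts[tri] += 1
    params = {}
    for x_base, y_base in product(BASES, repeat=2):
        xy = x_base + y_base
        total = sum(counts[xy + b] for b in BASES)
        if total == 0:
            # XY never occurs as a prefix; 0.0 is a placeholder choice.
            params[xy] = (0.0, 0.0, 0.0)
            continue
        # p(XYZ) = n(XYZ) / [n(XYA) + n(XYC) + n(XYG) + n(XYT)]
        p = {b: counts[xy + b] / total for b in BASES}
        params[xy] = (
            (p["A"] + p["G"]) - (p["C"] + p["T"]),  # x^XY
            (p["A"] + p["C"]) - (p["G"] + p["T"]),  # y^XY
            (p["A"] + p["T"]) - (p["G"] + p["C"]),  # z^XY
        )
    return params
```

            <p>Under these assumptions the function returns 16 &#215; 3 = 48 values per sequence, each lying in [-1, 1].</p>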
         </sec>
         <sec>
            <st>
               <p>Global descriptor of promoter/nonpromoter sequence (<it>GD</it>)</p>
            </st>
            <p>Dubchak <it>et al</it>. <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> proposed a method for predicting protein folding classes based on a global description of the protein chain. The protein-chain descriptor includes the overall composition, transition, and distribution of amino acid attributes. Similar methods have also been used in Refs. <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>. In this paper, we adapt this idea to construct a global descriptor of promoter/non-promoter sequences.</p>
            <p>The global description contains three parts: composition (<it>Comp</it>), transition (<it>Tran</it>) and distribution (<it>Dist</it>). To explain the method, suppose that a sequence consists of only two kinds of letters (A and B). The composition measures the percent frequency of each kind of letter in the sequence. For example, the sequence BABBABABBABBAABABABBAAABBABABA contains 14 As and 16 Bs, hence the frequencies of A and B are 100.00 &#215; 14/(14+16) = 46.67 and 100.00 &#215; 16/(14+16) = 53.33, respectively. These two numbers form the first part of the global description, <it>Comp</it>. The second part, <it>Tran</it>, characterizes the percent frequency with which A is followed by B or B is followed by A. The above sequence has 21 transitions of this type among its 29 adjacent pairs, that is, (21/29) &#215; 100.00 = 72.41. The third part of the global description, <it>Dist</it>, measures the chain length within which the first, 25%, 50%, 75% and 100% of the letters of a given type are located. For the above sequence, the first, 25%, 50%, 75% and 100% of Bs are located within the first 1, 6, 12, 20 and 29 nucleotides, respectively. The <it>Dist </it>descriptor for Bs is thus: 1/30 &#215; 100.00 = 3.33, 6/30 &#215; 100.00 = 20.00, 12/30 &#215; 100.00 = 40.00, 20/30 &#215; 100.00 = 66.67 and 29/30 &#215; 100.00 = 96.67. Likewise, the <it>Dist </it>descriptor for As is 6.67, 23.33, 53.33, 73.33 and 100.00. As a result, the global description for the above sequence is (<it>Comp</it>; <it>Tran</it>; <it>Dist</it>) = (46.67, 53.33; 72.41; 6.67, 23.33, 53.33, 73.33, 100.00, 3.33, 20.00, 40.00, 66.67, 96.67). A more detailed account of the global description of sequences is given in Refs. <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>.</p>
            <p>The global description for the promoter/non-promoter sequences can be computed by a similar procedure. As the sequences consist of four types of nucleotides (A, C, G and T), there are 4 parameters for <it>Comp</it>, 6 parameters for <it>Tran </it>and 20 parameters for <it>Dist</it>. Overall, a total of 30 parameters are used to give a global description of a promoter/non-promoter sequence.</p>
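            <p>The <it>Comp</it>, <it>Tran</it> and <it>Dist</it> calculations can be sketched in Python as follows. This is an illustration under stated assumptions rather than the implementation used in this work: the function name <it>global_descriptor</it> is hypothetical, the sequence is assumed to have at least two letters, the quantile rank for a letter occurring <it>c</it> times is taken as the integer part of the fraction times <it>c</it> (which matches the <it>Dist</it> values of the two-letter example), and a letter that never occurs is given the placeholder value 0 for all five <it>Dist</it> entries.</p>

```python
from itertools import combinations

def global_descriptor(seq, alphabet):
    """Return (comp, tran, dist) as percentages for the given alphabet."""
    n = len(seq)  # assumed to be at least 2
    # Comp: percent frequency of each letter.
    comp = [100.0 * seq.count(a) / n for a in alphabet]
    # Tran: percent of adjacent pairs that switch between letters a and b,
    # one value per unordered pair of distinct letters.
    tran = []
    for a, b in combinations(alphabet, 2):
        switches = sum(1 for u, v in zip(seq, seq[1:]) if {u, v} == {a, b})
        tran.append(100.0 * switches / (n - 1))
    # Dist: position (as percent of length) of the first, 25%, 50%, 75%
    # and 100% occurrence of each letter.
    dist = []
    for a in alphabet:
        positions = [i + 1 for i, c in enumerate(seq) if c == a]
        count = len(positions)
        if count == 0:
            dist.extend([0.0] * 5)  # placeholder for an absent letter
            continue
        # Integer part of the quantile rank, never below 1 (assumed rule).
        ranks = [1] + [max(1, (count * k) // 4) for k in (1, 2, 3, 4)]
        dist.extend(100.0 * positions[r - 1] / n for r in ranks)
    return comp, tran, dist
```

            <p>Calling <it>global_descriptor</it> on the two-letter example with the alphabet "AB" gives <it>Comp</it> = (46.67, 53.33), <it>Tran</it> = 21/29 &#215; 100 = 72.41 after rounding, and the ten <it>Dist</it> values listed above. With the alphabet "ACGT" it returns 4 + 6 + 20 = 30 values.</p>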
            <p>The Entropy Density Profile (EDP) model is a global statistical description of a DNA sequence of finite length, such as an open reading frame (ORF), based on Shannon's artificial linguistic description <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. Zhu <it>et al</it>. <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> developed a new non-supervised gene prediction algorithm for bacterial and archaeal genomes based on EDP. We describe the method briefly here. If <it>p</it><sub><it>i</it></sub>(<it>i </it>= 1, 2, 3, 4) are the frequencies of the four types of nucleotides in a promoter/non-promoter sequence, then an EDP vector <it>S </it>= {<it>s</it><sub><it>i</it></sub>} inferred from {<it>p</it><sub><it>i</it></sub>} is used to represent the sequence with an emphasis on its information content, where <it>i </it>is the index of the four kinds of nucleotides. The EDP <it>s</it><sub><it>i </it></sub>is defined as <abbrgrp><abbr bid="B59">59</abbr></abbrgrp></p>
            <p>
               <display-formula id="M33">
                  <m:math name="1471-2105-9-113-i39" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>s</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mfrac>
                                          <m:mn>1</m:mn>
                                          <m:mi>H</m:mi>
                                       </m:mfrac>
                                       <m:msub>
                                          <m:mi>p</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mi>log</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:msub>
                                          <m:mi>p</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>i</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>2</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>3</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>4</m:mn>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeGaaaqaaiabdohaZnaaBaaaleaacqWGPbqAaeqaaOGaeyypa0JaeyOeI0scfa4aaSaaaeaacqaIXaqmaeaacqWGibasaaGccqWGWbaCdaWgaaWcbaGaemyAaKgabeaakiGbcYgaSjabc+gaVjabcEgaNjabdchaWnaaBaaaleaacqWGPbqAaeqaaOGaeiilaWcabaGaemyAaKMaeyypa0JaeGymaeJaeiilaWIaeGOmaiJaeiilaWIaeG4mamJaeiilaWIaeGinaqJaeiilaWcaaaaa@4871@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <inline-formula><m:math name="1471-2105-9-113-i40" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>H</m:mi><m:mo>=</m:mo><m:mo>&#8722;</m:mo><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mn>4</m:mn></m:munderover><m:mrow><m:msub><m:mi>p</m:mi><m:mi>i</m:mi></m:msub><m:mi>log</m:mi><m:mo>&#8289;</m:mo><m:msub><m:mi>p</m:mi><m:mi>i</m:mi></m:msub></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemisaGKaeyypa0JaeyOeI0YaaabCaeaacqWGWbaCdaWgaaWcbaGaemyAaKgabeaakiGbcYgaSjabc+gaVjabcEgaNjabdchaWnaaBaaaleaacqWGPbqAaeqaaaqaaiabdMgaPjabg2da9iabigdaXaqaaiabisda0aqdcqGHris5aaaa@3F65@</m:annotation></m:semantics></m:math></inline-formula> is the Shannon entropy.</p>
            <p>It was shown that <inline-formula><m:math name="1471-2105-9-113-i41" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>P</m:mi><m:mo>=</m:mo><m:msubsup><m:mi>p</m:mi><m:mn>1</m:mn><m:mn>2</m:mn></m:msubsup><m:mo>+</m:mo><m:msubsup><m:mi>p</m:mi><m:mn>2</m:mn><m:mn>2</m:mn></m:msubsup><m:mo>+</m:mo><m:msubsup><m:mi>p</m:mi><m:mn>3</m:mn><m:mn>2</m:mn></m:msubsup><m:mo>+</m:mo><m:msubsup><m:mi>p</m:mi><m:mn>4</m:mn><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiuaaLaeyypa0JaemiCaa3aa0baaSqaaiabigdaXaqaaiabikdaYaaakiabgUcaRiabdchaWnaaDaaaleaacqaIYaGmaeaacqaIYaGmaaGccqGHRaWkcqWGWbaCdaqhaaWcbaGaeG4mamdabaGaeGOmaidaaOGaey4kaSIaemiCaa3aa0baaSqaaiabisda0aqaaiabikdaYaaaaaa@3EB2@</m:annotation></m:semantics></m:math></inline-formula> is a useful statistical quantity for analysis of DNA sequences <abbrgrp><abbr bid="B54">54</abbr><abbr bid="B60">60</abbr></abbrgrp>, which was called a nucleotide composition constraint of genomes <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. As a result, we obtain 6 parameters <it>s</it><sub>1</sub>, <it>s</it><sub>2</sub>, <it>s</it><sub>3</sub>, <it>s</it><sub>4</sub>, <it>H </it>and <it>P </it>from EDP.</p>
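            <p>A minimal Python sketch of the six EDP-derived parameters is given below. Several points are assumptions rather than details from the text: the function name <it>edp_parameters</it> is hypothetical, the natural logarithm is used (the <it>s</it><sub><it>i</it></sub> do not depend on the base, but <it>H</it> does), a term with <it>p</it><sub><it>i</it></sub> = 0 is taken as 0, the sequence is assumed to contain at least one of A, C, G, T, and the <it>s</it><sub><it>i</it></sub> are set to 0 when <it>H</it> = 0.</p>

```python
import math

def edp_parameters(seq):
    """Return (s, H, P); s is the list [s_A, s_C, s_G, s_T]."""
    seq = seq.upper()
    counts = [seq.count(b) for b in "ACGT"]
    total = sum(counts)  # assumed to be positive
    p = [c / total for c in counts]
    # Shannon entropy H = -sum p_i log p_i, with 0 log 0 taken as 0.
    terms = [-pi * math.log(pi) if pi > 0 else 0.0 for pi in p]
    H = sum(terms)
    # EDP components s_i = -(1/H) p_i log p_i; they sum to 1 when H > 0.
    s = [t / H for t in terms] if H > 0 else [0.0] * 4
    # Composition constraint P = p_1^2 + p_2^2 + p_3^2 + p_4^2.
    P = sum(pi * pi for pi in p)
    return s, H, P
```

            <p>For a sequence with equal nucleotide frequencies, this sketch gives <it>s</it><sub><it>i</it></sub> = 0.25 for each nucleotide, <it>H </it>= log 4 and <it>P </it>= 0.25.</p>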
            <p>Overall, combining the above two description systems, we get 36 parameters for the global descriptor of a promoter/non-promoter sequence.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>JYY conceived the study, downloaded the data, analyzed the results, and was involved in programming and in drafting and revising the manuscript. YZ and LQZ were involved in programming and in the discussion of the results. ZGY coordinated the study, participated in its design, analyzed the results, and was involved in drafting and revising the manuscript. VA participated in the design of the study and the discussion of the results, and was involved in drafting and revising the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors would like to thank Dr. Feng-Biao Guo of Tianjin University for his helpful discussions about the Z curve method, Dr. Jun Lu of Inner Mongolia University for his help with the promoter data on human chromosome 22, and the referees for their detailed comments and useful suggestions to improve the paper. Financial support was provided by the Chinese National Natural Science Foundation (grant no. 30570426), Fok Ying Tung Education Foundation (grant no. 101004), the Youth Foundation of Educational Department of Hunan Province in China (grant no. 05B007) (Z.-G. Yu), the Australian Research Council (grant no. DP0559807) (V.V. Anh), and the Scientific Research Fund of the Department of Education in Hunan Province of China (no. 06C830) (L.Q. Zhou).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The recognition and prediction of <it>&#963;</it><sup>70 </sup>promoters in <it>Escherichia coli </it>K-12</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>QZ</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>J Theor Biol</source>
            <pubdate>2006</pubdate>
            <volume>242</volume>
            <fpage>135</fpage>
            <lpage>141</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jtbi.2006.02.007</pubid>
                  <pubid idtype="pmpid" link="fulltext">16603195</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Promoter Prediction on a Genomic Scale-The Adh Experience</p>
            </title>
            <aug>
               <au>
                  <snm>Ohler</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>539</fpage>
            <lpage>542</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">310866</pubid>
                  <pubid idtype="pmpid" link="fulltext">10779494</pubid>
                  <pubid idtype="doi">10.1101/gr.10.4.539</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Eukaryotic Promoter Recognition</p>
            </title>
            <aug>
               <au>
                  <snm>Fickett</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hatzigeorgiou</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1997</pubdate>
            <volume>7</volume>
            <fpage>861</fpage>
            <lpage>878</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9314492</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The state of the art of mammalian promoter recognition</p>
            </title>
            <aug>
               <au>
                  <snm>Werner</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Brief Bioinform</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>1</issue>
            <fpage>22</fpage>
            <lpage>30</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bib/4.1.22</pubid>
                  <pubid idtype="pmpid" link="fulltext">12715831</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>The biology of eukaryotic promoter prediction-a review</p>
            </title>
            <aug>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Baldi</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chauvin</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Comput Chem</source>
            <pubdate>1999</pubdate>
            <volume>23</volume>
            <fpage>191</fpage>
            <lpage>207</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0097-8485(99)00015-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">10404615</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>A novel method for prokaryotic promoter prediction based on DNA stability</p>
            </title>
            <aug>
               <au>
                  <snm>Kanhere</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bansal</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>1</fpage>
            <lpage>10</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">545949</pubid>
                  <pubid idtype="pmpid" link="fulltext">15631638</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-6-1</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Human pol II promoter prediction: time series descriptors and machine learning</p>
            </title>
            <aug>
               <au>
                  <snm>Gangal</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sharma</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>1332</fpage>
            <lpage>1336</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">552959</pubid>
                  <pubid idtype="pmpid" link="fulltext">15741185</pubid>
                  <pubid idtype="doi">10.1093/nar/gki271</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics</p>
            </title>
            <aug>
               <au>
                  <snm>SantaLucia</snm>
                  <fnm>J Jr</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>1460</fpage>
            <lpage>1465</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">19045</pubid>
                  <pubid idtype="pmpid" link="fulltext">9465037</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.4.1460</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Clustering structures of large proteins using multifractal analyses based on a 6-letter model and hydrophobicity scale of amino acids</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>JY</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>ZG</fnm>
               </au>
               <au>
                  <snm>Anh</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Chaos, Solitons and Fractals</source>
            <pubdate>2007</pubdate>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.chaos.2007.08.014</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>ZCURVE: a new system for recognizing protein-coding genes in bacterial and archaeal genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Guo</snm>
                  <fnm>FB</fnm>
               </au>
               <au>
                  <snm>Ou</snm>
                  <fnm>HY</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>CT</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>1780</fpage>
            <lpage>1789</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">152858</pubid>
                  <pubid idtype="pmpid" link="fulltext">12626720</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg254</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>ZCURVE_V: a new self-training system for recognizing protein-coding genes in viral and phage genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Guo</snm>
                  <fnm>FB</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>CT</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>1</fpage>
            <lpage>11</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1360682</pubid>
                  <pubid idtype="pmpid" link="fulltext">16393334</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-9</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Prediction of protein folding class using global description of amino acid sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Muchnik</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Holbrook</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>SH</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci</source>
            <pubdate>1995</pubdate>
            <volume>92</volume>
            <fpage>8700</fpage>
            <lpage>8704</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">41034</pubid>
                  <pubid idtype="pmpid" link="fulltext">7568000</pubid>
                  <pubid idtype="doi">10.1073/pnas.92.19.8700</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>First pass annotation of promoters of human chromosome 22</p>
            </title>
            <aug>
               <au>
                  <snm>Scherf</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Klingenhoff</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Frech</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Quandt</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Grote</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Frisch</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gailus-Durner</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Seidel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Brack-Werner</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Werner</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>333</fpage>
            <lpage>340</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">311038</pubid>
                  <pubid idtype="pmpid" link="fulltext">11230158</pubid>
                  <pubid idtype="doi">10.1101/gr.154601</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Website <it>EPD</it></p>
            </title>
            <url>http://www.epd.isb-sib.ch</url>
         </bibl>
         <bibl id="B15">
            <title>
               <p>The Eukaryotic Promoter Database EPD</p>
            </title>
            <aug>
               <au>
                  <snm>Perier</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Junier</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bucher</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1998</pubdate>
            <volume>26</volume>
            <fpage>353</fpage>
            <lpage>357</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">147208</pubid>
                  <pubid idtype="pmpid" link="fulltext">9399872</pubid>
                  <pubid idtype="doi">10.1093/nar/26.1.353</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Computational modeling of oligonucleotide positional densities for human promoter prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Narang</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Saeys</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sung</snm>
                  <fnm>WK</fnm>
               </au>
               <au>
                  <snm>Mittal</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Artif Intell Med</source>
            <pubdate>2005</pubdate>
            <volume>35</volume>
            <fpage>107</fpage>
            <lpage>119</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.artmed.2005.02.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">16076553</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Dragon Promoter Finder: recognition of vertebrate RNA polymerase II promoters</p>
            </title>
            <aug>
               <au>
                  <snm>Bajic</snm>
                  <fnm>VB</fnm>
               </au>
               <au>
                  <snm>Seah</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Chong</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Koh</snm>
                  <fnm>JLY</fnm>
               </au>
               <au>
                  <snm>Brusic</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <fpage>198</fpage>
            <lpage>199</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/18.1.198</pubid>
                  <pubid idtype="pmpid" link="fulltext">11836231</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Promoter 2.0: for the recognition of Pol II promoter sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Knudsen</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <fpage>356</fpage>
            <lpage>361</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/15.5.356</pubid>
                  <pubid idtype="pmpid" link="fulltext">10366655</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Predicting Pol II Promoter Sequences using Transcription Factor Binding Sites</p>
            </title>
            <aug>
               <au>
                  <snm>Prestridge</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1995</pubdate>
            <volume>249</volume>
            <fpage>923</fpage>
            <lpage>932</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1995.0349</pubid>
                  <pubid idtype="pmpid" link="fulltext">7791218</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>EID: the Exon-Intron Database-an exhaustive database of protein-coding intron-containing genes</p>
            </title>
            <aug>
               <au>
                  <snm>Saxonov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Daizadeh</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Fedorov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>185</fpage>
            <lpage>190</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102483</pubid>
                  <pubid idtype="pmpid" link="fulltext">10592221</pubid>
                  <pubid idtype="doi">10.1093/nar/28.1.185</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Website <it>EID</it></p>
            </title>
            <url>http://hsc.utoledo.edu/bioinfo/eid/index.html</url>
         </bibl>
         <bibl id="B22">
            <aug>
               <au>
                  <snm>Mardia</snm>
                  <fnm>KV</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Bibby</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Multivariate Analysis</source>
            <publisher>Academic Press, London</publisher>
            <pubdate>1979</pubdate>
         </bibl>
         <bibl id="B23">
            <aug>
               <au>
                  <snm>Duda</snm>
                  <fnm>RO</fnm>
               </au>
               <au>
                  <snm>Hart</snm>
                  <fnm>PE</fnm>
               </au>
               <au>
                  <snm>Stork</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Pattern Classification</source>
            <publisher>John Wiley &amp; Sons, New York</publisher>
            <edition>2</edition>
            <pubdate>2001</pubdate>
         </bibl>
         <bibl id="B24">
            <aug>
               <au>
                  <snm>Sneath</snm>
                  <fnm>PH</fnm>
               </au>
               <au>
                  <snm>Sokal</snm>
                  <fnm>RR</fnm>
               </au>
            </aug>
            <source>Numerical Taxonomy</source>
            <publisher>Freeman, San Francisco</publisher>
            <pubdate>1973</pubdate>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Clustering of protein structures using hydrophobic free energy and solvent accessibility of proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>ZG</fnm>
               </au>
               <au>
                  <snm>Anh</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lau</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>LQ</fnm>
               </au>
            </aug>
            <source>Phys Rev E</source>
            <pubdate>2006</pubdate>
            <volume>73</volume>
            <issue>3</issue>
            <fpage>031920</fpage>
            <note>Epub 2006 Mar 21.</note>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1103/PhysRevE.73.031920</pubid>
                  <pubid idtype="pmpid" link="fulltext">16605571</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>A fractal method to distinguish coding and non-coding sequences in a complete genome based on a number sequence representation</p>
            </title>
            <aug>
               <au>
                  <snm>Zhou</snm>
                  <fnm>LQ</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>ZG</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>JQ</fnm>
               </au>
               <au>
                  <snm>Anh</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Long</snm>
                  <fnm>SC</fnm>
               </au>
            </aug>
            <source>J Theor Biol</source>
            <pubdate>2005</pubdate>
            <volume>232</volume>
            <fpage>559</fpage>
            <lpage>567</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jtbi.2004.09.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">15588636</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Website <it>NNPP version 2.2</it></p>
            </title>
            <url>http://www.fruitfly.org/seq_tools/promoter.html</url>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Website <it>TSSW</it></p>
            </title>
            <url>http://www.softberry.com/berry.phtml?topic=tssw&amp;group=programs&amp;subgroup=promoter</url>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Website <it>DPF version 1.5</it></p>
            </title>
            <url>http://www.research.i2r.a-star.edu.sg/promoter/promoter1_5/DPF.htm</url>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Website <it>Promoter 2.0</it></p>
            </title>
            <url>http://www.cbs.dtu.dk/services/Promoter</url>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Website <it>Promoter Scan version 1.7</it></p>
            </title>
            <url>http://www-bimas.cit.nih.gov/molbio/proscan/</url>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Comparison of various algorithms for recognizing short coding sequences of human genes</p>
            </title>
            <aug>
               <au>
                  <snm>Gao</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>CT</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>673</fpage>
            <lpage>681</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg467</pubid>
                  <pubid idtype="pmpid" link="fulltext">14764563</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Human Pol II promoter prediction (in Chinese)</p>
            </title>
            <aug>
               <au>
                  <snm>Lu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Luo</snm>
                  <fnm>LF</fnm>
               </au>
            </aug>
            <source>Progress in Biochemistry and Biophysics</source>
            <pubdate>2005</pubdate>
            <volume>32</volume>
            <fpage>1185</fpage>
            <lpage>1191</lpage>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Boosting with stumps for predicting transcription start sites</p>
            </title>
            <aug>
               <au>
                  <snm>Zhao</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Xuan</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>R17</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1852414</pubid>
                  <pubid idtype="pmpid" link="fulltext">17274821</pubid>
                  <pubid idtype="doi">10.1186/gb-2007-8-2-r17</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Chaos game representation of the <it>D</it><sub>st</sub> index and prediction of geomagnetic storm events</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>ZG</fnm>
               </au>
               <au>
                  <snm>Anh</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Wanliss</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Watson</snm>
                  <fnm>SM</fnm>
               </au>
            </aug>
            <source>Chaos, Solitons and Fractals</source>
            <pubdate>2007</pubdate>
            <volume>31</volume>
            <fpage>736</fpage>
            <lpage>746</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.chaos.2005.12.046</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Multifractal nature of network induced time delay in networked control systems</p>
            </title>
            <aug>
               <au>
                  <snm>Tian</snm>
                  <fnm>YC</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>ZG</fnm>
               </au>
               <au>
                  <snm>Fidge</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Phys Lett A</source>
            <pubdate>2007</pubdate>
            <volume>361</volume>
            <fpage>103</fpage>
            <lpage>107</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.physleta.2006.09.046</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The ENCODE (ENCyclopedia Of DNA Elements) Project</p>
            </title>
            <aug>
               <au>
                  <cnm>The ENCODE Project Consortium</cnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>306</volume>
            <fpage>636</fpage>
            <lpage>640</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1105136</pubid>
                  <pubid idtype="pmpid" link="fulltext">15499007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Helix stability in prokaryotic promoter regions</p>
            </title>
            <aug>
               <au>
                  <snm>Margalit</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Shapiro</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Nussinov</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Owens</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jernigan</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>1988</pubdate>
            <volume>27</volume>
            <issue>14</issue>
            <fpage>5179</fpage>
            <lpage>5188</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/bi00414a035</pubid>
                  <pubid idtype="pmpid">3167040</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>A relationship between DNA helix stability and recognition sites for RNA polymerase</p>
            </title>
            <aug>
               <au>
                  <snm>Vollenweider</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Fiandt</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Szybalski</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1979</pubdate>
            <volume>205</volume>
            <fpage>508</fpage>
            <lpage>511</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.377494</pubid>
                  <pubid idtype="pmpid" link="fulltext">377494</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Large-scale structural analysis of the core promoter in mammalian and plant genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Florquin</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Saeys</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Degroeve</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rouz&#233;</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Van de Peer</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>4255</fpage>
            <lpage>4264</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1181242</pubid>
                  <pubid idtype="pmpid" link="fulltext">16049029</pubid>
                  <pubid idtype="doi">10.1093/nar/gki737</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Multifractal characterisation of length sequences of coding and noncoding segments in a complete genome</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>ZG</fnm>
               </au>
               <au>
                  <snm>Anh</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lau</snm>
                  <fnm>KS</fnm>
               </au>
            </aug>
            <source>Physica A</source>
            <pubdate>2001</pubdate>
            <volume>301</volume>
            <fpage>351</fpage>
            <lpage>361</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S0378-4371(01)00391-0</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Measure representation and multifractal analysis of complete genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>ZG</fnm>
               </au>
               <au>
                  <snm>Anh</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lau</snm>
                  <fnm>KS</fnm>
               </au>
            </aug>
            <source>Phys Rev E</source>
            <pubdate>2001</pubdate>
            <volume>64</volume>
            <issue>3</issue>
            <fpage>031903</fpage>
            <note>Epub 2001 Aug 24.</note>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1103/PhysRevE.64.031903</pubid>
                  <pubid idtype="pmpid" link="fulltext">11580363</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <aug>
               <au>
                  <snm>Mandelbrot</snm>
                  <fnm>BB</fnm>
               </au>
            </aug>
            <source>The Fractal Geometry of Nature</source>
            <publisher>Academic Press, New York</publisher>
            <pubdate>1983</pubdate>
         </bibl>
         <bibl id="B44">
            <aug>
               <au>
                  <snm>Feder</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Fractals</source>
            <publisher>Plenum, New York</publisher>
            <pubdate>1988</pubdate>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Fractal analysis of measure representation of large proteins based on the detailed HP model</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>ZG</fnm>
               </au>
               <au>
                  <snm>Anh</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lau</snm>
                  <fnm>KS</fnm>
               </au>
            </aug>
            <source>Physica A</source>
            <pubdate>2004</pubdate>
            <volume>337</volume>
            <fpage>171</fpage>
            <lpage>184</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.physa.2004.01.057</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Phase Transition in the Multifractal Spectrum of Diffusion-Limited Aggregation</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stanley</snm>
                  <fnm>HE</fnm>
               </au>
            </aug>
            <source>Phys Rev Lett</source>
            <pubdate>1988</pubdate>
            <volume>61</volume>
            <fpage>2945</fpage>
            <lpage>2948</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1103/PhysRevLett.61.2945</pubid>
                  <pubid idtype="pmpid" link="fulltext">10039271</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Multifractality in time series</p>
            </title>
            <aug>
               <au>
                  <snm>Canessa</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>J Phys A</source>
            <pubdate>2000</pubdate>
            <volume>33</volume>
            <fpage>3637</fpage>
            <lpage>3651</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1088/0305-4470/33/19/302</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Scaling properties in temporal patterns of schizophrenia</p>
            </title>
            <aug>
               <au>
                  <snm>Dunki</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Ambuhl</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Physica A</source>
            <pubdate>1996</pubdate>
            <volume>230</volume>
            <fpage>544</fpage>
            <lpage>553</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/0378-4371(96)00097-0</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Correlation property of length sequences based on global structure of the complete genome</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>ZG</fnm>
               </au>
               <au>
                  <snm>Anh</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Phys Rev E</source>
            <pubdate>2001</pubdate>
            <volume>63</volume>
            <issue>1</issue>
            <fpage>011903</fpage>
            <note>Epub 2000 Dec 20.</note>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1103/PhysRevE.63.011903</pubid>
                  <pubid idtype="pmpid" link="fulltext">11304283</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <aug>
               <au>
                  <snm>Bunde</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Havlin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <cnm>eds</cnm>
               </au>
            </aug>
            <source>Fractals in Science</source>
            <publisher>Springer-Verlag, Berlin</publisher>
            <pubdate>1994</pubdate>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Z curves, an intuitive tool for visualizing and analyzing the DNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>CT</fnm>
               </au>
            </aug>
            <source>J Biomol Struct Dyn</source>
            <pubdate>1994</pubdate>
            <volume>11</volume>
            <issue>4</issue>
            <fpage>767</fpage>
            <lpage>782</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid">8204213</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>A novel approach to distinguish between intron-containing and intronless genes based on the format of Z curves</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>ZS</fnm>
               </au>
               <au>
                  <snm>Yan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>J Theor Biol</source>
            <pubdate>1998</pubdate>
            <volume>192</volume>
            <fpage>467</fpage>
            <lpage>473</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jtbi.1998.0671</pubid>
                  <pubid idtype="pmpid" link="fulltext">9680720</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>A new Fourier transform approach for protein coding measure based on the format of the Z curve</p>
            </title>
            <aug>
               <au>
                  <snm>Yan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>ZS</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>CT</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <fpage>685</fpage>
            <lpage>690</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/14.8.685</pubid>
                  <pubid idtype="pmpid" link="fulltext">9789094</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Analysis of distribution of bases in the coding sequences by a diagrammatic technique</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1991</pubdate>
            <volume>19</volume>
            <fpage>6313</fpage>
            <lpage>6317</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">329145</pubid>
                  <pubid idtype="pmpid" link="fulltext">1956790</pubid>
                  <pubid idtype="doi">10.1093/nar/19.22.6313</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>A computational approach to identify genes for functional RNAs in genomic sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Carter</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Holbrook</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>3928</fpage>
            <lpage>3938</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">60242</pubid>
                  <pubid idtype="pmpid" link="fulltext">11574674</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Cai</snm>
                  <fnm>CZ</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>LY</fnm>
               </au>
               <au>
                  <snm>Ji</snm>
                  <fnm>ZL</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>YZ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>3692</fpage>
            <lpage>3697</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">169006</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824396</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg600</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Descriptor-based protein remote homology identification</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Kochhar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Grigorov</snm>
                  <fnm>MG</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>2005</pubdate>
            <volume>14</volume>
            <fpage>431</fpage>
            <lpage>444</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1110/ps.041035505</pubid>
                  <pubid idtype="pmpid" link="fulltext">15632283</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>ZR</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>HH</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>LY</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>YZ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>W32</fpage>
            <lpage>W37</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1538821</pubid>
                  <pubid idtype="pmpid" link="fulltext">16845018</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl305</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>MED: a new non-supervised gene prediction algorithm for bacterial and archaeal genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Zhu</snm>
                  <fnm>HQ</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>GQ</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>YF</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>She</snm>
                  <fnm>ZS</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>1</fpage>
            <lpage>11</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1766362</pubid>
                  <pubid idtype="pmpid" link="fulltext">17199892</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-8-1</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Recognition of Protein Coding Genes in the Yeast Genome at Better Than 95% Accuracy Based on the Z curve</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>2804</fpage>
            <lpage>2814</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102655</pubid>
                  <pubid idtype="pmpid" link="fulltext">10908339</pubid>
                  <pubid idtype="doi">10.1093/nar/28.14.2804</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>A nucleotide composition constraint of genome sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Comput Biol Chem</source>
            <pubdate>2004</pubdate>
            <volume>28</volume>
            <fpage>149</fpage>
            <lpage>153</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.compbiolchem.2004.02.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">15130543</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
