<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-8-418</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Software</dochead>
      <bibl>
         <title>
            <p>HeliCis: a DNA motif discovery tool for colocalized motif pairs with periodic spacing</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Larsson</snm>
               <fnm>Erik</fnm>
               <insr iid="I1"/>
               <email>erik.larsson@wlab.gu.se</email>
            </au>
            <au id="A2">
               <snm>Lindahl</snm>
               <fnm>Per</fnm>
               <insr iid="I1"/>
               <email>per.lindahl@wlab.gu.se</email>
            </au>
            <au id="A3">
               <snm>Mostad</snm>
               <fnm>Petter</fnm>
               <insr iid="I2"/>
               <email>mostad@chalmers.se</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Wallenberg Laboratory for Cardiovascular Research, Bruna Str&#229;ket 16, Sahlgrenska University Hospital, SE-413 45 G&#246;teborg, Sweden</p>
            </ins>
            <ins id="I2">
               <p>Mathematical Sciences, Chalmers University of Technology and Mathematical Sciences, G&#246;teborg University, SE-412 96 G&#246;teborg, Sweden</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>418</fpage>
         <url>http://www.biomedcentral.com/1471-2105/8/418</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17963524</pubid>
               <pubid idtype="doi">10.1186/1471-2105-8-418</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>23</day>
               <month>5</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>28</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>28</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Larsson et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Correct temporal and spatial gene expression during metazoan development relies on combinatorial interactions between different transcription factors. As a consequence, <it>cis</it>-regulatory elements often colocalize in clusters termed <it>cis-</it>regulatory modules. These may have requirements on organizational features such as spacing, order and <it>helical phasing </it>(periodic spacing) between binding sites. Due to the turning of the DNA helix, a small modification of the distance between a pair of sites may sometimes drastically disrupt function, while insertion of a full helical turn of DNA (10&#8211;11 bp) between <it>cis </it>elements may cause functionality to be restored. Recently, <it>de novo </it>motif discovery methods which incorporate organizational properties such as colocalization and order preferences have been developed, but there are no tools which incorporate periodic spacing into the model.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We have developed a web based motif discovery tool, HeliCis, which features a flexible model which allows <it>de novo </it>detection of motifs with periodic spacing. Depending on the parameter settings it may also be used for discovering colocalized motifs without periodicity or motifs separated by a fixed gap of known or unknown length. We show on simulated data that it can efficiently capture the synergistic effects of colocalization and periodic spacing to improve detection of weak DNA motifs. It provides a simple to use web interface which interactively visualizes the current settings and thereby makes it easy to understand the parameters and the model structure.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>HeliCis provides simple and efficient <it>de novo </it>discovery of colocalized DNA motif pairs, with or without periodic spacing. Our evaluations show that it can detect weak periodic patterns which are not easily discovered using a sequential approach, i.e. first finding the binding sites and second analyzing the properties of their pairwise distances.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>DNA sequence motifs recognized by transcription factors are usually short (~10 bp) with low information content, and matching sequence elements therefore occur randomly in large numbers in the genome. The precise specificity required for correct temporal and spatial transcription during metazoan development relies on combinatorial interactions between binding sites in relatively dense clusters <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. These clusters, termed <it>cis</it>-regulatory modules (CRMs), typically contain sites (<it>cis</it>-regulatory elements) for several different transcriptional activators and repressors. CRMs may be unstructured, serving as "billboards" that bring DNA binding proteins into proximity <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. In this case, the balance of activators and repressors, rather than the order or spacing between factors, is the most important property. They may however also be highly structured, the extreme example being the "enhanceosome"-type CRM, with very little flexibility in the arrangement of recognition sites <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Others are more flexible, but with requirements on organizational features such as spacing, order and <it>helical phasing </it>between binding sites.</p>
         <p>Numerous examples demonstrate the importance of the last feature, the <it>phase</it>. A small modification of the distance between a pair of sites may sometimes drastically disrupt function and this is usually attributed to the turning of the DNA helix. In many cases, insertion of a full helical turn of DNA (10&#8211;11 bp <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>) between <it>cis </it>elements will cause functionality to be restored, as this will cause the same face of the binding protein to be exposed to cofactors and nearby DNA binding factors. The phenomenon has been observed in many studies of single genes, e.g. for AP-1 and RD binding sites in the collagenase-3 promoter <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> as well as for the smooth muscle <it>&#945;</it>-actin promoter, where introduction of a 20 bp spacer caused significantly higher reporter activity than a 15 bp spacer <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Other examples include the HPV18 enhancer <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, lung surfactant protein B <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, TNF-<it>&#945; </it><abbrgrp><abbr bid="B9">9</abbr></abbrgrp> and Igamma1 <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. In study of four coregulated <it>Drosophila </it>developmental enhancers, a conserved shared organization with pairwise periodic distances between neighboring sites was identified <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Periodic signals in distances between neighboring motif pairs have also been observed on a genomic scale in <it>Drosophila </it><abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and other eukaryotes <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>.</p>
         <p>Significant effort has been put into the problem of <it>de novo </it>motif discovery of transcription factor binding sites <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The task, often described as a local multiple alignment problem, is difficult due to the degenerate nature of transcription factor recognition sequences. Prediction may sometimes be improved by incorporating organizational features such as colocalization and order preferences into the model, and in recent years several such methods have been proposed <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. The idea of incorporating helical phasing into a motif discovery tool has been suggested <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, but to our knowledge no such tool has yet been devised. We propose a motif sampler which can efficiently discover ordered or unordered colocalized motif pairs <it>de novo </it>in DNA sequences. In addition, our tool incorporates an optional periodic spacing model, and we show on simulated data that it can detect weak periodic patterns that are not easily discovered using single motif or colocalization methods.</p>
      </sec>
      <sec>
         <st>
            <p>Implementation</p>
         </st>
         <sec>
            <st>
               <p>Algorithm overview</p>
            </st>
            <p>We propose a <it>de novo </it>method for motif discovery, HeliCis, which can find motif pairs separated by a distance (gap) that varies in a periodical manner. More specifically, the distance is modeled as some fixed offset <it>&#966; </it>(the phase) plus a variable integer multiple of the period <it>T </it>(Figure <figr fid="F1">1</figr>). Small deviations from exact periodic spacing may optionally be allowed. HeliCis detects patterns which are common to a group of sequences. A typical input would be regulatory DNA from a set of assumedly coregulated genes. The motif pair is assumed to be either present or absent in each sequence and may optionally be allowed to occur on either strand. The period <it>T </it>is specified by the user, but the program can be provided with a range of periods to evaluate. Upper and lower boundaries for the distance can be specified. The distance can be allowed to be negative, making it possible to find unordered motif pairs. Our method is not limited to finding periodically distributed binding sites. The flexibility of the algorithm makes the task of finding colocalized motifs (e.g. positioned within 100 bp of one another without periodicity) or motifs with fixed spacing (e.g. always exactly 25 bp from each other) into special cases simply achieved by choosing appropriate parameters. E.g. by setting the period to one, the model will find colocalized motifs without periodicity. Examples of parameter settings for different scenarios are available on the HeliCis home page<abbrgrp><abbr bid=" B20">20</abbr></abbrgrp>. The software also incorporates the possibility to take advantage of interspecies conservation by favoring motif placement in highly conserved regions.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Schematic drawing of the model structure</p>
               </caption>
               <text>
                  <p><b>Schematic drawing of the model structure</b>. The triangle and rectangle represent the first and second motif respectively. Gray boxes indicate valid locations for the second motif given the position of the first. The "phase" (distance offset) is assumed to be constant over all sequences and is determined by the algorithm.</p>
               </text>
               <graphic file="1471-2105-8-418-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Mathematical model</p>
            </st>
            <p>Let <it>S </it>be the set of <it>N </it>sequences to be analyzed. Each sequence <it>s</it><sub><it>i </it></sub>&#8712; <it>S</it>, of length <it>n</it><sub><it>i</it></sub>, (<it>i </it>= 1...<it>N</it>) is assumed to contain zero or one motif pair. Below, we refer to motif-containing sequences as being <it>regulated </it>and denote this by <it>R</it><sub><it>i</it></sub><it> = true</it>. The position of the first and second motif in a particular sequence <it>s</it><sub><it>i </it></sub>is denoted <it>a</it><sub><it>i </it></sub>and <it>b</it><sub><it>i </it></sub>respectively. Motifs are modeled as two position frequency matrices <it>A </it>and <it>B</it>, where <it>A</it><sub><it>j</it></sub>[<it>l</it>] and <it>B</it><sub><it>j</it></sub>[<it>l</it>] denotes the probability of the nucleotide <it>l </it>appearing in position <it>j </it>of motif A and B respectively. Unregulated sequences are modeled as background sequence, described by an order 0 Markov process with nucleotide frequencies <it>&#952;</it><sub>0</sub>. Regulated sequences are modeled as a combination of motif and background sequence. The probability of a sequence <it>s</it><sub><it>i </it></sub>(where <it>s</it><sub><it>i</it>,<it>j </it></sub>denotes the <it>j</it>:th base in the sequence) can therefore be written</p>
            <p>
               <display-formula id="M1">
                  <m:math name="1471-2105-8-418-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>p</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:msub>
                              <m:mi>s</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>|</m:mo>
                           <m:msub>
                              <m:mi>R</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mi>f</m:mi>
                           <m:mi>a</m:mi>
                           <m:mi>l</m:mi>
                           <m:mi>s</m:mi>
                           <m:mi>e</m:mi>
                           <m:mo>,</m:mo>
                           <m:msub>
                              <m:mi>&#952;</m:mi>
                              <m:mn>0</m:mn>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:msub>
                                 <m:mo>&#8719;</m:mo>
                                 <m:mi>j</m:mi>
                              </m:msub>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#952;</m:mi>
                                    <m:mn>0</m:mn>
                                 </m:msub>
                                 <m:mrow>
                                    <m:mo>[</m:mo>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>s</m:mi>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>j</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                    <m:mo>]</m:mo>
                                 </m:mrow>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGWbaCcqGGOaakcqWGZbWCdaWgaaWcbaGaemyAaKgabeaakiabcYha8jabdkfasnaaBaaaleaacqWGPbqAaeqaaOGaeyypa0JaemOzayMaemyyaeMaemiBaWMaem4CamNaemyzauMaeiilaWccciGae8hUde3aaSbaaSqaaiabicdaWaqabaGccqGGPaqkcqGH9aqpdaqeqaqaaiab=H7aXnaaBaaaleaacqaIWaamaeqaaOWaamWaaeaacqWGZbWCdaWgaaWcbaGaemyAaKMaeiilaWIaemOAaOgabeaaaOGaay5waiaaw2faaaWcbaGaemOAaOgabeqdcqGHpis1aaaa@50C8@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>and</p>
            <p>
               <display-formula id="M2">
                  <m:math name="1471-2105-8-418-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>p</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:msub>
                              <m:mi>s</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>|</m:mo>
                           <m:msub>
                              <m:mi>R</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mi>t</m:mi>
                           <m:mi>r</m:mi>
                           <m:mi>u</m:mi>
                           <m:mi>e</m:mi>
                           <m:mo>,</m:mo>
                           <m:mi>A</m:mi>
                           <m:mo>,</m:mo>
                           <m:mi>B</m:mi>
                           <m:mo>,</m:mo>
                           <m:msub>
                              <m:mi>&#952;</m:mi>
                              <m:mn>0</m:mn>
                           </m:msub>
                           <m:mo>,</m:mo>
                           <m:msub>
                              <m:mi>a</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>,</m:mo>
                           <m:msub>
                              <m:mi>b</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:msub>
                              <m:mi>Q</m:mi>
                              <m:mi>A</m:mi>
                           </m:msub>
                           <m:mrow>
                              <m:mo>[</m:mo>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>a</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                              </m:mrow>
                              <m:mo>]</m:mo>
                           </m:mrow>
                           <m:mo>&#8901;</m:mo>
                           <m:msub>
                              <m:mi>Q</m:mi>
                              <m:mi>B</m:mi>
                           </m:msub>
                           <m:mrow>
                              <m:mo>[</m:mo>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>b</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                              </m:mrow>
                              <m:mo>]</m:mo>
                           </m:mrow>
                           <m:mo>&#8901;</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:msub>
                                 <m:mo>&#8719;</m:mo>
                                 <m:mi>j</m:mi>
                              </m:msub>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#952;</m:mi>
                                    <m:mn>0</m:mn>
                                 </m:msub>
                                 <m:mrow>
                                    <m:mo>[</m:mo>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>s</m:mi>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>j</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                    <m:mo>]</m:mo>
                                 </m:mrow>
                              </m:mrow>
                           </m:mstyle>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGWbaCcqGGOaakcqWGZbWCdaWgaaWcbaGaemyAaKgabeaakiabcYha8jabdkfasnaaBaaaleaacqWGPbqAaeqaaOGaeyypa0JaemiDaqNaemOCaiNaemyDauNaemyzauMaeiilaWIaemyqaeKaeiilaWIaemOqaiKaeiilaWccciGae8hUde3aaSbaaSqaaiabicdaWaqabaGccqGGSaalcqWGHbqydaWgaaWcbaGaemyAaKgabeaakiabcYcaSiabdkgaInaaBaaaleaacqWGPbqAaeqaaOGaeiykaKIaeyypa0Jaemyuae1aaSbaaSqaaiabdgeabbqabaGcdaWadaqaaiabdggaHnaaBaaaleaacqWGPbqAaeqaaaGccaGLBbGaayzxaaGaeyyXICTaemyuae1aaSbaaSqaaiabdkeacbqabaGcdaWadaqaaiabdkgaInaaBaaaleaacqWGPbqAaeqaaaGccaGLBbGaayzxaaGaeyyXIC9aaebeaeaacqWF4oqCdaWgaaWcbaGaeGimaadabeaakmaadmaabaGaem4Cam3aaSbaaSqaaiabdMgaPjabcYcaSiabdQgaQbqabaaakiaawUfacaGLDbaaaSqaaiabdQgaQbqab0Gaey4dIunakiabcYcaSaaa@6EF1@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where</p>
            <p>
               <display-formula id="M3">
                  <m:math name="1471-2105-8-418-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>Q</m:mi>
                              <m:mi>A</m:mi>
                           </m:msub>
                           <m:mrow>
                              <m:mo>[</m:mo>
                              <m:mi>i</m:mi>
                              <m:mo>]</m:mo>
                           </m:mrow>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:msubsup>
                                 <m:mo>&#8719;</m:mo>
                                 <m:mrow>
                                    <m:mi>k</m:mi>
                                    <m:mo>=</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>W</m:mi>
                                       <m:mi>A</m:mi>
                                    </m:msub>
                                 </m:mrow>
                              </m:msubsup>
                              <m:mrow>
                                 <m:mfrac>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>A</m:mi>
                                          <m:mi>k</m:mi>
                                       </m:msub>
                                       <m:mrow>
                                          <m:mo>[</m:mo>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>s</m:mi>
                                                <m:mrow>
                                                   <m:mn>1</m:mn>
                                                   <m:mo>,</m:mo>
                                                   <m:msub>
                                                      <m:mi>i</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo>+</m:mo>
                                                   <m:mi>k</m:mi>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:mn>1</m:mn>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mo>]</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>&#952;</m:mi>
                                          <m:mn>0</m:mn>
                                       </m:msub>
                                       <m:mrow>
                                          <m:mo>[</m:mo>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>s</m:mi>
                                                <m:mrow>
                                                   <m:mn>1</m:mn>
                                                   <m:mo>,</m:mo>
                                                   <m:msub>
                                                      <m:mi>i</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo>+</m:mo>
                                                   <m:mi>k</m:mi>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:mn>1</m:mn>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mo>]</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:mfrac>
                              </m:mrow>
                           </m:mstyle>
                           <m:mo>,</m:mo>
                           <m:msub>
                              <m:mi>Q</m:mi>
                              <m:mi>B</m:mi>
                           </m:msub>
                           <m:mrow>
                              <m:mo>[</m:mo>
                              <m:mi>i</m:mi>
                              <m:mo>]</m:mo>
                           </m:mrow>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:msubsup>
                                 <m:mo>&#8719;</m:mo>
                                 <m:mrow>
                                    <m:mi>k</m:mi>
                                    <m:mo>=</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>W</m:mi>
                                       <m:mi>B</m:mi>
                                    </m:msub>
                                 </m:mrow>
                              </m:msubsup>
                              <m:mrow>
                                 <m:mfrac>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>B</m:mi>
                                          <m:mi>k</m:mi>
                                       </m:msub>
                                       <m:mrow>
                                          <m:mo>[</m:mo>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>s</m:mi>
                                                <m:mrow>
                                                   <m:mn>1</m:mn>
                                                   <m:mo>,</m:mo>
                                                   <m:msub>
                                                      <m:mi>i</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo>+</m:mo>
                                                   <m:mi>k</m:mi>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:mn>1</m:mn>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mo>]</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>&#952;</m:mi>
                                          <m:mn>0</m:mn>
                                       </m:msub>
                                       <m:mrow>
                                          <m:mo>[</m:mo>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>s</m:mi>
                                                <m:mrow>
                                                   <m:mn>1</m:mn>
                                                   <m:mo>,</m:mo>
                                                   <m:msub>
                                                      <m:mi>i</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo>+</m:mo>
                                                   <m:mi>k</m:mi>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:mn>1</m:mn>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mo>]</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:mfrac>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGrbqudaWgaaWcbaGaemyqaeeabeaakmaadmaabaGaemyAaKgacaGLBbGaayzxaaGaeyypa0ZaaebmaeaadaWcaaqaaiabdgeabnaaBaaaleaacqWGRbWAaeqaaOWaamWaaeaacqWGZbWCdaWgaaWcbaGaeGymaeJaeiilaWIaemyAaK2aaSbaaWqaaiabdMgaPbqabaWccqGHRaWkcqWGRbWAcqGHsislcqaIXaqmaeqaaaGccaGLBbGaayzxaaaabaacciGae8hUde3aaSbaaSqaaiabicdaWaqabaGcdaWadaqaaiabdohaZnaaBaaaleaacqaIXaqmcqGGSaalcqWGPbqAdaWgaaadbaGaemyAaKgabeaaliabgUcaRiabdUgaRjabgkHiTiabigdaXaqabaaakiaawUfacaGLDbaaaaaaleaacqWGRbWAcqGH9aqpcqaIXaqmaeaacqWGxbWvdaWgaaadbaGaemyqaeeabeaaa0Gaey4dIunakiabcYcaSiabdgfarnaaBaaaleaacqWGcbGqaeqaaOWaamWaaeaacqWGPbqAaiaawUfacaGLDbaacqGH9aqpdaqeWaqaamaalaaabaGaemOqai0aaSbaaSqaaiabdUgaRbqabaGcdaWadaqaaiabdohaZnaaBaaaleaacqaIXaqmcqGGSaalcqWGPbqAdaWgaaadbaGaemyAaKgabeaaliabgUcaRiabdUgaRjabgkHiTiabigdaXaqabaaakiaawUfacaGLDbaaaeaacqWF4oqCdaWgaaWcbaGaeGimaadabeaakmaadmaabaGaem4Cam3aaSbaaSqaaiabigdaXiabcYcaSiabdMgaPnaaBaaameaacqWGPbqAaeqaaSGaey4kaSIaem4AaSMaeyOeI0IaeGymaedabeaaaOGaay5waiaaw2faaaaaaSqaaiabdUgaRjabg2da9iabigdaXaqaaiabdEfaxnaaBaaameaacqWGcbGqaeqaaaqdcqGHpis1aaaa@8766@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>and where <it>W</it><sub><it>A </it></sub>and <it>W</it><sub><it>B </it></sub>are widths of the motifs. <it>a</it><sub><it>i </it></sub>and <it>b</it><sub><it>i </it></sub>cannot take on arbitrary values but will depend on each other, since we are looking for motif pairs where the distance between the two must follow certain criteria. We use a prior <it>p</it>(<it>a</it><sub><it>i</it></sub>,<it>b</it><sub><it>i</it></sub>) to reflect this, described below. We also assume there is a fixed prior probability <it>p(R</it><sub><it>i </it></sub>= <it>true) </it>for any sequence to be regulated. For <it>&#952;</it><sub>0</sub>, <ul><it>A</it><sub><it>j</it></sub></ul> and <ul><it>B</it><sub><it>j</it></sub></ul> we use Dirichlet priors, with pseudocounts <it>&#945;</it>[<it>l</it>] proportional to the frequencies of the bases in all the sequences. Our goal is to find values for <it>R </it>= (<it>R</it><sub>1</sub>, ..., <it>R</it><sub><it>N</it></sub>), <it>a </it>= (<it>a</it><sub>1</sub>, ..., <it>a</it><sub><it>N</it></sub>) and <it>b </it>= (<it>b</it><sub>1</sub>, ..., <it>b</it><sub><it>N</it></sub>) which maximize the posterior <it>p</it>(<it>R</it>,<it>a</it>,<it>b </it>| <it>S</it>). To accomplish this we use an algorithm based on the Gibbs sampling principle for motif discovery <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, which makes use of the predictive update version of the Gibbs sampler <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>.</p>
            <p>Given a partitioning of the sequences into motifs and background (<it>a</it>, <it>b </it>and <it>R</it>) we can calculate the total observed counts of nucleotide <it>l </it>in the background (<it>c</it><sub>0</sub>[<it>l</it>]) and in the different positions <it>i </it>of motif A (<it>c</it><sub><it>A</it>,<it>i</it></sub>[<it>l</it>]) and motif B (<it>c</it><sub><it>B</it>,<it>i</it></sub>[<it>l</it>]). Sequences where <it>R</it><sub><it>i </it></sub>= 0 are assumed to contain only background sequence. We can then estimate <it>A</it>, <it>B</it>, and <it>&#952;</it><sub>0 </sub>as the expectation of <it>p</it>(<it>A</it>, <it>B</it>, <it>&#952;</it><sub>0 </sub>| <it>R</it>, <it>a</it>, <it>b</it>, <it>S</it>):</p>
            <p>
               <display-formula id="M4">
                  <m:math name="1471-2105-8-418-i4" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>A</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mrow>
                              <m:mo>[</m:mo>
                              <m:mi>l</m:mi>
                              <m:mo>]</m:mo>
                           </m:mrow>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>c</m:mi>
                                    <m:mrow>
                                       <m:mi>A</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mi>i</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mrow>
                                    <m:mo>[</m:mo>
                                    <m:mi>l</m:mi>
                                    <m:mo>]</m:mo>
                                 </m:mrow>
                                 <m:mo>+</m:mo>
                                 <m:mi>&#945;</m:mi>
                                 <m:mrow>
                                    <m:mo>[</m:mo>
                                    <m:mi>l</m:mi>
                                    <m:mo>]</m:mo>
                                 </m:mrow>
                              </m:mrow>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:msub>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:msub>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>c</m:mi>
                                          <m:mrow>
                                             <m:mi>A</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>i</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mrow>
                                          <m:mo>[</m:mo>
                                          <m:mi>l</m:mi>
                                          <m:mo>]</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:mstyle>
                                 <m:mo>+</m:mo>
                                 <m:mstyle displaystyle="true">
                                    <m:msub>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:msub>
                                    <m:mrow>
                                       <m:mi>&#945;</m:mi>
                                       <m:mrow>
                                          <m:mo>[</m:mo>
                                          <m:mi>l</m:mi>
                                          <m:mo>]</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>,</m:mo>
                           <m:msub>
                              <m:mi>B</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mrow>
                              <m:mo>[</m:mo>
                              <m:mi>l</m:mi>
                              <m:mo>]</m:mo>
                           </m:mrow>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>c</m:mi>
                                    <m:mrow>
                                       <m:mi>B</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mi>i</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mrow>
                                    <m:mo>[</m:mo>
                                    <m:mi>l</m:mi>
                                    <m:mo>]</m:mo>
                                 </m:mrow>
                                 <m:mo>+</m:mo>
                                 <m:mi>&#945;</m:mi>
                                 <m:mrow>
                                    <m:mo>[</m:mo>
                                    <m:mi>l</m:mi>
                                    <m:mo>]</m:mo>
                                 </m:mrow>
                              </m:mrow>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:msub>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:msub>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>c</m:mi>
                                          <m:mrow>
                                             <m:mi>B</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>i</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mrow>
                                          <m:mo>[</m:mo>
                                          <m:mi>l</m:mi>
                                          <m:mo>]</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:mstyle>
                                 <m:mo>+</m:mo>
                                 <m:mstyle displaystyle="true">
                                    <m:msub>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:msub>
                                    <m:mrow>
                                       <m:mi>&#945;</m:mi>
                                       <m:mrow>
                                          <m:mo>[</m:mo>
                                          <m:mi>l</m:mi>
                                          <m:mo>]</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGbbqqdaWgaaWcbaGaemyAaKgabeaakmaadmaabaGaemiBaWgacaGLBbGaayzxaaGaeyypa0ZaaSaaaeaacqWGJbWydaWgaaWcbaGaemyqaeKaeiilaWIaemyAaKgabeaakmaadmaabaGaemiBaWgacaGLBbGaayzxaaGaey4kaSccciGae8xSde2aamWaaeaacqWGSbaBaiaawUfacaGLDbaaaeaadaaeqaqaaiabdogaJnaaBaaaleaacqWGbbqqcqGGSaalcqWGPbqAaeqaaOWaamWaaeaacqWGSbaBaiaawUfacaGLDbaaaSqaaiabdYgaSbqab0GaeyyeIuoakiabgUcaRmaaqababaGae8xSde2aamWaaeaacqWGSbaBaiaawUfacaGLDbaaaSqaaiabdYgaSbqab0GaeyyeIuoaaaGccqGGSaalcqWGcbGqdaWgaaWcbaGaemyAaKgabeaakmaadmaabaGaemiBaWgacaGLBbGaayzxaaGaeyypa0ZaaSaaaeaacqWGJbWydaWgaaWcbaGaemOqaiKaeiilaWIaemyAaKgabeaakmaadmaabaGaemiBaWgacaGLBbGaayzxaaGaey4kaSIae8xSde2aamWaaeaacqWGSbaBaiaawUfacaGLDbaaaeaadaaeqaqaaiabdogaJnaaBaaaleaacqWGcbGqcqGGSaalcqWGPbqAaeqaaOWaamWaaeaacqWGSbaBaiaawUfacaGLDbaaaSqaaiabdYgaSbqab0GaeyyeIuoakiabgUcaRmaaqababaGae8xSde2aamWaaeaacqWGSbaBaiaawUfacaGLDbaaaSqaaiabdYgaSbqab0GaeyyeIuoaaaaaaa@808A@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>
               <display-formula id="M5">
                  <m:math name="1471-2105-8-418-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>&#952;</m:mi>
                              <m:mn>0</m:mn>
                           </m:msub>
                           <m:mrow>
                              <m:mo>[</m:mo>
                              <m:mi>l</m:mi>
                              <m:mo>]</m:mo>
                           </m:mrow>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>c</m:mi>
                                    <m:mn>0</m:mn>
                                 </m:msub>
                                 <m:mrow>
                                    <m:mo>[</m:mo>
                                    <m:mi>l</m:mi>
                                    <m:mo>]</m:mo>
                                 </m:mrow>
                                 <m:mo>+</m:mo>
                                 <m:mi>&#945;</m:mi>
                                 <m:mrow>
                                    <m:mo>[</m:mo>
                                    <m:mi>l</m:mi>
                                    <m:mo>]</m:mo>
                                 </m:mrow>
                              </m:mrow>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:msub>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:msub>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>c</m:mi>
                                          <m:mn>0</m:mn>
                                       </m:msub>
                                       <m:mrow>
                                          <m:mo>[</m:mo>
                                          <m:mi>l</m:mi>
                                          <m:mo>]</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:mstyle>
                                 <m:mo>+</m:mo>
                                 <m:mstyle displaystyle="true">
                                    <m:msub>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mi>l</m:mi>
                                    </m:msub>
                                    <m:mrow>
                                       <m:mi>&#945;</m:mi>
                                       <m:mrow>
                                          <m:mo>[</m:mo>
                                          <m:mi>l</m:mi>
                                          <m:mo>]</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWF4oqCdaWgaaWcbaGaeGimaadabeaakmaadmaabaGaemiBaWgacaGLBbGaayzxaaGaeyypa0ZaaSaaaeaacqWGJbWydaWgaaWcbaGaeGimaadabeaakmaadmaabaGaemiBaWgacaGLBbGaayzxaaGaey4kaSIae8xSde2aamWaaeaacqWGSbaBaiaawUfacaGLDbaaaeaadaaeqaqaaiabdogaJnaaBaaaleaacqaIWaamaeqaaOWaamWaaeaacqWGSbaBaiaawUfacaGLDbaaaSqaaiabdYgaSbqab0GaeyyeIuoakiabgUcaRmaaqababaGae8xSde2aamWaaeaacqWGSbaBaiaawUfacaGLDbaaaSqaaiabdYgaSbqab0GaeyyeIuoaaaaaaa@51B2@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>As in other Gibbs motif samplers, an iterative update/sampling procedure is applied. One of the sequences, <it>s</it><sub><it>i</it></sub>, is removed from the alignment by setting <it>R</it><sub><it>i </it></sub>= 0. Given values for <it>A</it>, <it>B</it>, and <it>&#952;</it><sub>0 </sub>according to the formulas above, new values for <it>R</it><sub><it>i</it></sub>, <it>a</it><sub><it>i </it></sub>and <it>b</it><sub><it>i </it></sub>are determined by sampling from <it>p</it>(<it>R</it><sub><it>i</it></sub>, <it>a</it><sub><it>i</it></sub>, <it>b</it><sub><it>i </it></sub>| <it>A</it>, <it>B</it>, <it>&#952;</it><sub>0</sub>, <it>S</it>) using the following steps: Bayes formula on odds form gives that</p>
            <p>
               <display-formula id="M6">
                  <m:math name="1471-2105-8-418-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>R</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mi>f</m:mi>
                                       <m:mi>a</m:mi>
                                       <m:mi>l</m:mi>
                                       <m:mi>s</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mo>|</m:mo>
                                       <m:msub>
                                          <m:mi>s</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>&#952;</m:mi>
                                          <m:mn>0</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>A</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mi>B</m:mi>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>R</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mi>t</m:mi>
                                       <m:mi>r</m:mi>
                                       <m:mi>u</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mo>|</m:mo>
                                       <m:msub>
                                          <m:mi>s</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>&#952;</m:mi>
                                          <m:mn>0</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>A</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mi>B</m:mi>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>s</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>|</m:mo>
                                       <m:msub>
                                          <m:mi>R</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mi>f</m:mi>
                                       <m:mi>a</m:mi>
                                       <m:mi>l</m:mi>
                                       <m:mi>s</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>&#952;</m:mi>
                                          <m:mn>0</m:mn>
                                       </m:msub>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>s</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>|</m:mo>
                                       <m:msub>
                                          <m:mi>R</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mi>t</m:mi>
                                       <m:mi>r</m:mi>
                                       <m:mi>u</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>&#952;</m:mi>
                                          <m:mn>0</m:mn>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:mi>A</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mi>B</m:mi>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>&#8901;</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>R</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mi>f</m:mi>
                                       <m:mi>a</m:mi>
                                       <m:mi>l</m:mi>
                                       <m:mi>s</m:mi>
                                       <m:mi>e</m:mi>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>R</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mi>t</m:mi>
                                       <m:mi>r</m:mi>
                                       <m:mi>u</m:mi>
                                       <m:mi>e</m:mi>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaWcaaqaaiabdchaWnaabmaabaGaemOuai1aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpcqWGMbGzcqWGHbqycqWGSbaBcqWGZbWCcqWGLbqzcqGG8baFcqWGZbWCdaWgaaWcbaGaemyAaKgabeaakiabcYcaSGGaciab=H7aXnaaBaaaleaacqaIWaamaeqaaOGaeiilaWIaemyqaeKaeiilaWIaemOqaieacaGLOaGaayzkaaaabaGaemiCaa3aaeWaaeaacqWGsbGudaWgaaWcbaGaemyAaKgabeaakiabg2da9iabdsha0jabdkhaYjabdwha1jabdwgaLjabcYha8jabdohaZnaaBaaaleaacqWGPbqAaeqaaOGaeiilaWIae8hUde3aaSbaaSqaaiabicdaWaqabaGccqGGSaalcqWGbbqqcqGGSaalcqWGcbGqaiaawIcacaGLPaaaaaGaeyypa0ZaaSaaaeaacqWGWbaCdaqadaqaaiabdohaZnaaBaaaleaacqWGPbqAaeqaaOGaeiiFaWNaemOuai1aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpcqWGMbGzcqWGHbqycqWGSbaBcqWGZbWCcqWGLbqzcqGGSaalcqWF4oqCdaWgaaWcbaGaeGimaadabeaaaOGaayjkaiaawMcaaaqaaiabdchaWnaabmaabaGaem4Cam3aaSbaaSqaaiabdMgaPbqabaGccqGG8baFcqWGsbGudaWgaaWcbaGaemyAaKgabeaakiabg2da9iabdsha0jabdkhaYjabdwha1jabdwgaLjabcYcaSiab=H7aXnaaBaaaleaacqaIWaamaeqaaOGaeiilaWIaemyqaeKaeiilaWIaemOqaieacaGLOaGaayzkaaaaaiabgwSixpaalaaabaGaemiCaa3aaeWaaeaacqWGsbGudaWgaaWcbaGaemyAaKgabeaakiabg2da9iabdAgaMjabdggaHjabdYgaSjabdohaZjabdwgaLbGaayjkaiaawMcaaaqaaiabdchaWnaabmaabaGaemOuai1aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpcqWG0baDcqWGYbGCcqWG1bqDcqWGLbqzaiaawIcacaGLPaaaaaaaaa@AA0D@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>from which we get that</p>
            <p>
               <display-formula id="M7">
                  <m:math name="1471-2105-8-418-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>p</m:mi>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>R</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo>=</m:mo>
                                 <m:mi>t</m:mi>
                                 <m:mi>r</m:mi>
                                 <m:mi>u</m:mi>
                                 <m:mi>e</m:mi>
                                 <m:mo>|</m:mo>
                                 <m:msub>
                                    <m:mi>s</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo>,</m:mo>
                                 <m:msub>
                                    <m:mi>&#952;</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo>,</m:mo>
                                 <m:mi>A</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:mi>B</m:mi>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                           <m:mo>=</m:mo>
                           <m:msup>
                              <m:mrow>
                                 <m:mrow>
                                    <m:mo>[</m:mo>
                                    <m:mrow>
                                       <m:mn>1</m:mn>
                                       <m:mo>+</m:mo>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:mi>p</m:mi>
                                             <m:mrow>
                                                <m:mo>(</m:mo>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>s</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo>|</m:mo>
                                                   <m:msub>
                                                      <m:mi>R</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo>=</m:mo>
                                                   <m:mi>f</m:mi>
                                                   <m:mi>a</m:mi>
                                                   <m:mi>l</m:mi>
                                                   <m:mi>s</m:mi>
                                                   <m:mi>e</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:msub>
                                                      <m:mi>&#952;</m:mi>
                                                      <m:mn>0</m:mn>
                                                   </m:msub>
                                                </m:mrow>
                                                <m:mo>)</m:mo>
                                             </m:mrow>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>p</m:mi>
                                             <m:mrow>
                                                <m:mo>(</m:mo>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>s</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo>|</m:mo>
                                                   <m:msub>
                                                      <m:mi>R</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo>=</m:mo>
                                                   <m:mi>t</m:mi>
                                                   <m:mi>r</m:mi>
                                                   <m:mi>u</m:mi>
                                                   <m:mi>e</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>A</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>B</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:msub>
                                                      <m:mi>&#952;</m:mi>
                                                      <m:mn>0</m:mn>
                                                   </m:msub>
                                                </m:mrow>
                                                <m:mo>)</m:mo>
                                             </m:mrow>
                                          </m:mrow>
                                       </m:mfrac>
                                       <m:mo>&#8901;</m:mo>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:mn>1</m:mn>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mi>p</m:mi>
                                             <m:mrow>
                                                <m:mo>(</m:mo>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>R</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo>=</m:mo>
                                                   <m:mi>t</m:mi>
                                                   <m:mi>r</m:mi>
                                                   <m:mi>u</m:mi>
                                                   <m:mi>e</m:mi>
                                                </m:mrow>
                                                <m:mo>)</m:mo>
                                             </m:mrow>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>p</m:mi>
                                             <m:mrow>
                                                <m:mo>(</m:mo>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>R</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo>=</m:mo>
                                                   <m:mi>t</m:mi>
                                                   <m:mi>r</m:mi>
                                                   <m:mi>u</m:mi>
                                                   <m:mi>e</m:mi>
                                                </m:mrow>
                                                <m:mo>)</m:mo>
                                             </m:mrow>
                                          </m:mrow>
                                       </m:mfrac>
                                    </m:mrow>
                                    <m:mo>]</m:mo>
                                 </m:mrow>
                              </m:mrow>
                              <m:mrow>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:msup>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGWbaCdaqadaqaaiabdkfasnaaBaaaleaacqWGPbqAaeqaaOGaeyypa0JaemiDaqNaemOCaiNaemyDauNaemyzauMaeiiFaWNaem4Cam3aaSbaaSqaaiabdMgaPbqabaGccqGGSaaliiGacqWF4oqCdaWgaaWcbaGaemyAaKgabeaakiabcYcaSiabdgeabjabcYcaSiabdkeacbGaayjkaiaawMcaaiabg2da9maadmaabaGaeGymaeJaey4kaSYaaSaaaeaacqWGWbaCdaqadaqaaiabdohaZnaaBaaaleaacqWGPbqAaeqaaOGaeiiFaWNaemOuai1aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpcqWGMbGzcqWGHbqycqWGSbaBcqWGZbWCcqWGLbqzcqGGSaalcqWF4oqCdaWgaaWcbaGaeGimaadabeaaaOGaayjkaiaawMcaaaqaaiabdchaWnaabmaabaGaem4Cam3aaSbaaSqaaiabdMgaPbqabaGccqGG8baFcqWGsbGudaWgaaWcbaGaemyAaKgabeaakiabg2da9iabdsha0jabdkhaYjabdwha1jabdwgaLjabcYcaSiabdgeabjabcYcaSiabdkeacjabcYcaSiab=H7aXnaaBaaaleaacqaIWaamaeqaaaGccaGLOaGaayzkaaaaaiabgwSixpaalaaabaGaeGymaeJaeyOeI0IaemiCaa3aaeWaaeaacqWGsbGudaWgaaWcbaGaemyAaKgabeaakiabg2da9iabdsha0jabdkhaYjabdwha1jabdwgaLbGaayjkaiaawMcaaaqaaiabdchaWnaabmaabaGaemOuai1aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpcqWG0baDcqWGYbGCcqWG1bqDcqWGLbqzaiaawIcacaGLPaaaaaaacaGLBbGaayzxaaWaaWbaaSqabeaacqGHsislcqaIXaqmaaGccqGGSaalaaa@985A@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>which is used to sample whether <it>R</it><sub><it>i </it></sub>= <it>true</it>. Note that, using (1) and (2), we have</p>
            <p>
               <display-formula id="M8">
                  <m:math name="1471-2105-8-418-i8" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>s</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>|</m:mo>
                                       <m:msub>
                                          <m:mi>R</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mi>t</m:mi>
                                       <m:mi>r</m:mi>
                                       <m:mi>u</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mi>A</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mi>B</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>&#952;</m:mi>
                                          <m:mn>0</m:mn>
                                       </m:msub>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>s</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>|</m:mo>
                                       <m:msub>
                                          <m:mi>R</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mi>f</m:mi>
                                       <m:mi>a</m:mi>
                                       <m:mi>l</m:mi>
                                       <m:mi>s</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>&#952;</m:mi>
                                          <m:mn>0</m:mn>
                                       </m:msub>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>a</m:mi>
                                       <m:mi>i</m:mi>
                                    </m:msub>
                                    <m:mo>,</m:mo>
                                    <m:msub>
                                       <m:mi>b</m:mi>
                                       <m:mi>i</m:mi>
                                    </m:msub>
                                    <m:mo>=</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>n</m:mi>
                                       <m:mi>i</m:mi>
                                    </m:msub>
                                 </m:mrow>
                              </m:munderover>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>a</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>b</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                           </m:mstyle>
                           <m:mo>&#8901;</m:mo>
                           <m:msub>
                              <m:mi>Q</m:mi>
                              <m:mi>A</m:mi>
                           </m:msub>
                           <m:mrow>
                              <m:mo>[</m:mo>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>a</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                              </m:mrow>
                              <m:mo>]</m:mo>
                           </m:mrow>
                           <m:mo>&#8901;</m:mo>
                           <m:msub>
                              <m:mi>Q</m:mi>
                              <m:mi>B</m:mi>
                           </m:msub>
                           <m:mrow>
                              <m:mo>[</m:mo>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>b</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                              </m:mrow>
                              <m:mo>]</m:mo>
                           </m:mrow>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaWcaaqaaiabdchaWnaabmaabaGaem4Cam3aaSbaaSqaaiabdMgaPbqabaGccqGG8baFcqWGsbGudaWgaaWcbaGaemyAaKgabeaakiabg2da9iabdsha0jabdkhaYjabdwha1jabdwgaLjabcYcaSiabdgeabjabcYcaSiabdkeacjabcYcaSGGaciab=H7aXnaaBaaaleaacqaIWaamaeqaaaGccaGLOaGaayzkaaaabaGaemiCaa3aaeWaaeaacqWGZbWCdaWgaaWcbaGaemyAaKgabeaakiabcYha8jabdkfasnaaBaaaleaacqWGPbqAaeqaaOGaeyypa0JaemOzayMaemyyaeMaemiBaWMaem4CamNaemyzauMaeiilaWIae8hUde3aaSbaaSqaaiabicdaWaqabaaakiaawIcacaGLPaaaaaGaeyypa0ZaaabCaeaacqWGWbaCdaqadaqaaiabdggaHnaaBaaaleaacqWGPbqAaeqaaOGaeiilaWIaemOyai2aaSbaaSqaaiabdMgaPbqabaaakiaawIcacaGLPaaaaSqaaiabdggaHnaaBaaameaacqWGPbqAaeqaaSGaeiilaWIaemOyai2aaSbaaWqaaiabdMgaPbqabaWccqGH9aqpcqaIXaqmaeaacqWGUbGBdaWgaaadbaGaemyAaKgabeaaa0GaeyyeIuoakiabgwSixlabdgfarnaaBaaaleaacqWGbbqqaeqaaOWaamWaaeaacqWGHbqydaWgaaWcbaGaemyAaKgabeaaaOGaay5waiaaw2faaiabgwSixlabdgfarnaaBaaaleaacqWGcbGqaeqaaOWaamWaaeaacqWGIbGydaWgaaWcbaGaemyAaKgabeaaaOGaay5waiaaw2faaiabc6caUaaa@872C@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>We define the prior <it>p</it>(<it>a</it><sub><it>i</it></sub>,<it>b</it><sub><it>i</it></sub>) to be proportional to an indicator function <it>e</it>(<it>a</it><sub><it>i</it></sub>,<it>b</it><sub><it>i</it></sub>) which is zero unless <it>a</it><sub><it>i </it></sub>and <it>b</it><sub><it>i </it></sub>represent a pair of motif positions compatible with the assumptions that the motifs are both within the sequence, do not overlap, and have a distance conforming to the assumed periodicity and the assumed possible variation around this periodicity. As described above, the allowed distance is modeled as a fixed phase <it>&#966; </it>plus a variable integer multiple of the period <it>T </it>(Figure <figr fid="F1">1</figr>). Specifically, given <it>W</it><sub><it>A</it></sub>, <it>W</it><sub><it>B</it></sub>, the period <it>T</it>, the phase <it>&#966;</it>, the allowed deviation from exact periodic distance ("noise"), the length of sequence <it>i </it>and the minimum and maximum distances, we can for all <it>i </it>= 1..<it>n</it><sub><it>i </it></sub>find all <it>j </it>such that <it>e</it>(<it>i</it>,<it>j</it>) = 1, and the value of (8) can be calculated as</p>
            <p>
               <display-formula id="M9">
                  <m:math name="1471-2105-8-418-i9" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mfrac bevelled="true">
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munderover>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>n</m:mi>
                                             <m:mi>i</m:mi>
                                          </m:msub>
                                       </m:mrow>
                                    </m:munderover>
                                    <m:mrow>
                                       <m:mrow>
                                          <m:mo>[</m:mo>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>Q</m:mi>
                                                <m:mi>A</m:mi>
                                             </m:msub>
                                             <m:mrow>
                                                <m:mo>[</m:mo>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>a</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                </m:mrow>
                                                <m:mo>]</m:mo>
                                             </m:mrow>
                                             <m:mo>&#8901;</m:mo>
                                             <m:mstyle displaystyle="true">
                                                <m:munder>
                                                   <m:mo>&#8721;</m:mo>
                                                   <m:mrow>
                                                      <m:mi>j</m:mi>
                                                      <m:mo>:</m:mo>
                                                      <m:mi>e</m:mi>
                                                      <m:mo stretchy="false">(</m:mo>
                                                      <m:msub>
                                                         <m:mi>a</m:mi>
                                                         <m:mi>i</m:mi>
                                                      </m:msub>
                                                      <m:mo>,</m:mo>
                                                      <m:mi>j</m:mi>
                                                      <m:mo stretchy="false">)</m:mo>
                                                      <m:mo>=</m:mo>
                                                      <m:mn>1</m:mn>
                                                   </m:mrow>
                                                </m:munder>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>Q</m:mi>
                                                      <m:mi>B</m:mi>
                                                   </m:msub>
                                                   <m:mrow>
                                                      <m:mo>[</m:mo>
                                                      <m:mi>j</m:mi>
                                                      <m:mo>]</m:mo>
                                                   </m:mrow>
                                                </m:mrow>
                                             </m:mstyle>
                                          </m:mrow>
                                          <m:mo>]</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munderover>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>n</m:mi>
                                             <m:mi>i</m:mi>
                                          </m:msub>
                                       </m:mrow>
                                    </m:munderover>
                                    <m:mrow>
                                       <m:mstyle displaystyle="true">
                                          <m:munderover>
                                             <m:mo>&#8721;</m:mo>
                                             <m:mrow>
                                                <m:mi>j</m:mi>
                                                <m:mo>=</m:mo>
                                                <m:mn>1</m:mn>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mi>n</m:mi>
                                                   <m:mi>i</m:mi>
                                                </m:msub>
                                             </m:mrow>
                                          </m:munderover>
                                          <m:mrow>
                                             <m:mi>e</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>i</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>j</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                       </m:mstyle>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaWccaqaamaaqahabaWaamWaaeaacqWGrbqudaWgaaWcbaGaemyqaeeabeaakmaadmaabaGaemyyae2aaSbaaSqaaiabdMgaPbqabaaakiaawUfacaGLDbaacqGHflY1daaeqbqaaiabdgfarnaaBaaaleaacqWGcbGqaeqaaOWaamWaaeaacqWGQbGAaiaawUfacaGLDbaaaSqaaiabdQgaQjabcQda6iabdwgaLjabcIcaOiabdggaHnaaBaaameaacqWGPbqAaeqaaSGaeiilaWIaemOAaOMaeiykaKIaeyypa0JaeGymaedabeqdcqGHris5aaGccaGLBbGaayzxaaaaleaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGUbGBdaWgaaadbaGaemyAaKgabeaaa0GaeyyeIuoaaOqaamaaqahabaWaaabCaeaacqWGLbqzcqGGOaakcqWGPbqAcqGGSaalcqWGQbGAcqGGPaqkaSqaaiabdQgaQjabg2da9iabigdaXaqaaiabd6gaUnaaBaaameaacqWGPbqAaeqaaaqdcqGHris5aaWcbaGaemyAaKMaeyypa0JaeGymaedabaGaemOBa42aaSbaaWqaaiabdMgaPbqabaaaniabggHiLdaaaOGaeiOla4caaa@6DA4@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Secondly, we get that <it>p</it>(<it>a</it><sub><it>i </it></sub>| <it>R</it><sub><it>i </it></sub>= <it>true</it>, <it>A</it>, <it>B</it>, <it>&#952;</it><sub>0</sub>, S) is proportional to</p>
            <p>
               <display-formula id="M10">
                  <m:math name="1471-2105-8-418-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>Q</m:mi>
                              <m:mi>A</m:mi>
                           </m:msub>
                           <m:mrow>
                              <m:mo>[</m:mo>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>a</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                              </m:mrow>
                              <m:mo>]</m:mo>
                           </m:mrow>
                           <m:mo>&#8901;</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munder>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mi>k</m:mi>
                                    <m:mo>&#8712;</m:mo>
                                    <m:mi>e</m:mi>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mi>i</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:msub>
                                       <m:mi>a</m:mi>
                                       <m:mi>i</m:mi>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                              </m:munder>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>Q</m:mi>
                                    <m:mi>B</m:mi>
                                 </m:msub>
                                 <m:mrow>
                                    <m:mo>[</m:mo>
                                    <m:mi>k</m:mi>
                                    <m:mo>]</m:mo>
                                 </m:mrow>
                              </m:mrow>
                           </m:mstyle>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGrbqudaWgaaWcbaGaemyqaeeabeaakmaadmaabaGaemyyae2aaSbaaSqaaiabdMgaPbqabaaakiaawUfacaGLDbaacqGHflY1daaeqbqaaiabdgfarnaaBaaaleaacqWGcbGqaeqaaOWaamWaaeaacqWGRbWAaiaawUfacaGLDbaaaSqaaiabdUgaRjabgIGiolabdwgaLjabcIcaOiabdMgaPjabcYcaSiabdggaHnaaBaaameaacqWGPbqAaeqaaSGaeiykaKcabeqdcqGHris5aOGaeiilaWcaaa@49FD@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>so if <it>R</it><sub><it>i </it></sub>= <it>true</it>, a value for <it>a</it><sub><it>i </it></sub>can be sampled by using probabilities proportional to the numbers (10). Finally, <it>b</it><sub><it>i </it></sub>can be sampled by noting that given <it>R</it><sub><it>i </it></sub>= <it>true </it>and a value for <it>a</it><sub><it>i</it></sub>, the probabilities for valid values of <it>b</it><sub><it>i </it></sub>according to <it>e</it>(<it>i</it>, <it>a</it><sub><it>i</it></sub>) are proportional to <it>Q</it><sub><it>B</it></sub>[<it>b</it><sub><it>i</it></sub>].</p>
            <p>The algorithm is initiated by setting all <it>R</it><sub><it>i </it></sub>= <it>false</it>. The update/sampling procedure described above is then performed for each sequence <it>s</it><sub><it>i</it></sub>, <it>i </it>= 1...<it>N</it>. When all <it>R</it><sub><it>i</it></sub>, <it>a</it><sub><it>i </it></sub>and <it>b</it><sub><it>i </it></sub>have been updated, the alignment is scored according to</p>
            <p>
               <display-formula id="M11">
                  <m:math name="1471-2105-8-418-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mtable columnalign="left">
                           <m:mtr>
                              <m:mtd>
                                 <m:mi>F</m:mi>
                                 <m:mo>=</m:mo>
                                 <m:mi>log</m:mi>
                                 <m:mo>&#8289;</m:mo>
                                 <m:mfrac>
                                    <m:mrow>
                                       <m:mi>p</m:mi>
                                       <m:mrow>
                                          <m:mo>(</m:mo>
                                          <m:mrow>
                                             <m:mi>S</m:mi>
                                             <m:mo>|</m:mo>
                                             <m:mi>A</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>B</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:msub>
                                                <m:mi>&#952;</m:mi>
                                                <m:mn>0</m:mn>
                                             </m:msub>
                                             <m:mo>,</m:mo>
                                             <m:mi>a</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>b</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>R</m:mi>
                                          </m:mrow>
                                          <m:mo>)</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mi>p</m:mi>
                                       <m:mrow>
                                          <m:mo>(</m:mo>
                                          <m:mrow>
                                             <m:mi>S</m:mi>
                                             <m:mo>|</m:mo>
                                             <m:msub>
                                                <m:mi>&#952;</m:mi>
                                                <m:mn>0</m:mn>
                                             </m:msub>
                                             <m:mo>,</m:mo>
                                             <m:mi>R</m:mi>
                                             <m:mo>=</m:mo>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>f</m:mi>
                                             <m:mi>a</m:mi>
                                             <m:mi>l</m:mi>
                                             <m:mi>s</m:mi>
                                             <m:mi>e</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mn>...</m:mn>
                                             <m:mo>,</m:mo>
                                             <m:mi>f</m:mi>
                                             <m:mi>a</m:mi>
                                             <m:mi>l</m:mi>
                                             <m:mi>s</m:mi>
                                             <m:mi>e</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                          <m:mo>)</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:mfrac>
                              </m:mtd>
                           </m:mtr>
                           <m:mtr>
                              <m:mtd>
                                 <m:mo>=</m:mo>
                                 <m:mstyle displaystyle="true">
                                    <m:msubsup>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>k</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>W</m:mi>
                                             <m:mi>A</m:mi>
                                          </m:msub>
                                       </m:mrow>
                                    </m:msubsup>
                                    <m:mrow>
                                       <m:mstyle displaystyle="true">
                                          <m:msub>
                                             <m:mo>&#8721;</m:mo>
                                             <m:mi>l</m:mi>
                                          </m:msub>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>c</m:mi>
                                                <m:mrow>
                                                   <m:mi>A</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>k</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mrow>
                                                <m:mo>[</m:mo>
                                                <m:mi>l</m:mi>
                                                <m:mo>]</m:mo>
                                             </m:mrow>
                                          </m:mrow>
                                       </m:mstyle>
                                       <m:mi>log</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>A</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                             <m:mrow>
                                                <m:mo>[</m:mo>
                                                <m:mi>l</m:mi>
                                                <m:mo>]</m:mo>
                                             </m:mrow>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>&#952;</m:mi>
                                                <m:mn>0</m:mn>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>l</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                       </m:mfrac>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mstyle>
                                 <m:mo>+</m:mo>
                                 <m:mstyle displaystyle="true">
                                    <m:msubsup>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>k</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>W</m:mi>
                                             <m:mi>B</m:mi>
                                          </m:msub>
                                       </m:mrow>
                                    </m:msubsup>
                                    <m:mrow>
                                       <m:mstyle displaystyle="true">
                                          <m:msub>
                                             <m:mo>&#8721;</m:mo>
                                             <m:mi>l</m:mi>
                                          </m:msub>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>c</m:mi>
                                                <m:mrow>
                                                   <m:mi>B</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>k</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mrow>
                                                <m:mo>[</m:mo>
                                                <m:mi>l</m:mi>
                                                <m:mo>]</m:mo>
                                             </m:mrow>
                                          </m:mrow>
                                       </m:mstyle>
                                       <m:mi>log</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>B</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                             <m:mrow>
                                                <m:mo>[</m:mo>
                                                <m:mi>l</m:mi>
                                                <m:mo>]</m:mo>
                                             </m:mrow>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>&#952;</m:mi>
                                                <m:mn>0</m:mn>
                                             </m:msub>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>l</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                       </m:mfrac>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mtd>
                           </m:mtr>
                        </m:mtable>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakqaabeqaaiabdAeagjabg2da9iGbcYgaSjabc+gaVjabcEgaNnaalaaabaGaemiCaa3aaeWaaeaacqWGtbWucqGG8baFcqWGbbqqcqGGSaalcqWGcbGqcqGGSaaliiGacqWF4oqCdaWgaaWcbaGaeGimaadabeaakiabcYcaSiabdggaHjabcYcaSiabdkgaIjabcYcaSiabdkfasbGaayjkaiaawMcaaaqaaiabdchaWnaabmaabaGaem4uamLaeiiFaWNae8hUde3aaSbaaSqaaiabicdaWaqabaGccqGGSaalcqWGsbGucqGH9aqpcqGGOaakcqWGMbGzcqWGHbqycqWGSbaBcqWGZbWCcqWGLbqzcqGGSaalcqGGUaGlcqGGUaGlcqGGUaGlcqGGSaalcqWGMbGzcqWGHbqycqWGSbaBcqWGZbWCcqWGLbqzcqGGPaqkaiaawIcacaGLPaaaaaaabaGaeyypa0ZaaabmaeaadaaeqaqaaiabdogaJnaaBaaaleaacqWGbbqqcqGGSaalcqWGRbWAaeqaaOWaamWaaeaacqWGSbaBaiaawUfacaGLDbaaaSqaaiabdYgaSbqab0GaeyyeIuoakiGbcYgaSjabc+gaVjabcEgaNjabcIcaOmaalaaabaGaemyqae0aaSbaaSqaaiabdUgaRbqabaGcdaWadaqaaiabdYgaSbGaay5waiaaw2faaaqaaiab=H7aXnaaBaaaleaacqaIWaamaeqaaOGaeiikaGIaemiBaWMaeiykaKcaaiabcMcaPaWcbaGaem4AaSMaeyypa0JaeGymaedabaGaem4vaC1aaSbaaWqaaiabdgeabbqabaaaniabggHiLdGccqGHRaWkdaaeWaqaamaaqababaGaem4yam2aaSbaaSqaaiabdkeacjabcYcaSiabdUgaRbqabaGcdaWadaqaaiabdYgaSbGaay5waiaaw2faaaWcbaGaemiBaWgabeqdcqGHris5aOGagiiBaWMaei4Ba8Maei4zaCMaeiikaGYaaSaaaeaacqWGcbGqdaWgaaWcbaGaem4AaSgabeaakmaadmaabaGaemiBaWgacaGLBbGaayzxaaaabaGae8hUde3aaSbaaSqaaiabicdaWaqabaGccqGGOaakcqWGSbaBcqGGPaqkaaGaeiykaKcaleaacqWGRbWAcqGH9aqpcqaIXaqmaeaacqWGxbWvdaWgaaadbaGaemOqaieabeaaa0GaeyyeIuoaaaaa@B0C2@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>We are interested in finding values which maximize <it>p</it>(<it>R</it>, <it>a</it>, <it>b </it>| <it>S</it>), which approximately corresponds to maximizing <it>F </it>above. Having completed a full iteration of the update/sampling procedure, sampling continues at the first sequence. The algorithm stops when the same <it>F </it>has been observed several times in a row or when the maximum number of iterations is reached. To avoid getting stuck in local maxima, the algorithm is restarted several times. It is also systematically restarted with different settings of the phase <it>&#966; </it>(all values between 0...<it>T</it>-1 are evaluated), as this parameter is not updated during each run of the algorithm and therefore has to be determined exhaustively.</p>
            <p>To avoid that the algorithm finds "shifted versions" of the actual motifs, a type of shift jump is introduced. Each time the score <it>F </it>is improved, possible shifts of the motifs are found, defined by adding or subtracting some integer to all <it>a</it><sub><it>i </it></sub>and <it>b</it><sub><it>i</it></sub>. For each of the possible shifts (<it>a</it>*, <it>b</it>*), we calculate <it>F</it>. If a better score is encountered, the positions are updated and used as a starting point for the next update/sampling iteration.</p>
            <p>For simplicity, we have described the case where motif pairs are assumed to occur only on the forward strand. Our method optionally permits both forward and reverse strands to be searched. In this case, the sampling distribution and the calculation of the posterior probability for <it>R </it>is extended to included both strands. Optionally, information about conservation between species can be used to favor placement of motifs in evolutionarily conserved regions. In this case, instead of single sequences, pairwise alignments of orthologous sequences are loaded into the program. Gaps are removed from the "base" sequences to ensure that correct distances are maintained. The fraction of conserved bases over windows the same size as the motifs is calculated for each possible motif position. The sampling distributions are then weighted according to this vector. A similar strategy is implemented in <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. The same vector is also used to exclude regions from being searched. This allows the sampler to be restarted after convergence to search for a new set of non-overlapping binding sites.</p>
         </sec>
         <sec>
            <st>
               <p>Implementation and user interface</p>
            </st>
            <p>The main algorithm is implemented in Matlab while time critical functions are written in the C language. These can be downloaded for local use (see Additional File <supplr sid="S1">1</supplr>). HeliCis is also available through a web interface<abbrgrp><abbr bid=" B20">20</abbr></abbrgrp> which provides several templates to simplify parameter setup. To make it easier to understand the function of the different parameters, these are visualized using an interactive schematic figure which is updated to reflect the current settings (Figure <figr fid="F2">2</figr>). The web interface is implemented in php and the source files can be made available upon request.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Matlab and C source files. This archive contains source files and instructions for compilation.</p>
               </text>
               <file name="1471-2105-8-418-S1.zip">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Web interface screenshot, showing the parameter setup screen</p>
               </caption>
               <text>
                  <p><b>Web interface screenshot, showing the parameter setup screen</b>. The schematic shows valid positions for motif 2 given the position of motif one. The image is dynamically generated to reflect the current parameter settings.</p>
               </text>
               <graphic file="1471-2105-8-418-2"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Performance vs. motif information content</p>
            </st>
            <p>The performance was evaluated on synthetic sequence datasets. Ordered pairs of SRF (CArG) and ETS binding sites, generated from raw TRANSFAC <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> weight matrices (M01007 and M00771), were planted into sets of 15 random sequences of length 400 bp. The choice of matrices was arbitrary, although these factors have been shown to cooperatively regulate certain genes <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. One motif pair was assigned to each sequence and the distance between each pair was set to a uniformly random multiple (n = 0...4) of the helical period (10 bp) plus a 5 bp offset. The binding sites were thus both colocalized and periodically spaced. The TRANSFAC CArG matrix is based on 54 occurrences and the central 12 bases were used when generating the test sequences (the core CArG motif is 10 bp long). The ETS matrix is 12 bp long and based on 48 occurrences. Raw counts were converted into relative frequencies and bases were randomly selected according to this distribution. Several sequence sets with increasingly weaker motifs were generated by varying the number of pseudocounts between 0 and 4. The information content of the resulting matrices was calculated. Evaluation sequence sets are available both as supplementary information (see Additional File <supplr sid="S2">2</supplr>) and for download on the HeliCis homepage <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>.</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>Evaluation sequences. This archive contains evaluation sequence datasets and the Matlab scripts used for generating them.</p>
               </text>
               <file name="1471-2105-8-418-S2.zip">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>HeliCis with default settings for periodic spacing (period 10, motif distance 0...50 bp), HeliCis with colocalization settings (period 1, distance 0...50 bp) and HeliCis with single motif settings were compared to an established single motif discovery tool based on the EM algorithm, MEME <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, and a motif discovery tool based on Gibbs sampling, BioProspector <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. The latter was run in "two-block" mode, searching for motif pairs with a maximum gap of 50 bp. All were configured to search the forward strand only with a fixed motif width of 12 bp, and with forced presence of a motif in each sequence (oops = "one occurrence per sequence" model in MEME, "-a 1" switch for BioProspector, "-p 1" switch for HeliCis). The quality of the resulting alignments was determined by calculating the fraction of correctly identified sites (Figure <figr fid="F3">3</figr>). Results shown are average values from five independent trials where the sequence sets were regenerated each time. It should be noted that BioProspector, unlike HeliCis and MEME, cannot be forced to detect exactly one occurrence per sequence, but will often assign several motifs per sequence. This should be taken into account when evaluating the results, as this model may be slightly disadvantageous on this dataset.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Performance on synthetic sequence datasets containing colocalized and periodically spaced CArG and ETS motifs with varying information content</p>
               </caption>
               <text>
                  <p><b>Performance on synthetic sequence datasets containing colocalized and periodically spaced CArG and ETS motifs with varying information content</b>. HeliCis with different settings was compared to MEME and BioProspector. The information content of the motifs was gradually reduced by varying the number of pseudocounts and the sensitivity of the different tools was determined by calculating the fraction of correctly identified motifs. Results are from 5 averaged trials.</p>
               </text>
               <graphic file="1471-2105-8-418-3"/>
            </fig>
            <p>The CArG motif has high information content and all tested tools performed reasonably well on this motif before pseudocounts were added. However, the sensitivity of HeliCis with periodic and colocalization settings was still higher, reaching 99 % and 97 % respectively, as opposed to 88 % for MEME and BioProspector. As the information content of the motifs was lowered, the ability of the periodic model to make use of the periodicity in the data became obvious and the other methods were outperformed. When the already weak ETS motif was obscured by added pseudocounts, HeliCis in colocalization mode quickly lost its ability to make use of this motif to improve detection of the CArG box.</p>
            <p>The ETS motif was not efficiently detected using any of the single motif methods, and this is where the advantages of the HeliCis model were most obvious. BioProspector in two-block mode was able to draw some advantage of the proximity to the stronger CArG motif and reached 65 % sensivity with no added pseudocounts, to be compared with ~42 % for MEME and HeliCis in single motif mode. The corresponding result for HeliCis in colocalization mode was 92 %, and the advantage was even bigger when the information content of the motifs was reduced. On the ETS motif, HeliCis in periodic mode had considerably higher sensitivity than all the other tested methods throughout the series.</p>
         </sec>
         <sec>
            <st>
               <p>Performance vs. fraction of sequences containing motifs</p>
            </st>
            <p>In a second evaluation, sets of 20 sequences containing artificially planted CArG and ETS motifs were generated as described above. However, this time the information content of the motif matrices was kept constant (one pseudocount added). Instead, the fraction of sequences containing motifs was gradually reduced from 20/20 to 10/20, thus making them increasingly difficult to detect. In this case, the tools were not forced to detect motifs in all sequences (zoops = "zero or one occurrences per sequence" model in MEME, default for BioProspector and HeliCis). Other settings were as described above. To account for false positive predictions, a PPV score (positive predictive value, i.e. the fraction of predicted sites which are correct) was calculated, in addition to sensitivity. The results, shown in Figure <figr fid="F4">4</figr>, are average values from 5 independent trials.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Performance on synthetic sequence datasets with varying motif coverage</p>
               </caption>
               <text>
                  <p><b>Performance on synthetic sequence datasets with varying motif coverage</b>. Datasets of 20 sequences with colocalized and periodically spaced CArG and ETS motifs were generated. The proportion of sequences containing the motifs was gradually reduced, thus making them increasingly difficult to detect. HeliCis with different settings was compared to MEME and BioProspector. The plots show sensitivity and positive predictive value (PPV = TP/(TP + FP)). Results are from 5 averaged trials.</p>
               </text>
               <graphic file="1471-2105-8-418-4"/>
            </fig>
            <p>Again, the less informative ETS motif benefited considerably from the HeliCis model, both with periodic and colocalization settings. This motif was only sporadically detected by MEME, BioProspector and HeliCis with single motif settings, while HeliCis in periodic mode reached 91 % sensitivity when 16/20 sequences contained the motifs. When the fraction of motif-containing sequences was high (20/20 to 16/20) also the CArG motif was detected with higher sensitivity by HeliCis in periodic mode compared to the other tested tools.</p>
            <p>In the most challenging dataset, with motifs in 10 out of 20 sequences, HeliCis was not able to detect any motifs. However, both MEME and BioProspector could sporadically detect the CArG motif with average sensitivity scores of 32 % and 18 % respectively. MEME generally performed well in the PPV plots, reflecting that it was less prone to assigning false positive motifs in non-motif containing sequences. BioProspector does not have the possibility to limit the number of detected two-block motifs to maximum one per sequence. Due to a larger number of false positive predictions it therefore scored unfavorably in the PPV plots. It should be noted that its two-block model was occasionally able to detect the difficult ETS motif with high sensitivity, however, the average performance was still similar to the single motif methods.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>We have described a novel tool for <it>de novo </it>discovery of regulatory DNA motifs, HeliCis, available for local use and through a web interface<abbrgrp><abbr bid=" B20">20</abbr></abbrgrp>. Our method can efficiently detect motif pairs which are spatially colocalized in regulatory DNA. It is based on a flexible probabilistic model which optionally allows <it>de novo </it>discovery of motif pairs with periodic spacing (helical phasing). A large number of experimental studies show the importance of helical phasing in regulatory regions. The ability to detect such patterns <it>de novo </it>without prior knowledge of recognition sequences may be useful in the study of coregulated CRMs.</p>
         <p>Our results show that HeliCis is able to efficiently take advantage of the synergistic effects of colocalization to improve sensitivity to weak DNA patterns. HeliCis in colocalization mode was evaluated on planted ETS and CArG motifs which were colocalized with a spacer of random variable length. The weaker ETS motif was detected with far better accuracy compared to other tested methods, and this can be attributed to the ability of our method to make use of the nearby stronger CArG motif to improve sensitivity. Detection of the CArG motif also benefited from the ETS-motif, although to a lesser extent. Sensitivity was further improved in a drastic way by running HeliCis in periodic mode. Both the CArG and the ETS motif benefited considerably from this reduction of the search space. Importantly, this shows that the method is capable of finding weak periodic patterns which are not readily detected using a "sequential" approach, i.e. first detecting single motifs and second analyzing their spacing properties.</p>
         <p>One limitation of our model is that the motifs widths are fixed. Some Gibbs sampling algorithms handle this using an alternative scoring function and restarts using several widths <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> or the "fragmentation algorithm <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>," while others use a fixed width <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B27">27</abbr></abbrgrp>. TF binding sites are usually within the 8&#8211;12 bp range and we have found results to be quite robust to changes in this parameter as long as the motif width is not set too short. Results were nearly identical when HeliCis was applied to the test sets in this paper using a 10 bp motif width instead of the default 12 bp (data not shown).</p>
         <p>HeliCis models the intermotif distance as a variable integer multiple of the period <it>T </it>plus a fixed "phase" (offset) <it>&#966; </it>= 0...<it>T</it>-1. The phase is determined exhaustively by restarting the sampler several times, leading execution time to be proportional to the chosen period. A desirable improvement would be to determine the phase during execution of the algorithm rather than to use restarts. If several periods other than the default 10 bp are to be evaluated, more restarts are required and the algorithm can become computationally demanding. However, the current implementation normally does not cause problems with sequence sets of reasonable size. With 15 400 bp sequences, execution time with the periodic model (10 bp period) is typically around 10 minutes on a low-end processor (Pentium 4 2.4 GHz). The execution time in each iteration theoretically scales linearly with the number of sequences, the total amount of sequence data, the motif length and the maximum motif distance. In practice, as long as each individual sequence is not to long (&lt;1000 bp), the number of sequences is the most important factor (data not shown). Some parameters in the web interface have been slightly limited to avoid overloading the server, but no such limitations are present in the downloadable version.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>HeliCis is a flexible and efficient tool for <it>de novo </it>discovery of colocalized DNA motif pairs. It incorporates structural features such as ordered or unordered colocalization and periodic spacing. Our evaluations show that it can detect weak periodic patterns which cannot be easily discovered by others means. It is available both for local use and through a simple web interface.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p>Project name: HeliCis</p>
         <p>Project home page: <url>http://lymphomics.wall.gu.se/helicis</url></p>
         <p>Operating system: Platform independent</p>
         <p>Programming language: Matlab, C</p>
         <p>License: Free for academic and non-profit researchers. Contact the authors for commercial licensing.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>The functional specification of the method was prepared by EL and PL. The mathematical model and algorithm was designed and implemented by EL. The web interface was implemented by EL. The manuscript was drafted by EL and PM with contributions from PL. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>Prof. Olle Nerman is greatly acknowledged for fruitful discussions during the initial part of the project. The work was partly funded by the European Commission: The Sixth Framework Programme (LSHG-CT-2004-503573).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The Regulatory Genome: Gene Regulatory Networks In Development and Evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Davidson</snm>
                  <fnm>EH</fnm>
               </au>
            </aug>
            <publisher> Academic Press</publisher>
            <pubdate>2006</pubdate>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Information display by transcriptional enhancers</p>
            </title>
            <aug>
               <au>
                  <snm>Kulkarni</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Arnosti</snm>
                  <fnm>DN</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>2003</pubdate>
            <volume>130</volume>
            <issue>26</issue>
            <fpage>6569</fpage>
            <lpage>6575</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1242/dev.00890</pubid>
                  <pubid idtype="pmpid" link="fulltext">14660545</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The enhanceosome and transcriptional synergy</p>
            </title>
            <aug>
               <au>
                  <snm>Carey</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1998</pubdate>
            <volume>92</volume>
            <issue>1</issue>
            <fpage>5</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(00)80893-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">9489694</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Helical repeat of DNA in solution</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1979</pubdate>
            <volume>76</volume>
            <issue>1</issue>
            <fpage>200</fpage>
            <lpage>203</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">382905</pubid>
                  <pubid idtype="pmpid">284332</pubid>
                  <pubid idtype="doi">10.1073/pnas.76.1.200</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Physical interaction of the activator protein-1 factors c-Fos and c-Jun with Cbfa1 for collagenase-3 promoter activation</p>
            </title>
            <aug>
               <au>
                  <snm>D'Alonzo</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Selvamurugan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Karsenty</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Partridge</snm>
                  <fnm>NC</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2002</pubdate>
            <volume>277</volume>
            <issue>1</issue>
            <fpage>816</fpage>
            <lpage>822</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M107082200</pubid>
                  <pubid idtype="pmpid" link="fulltext">11641401</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Smooth muscle alpha-actin CArG elements coordinate formation of a smooth muscle cell-selective, serum response factor-containing activation complex</p>
            </title>
            <aug>
               <au>
                  <snm>Mack</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Lawrenz-Smith</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Owens</snm>
                  <fnm>GK</fnm>
               </au>
            </aug>
            <source>Circ Res</source>
            <pubdate>2000</pubdate>
            <volume>86</volume>
            <issue>2</issue>
            <fpage>221</fpage>
            <lpage>232</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10666419</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>An enhanceosome containing the Jun B/Fra-2 heterodimer and the HMG-I(Y) architectural protein controls HPV 18 transcription</p>
            </title>
            <aug>
               <au>
                  <snm>Bouallaga</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Massicard</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yaniv</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Thierry</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>EMBO Rep</source>
            <pubdate>2000</pubdate>
            <volume>1</volume>
            <issue>5</issue>
            <fpage>422</fpage>
            <lpage>427</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1083764</pubid>
                  <pubid idtype="pmpid" link="fulltext">11258482</pubid>
                  <pubid idtype="doi">10.1093/embo-reports/kvd091</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Lung surfactant protein B promoter function is dependent on the helical phasing, orientation and combinatorial actions of cis-DNA elements</p>
            </title>
            <aug>
               <au>
                  <snm>Alam</snm>
                  <fnm>MN</fnm>
               </au>
               <au>
                  <snm>Berhane</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Boggaram</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2002</pubdate>
            <volume>282</volume>
            <issue>1-2</issue>
            <fpage>103</fpage>
            <lpage>111</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(01)00844-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">11814682</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Regulation of tumor necrosis factor alpha gene expression by mycobacteria involves the assembly of a unique enhanceosome dependent on the coactivator proteins CBP/p300</p>
            </title>
            <aug>
               <au>
                  <snm>Barthel</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tsytsykova</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Barczak</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Tsai</snm>
                  <fnm>EY</fnm>
               </au>
               <au>
                  <snm>Dascher</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Goldfeld</snm>
                  <fnm>AE</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2003</pubdate>
            <volume>23</volume>
            <issue>2</issue>
            <fpage>526</fpage>
            <lpage>533</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">151551</pubid>
                  <pubid idtype="pmpid" link="fulltext">12509451</pubid>
                  <pubid idtype="doi">10.1128/MCB.23.2.526-533.2003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>A novel NF-kappa B-regulated site within the human I gamma 1 promoter requires p300 for optimal transcriptional activity</p>
            </title>
            <aug>
               <au>
                  <snm>Dryer</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Covey</snm>
                  <fnm>LR</fnm>
               </au>
            </aug>
            <source>J Immunol</source>
            <pubdate>2005</pubdate>
            <volume>175</volume>
            <issue>7</issue>
            <fpage>4499</fpage>
            <lpage>4507</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16177093</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Coordinate enhancers share common organizational features in the Drosophila genome</p>
            </title>
            <aug>
               <au>
                  <snm>Erives</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <issue>11</issue>
            <fpage>3851</fpage>
            <lpage>3856</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">374333</pubid>
                  <pubid idtype="pmpid" link="fulltext">15026577</pubid>
                  <pubid idtype="doi">10.1073/pnas.0400611101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Distance preferences in the arrangement of binding motifs and hierarchical levels in organization of transcription regulatory information</p>
            </title>
            <aug>
               <au>
                  <snm>Makeev</snm>
                  <fnm>VJ</fnm>
               </au>
               <au>
                  <snm>Lifanov</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Nazina</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Papatsenko</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>20</issue>
            <fpage>6016</fpage>
            <lpage>6026</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">219477</pubid>
                  <pubid idtype="pmpid" link="fulltext">14530449</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg799</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Periodical distribution of transcription factor sites in promoter regions and connection with chromatin structure</p>
            </title>
            <aug>
               <au>
                  <snm>Ioshikhes</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Trifonov</snm>
                  <fnm>EN</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>MQ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <issue>6</issue>
            <fpage>2891</fpage>
            <lpage>2895</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">15865</pubid>
                  <pubid idtype="pmpid" link="fulltext">10077607</pubid>
                  <pubid idtype="doi">10.1073/pnas.96.6.2891</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Assessing computational tools for the discovery of transcription factor binding sites</p>
            </title>
            <aug>
               <au>
                  <snm>Tompa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bailey</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>De Moor</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Eskin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Favorov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Frith</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Fu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Makeev</snm>
                  <fnm>VJ</fnm>
               </au>
               <au>
                  <snm>Mironov</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Noble</snm>
                  <fnm>WS</fnm>
               </au>
               <au>
                  <snm>Pavesi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pesole</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Regnier</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Simonis</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sinha</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Thijs</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>van Helden</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vandenbogaert</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Weng</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Workman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ye</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2005</pubdate>
            <volume>23</volume>
            <issue>1</issue>
            <fpage>137</fpage>
            <lpage>144</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1053</pubid>
                  <pubid idtype="pmpid" link="fulltext">15637633</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>De novo cis-regulatory module elicitation for eukaryotic genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>20</issue>
            <fpage>7079</fpage>
            <lpage>7084</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1129096</pubid>
                  <pubid idtype="pmpid" link="fulltext">15883375</pubid>
                  <pubid idtype="doi">10.1073/pnas.0408743102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Algorithms for extracting structured motifs using a suffix tree with an application to promoter and regulatory site consensus identification</p>
            </title>
            <aug>
               <au>
                  <snm>Marsan</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Sagot</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2000</pubdate>
            <volume>7</volume>
            <issue>3-4</issue>
            <fpage>345</fpage>
            <lpage>362</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/106652700750050826</pubid>
                  <pubid idtype="pmpid" link="fulltext">11108467</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>A discriminative model for identifying spatial cis-regulatory modules</p>
            </title>
            <aug>
               <au>
                  <snm>Segal</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sharan</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2005</pubdate>
            <volume>12</volume>
            <issue>6</issue>
            <fpage>822</fpage>
            <lpage>834</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/cmb.2005.12.822</pubid>
                  <pubid idtype="pmpid" link="fulltext">16108719</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>CisModule: de novo discovery of cis-regulatory modules by hierarchical mixture modeling</p>
            </title>
            <aug>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <issue>33</issue>
            <fpage>12114</fpage>
            <lpage>12119</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">514443</pubid>
                  <pubid idtype="pmpid" link="fulltext">15297614</pubid>
                  <pubid idtype="doi">10.1073/pnas.0402858101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Decoding human regulatory circuits</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Palumbo</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Wasserman</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>CE</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>10A</issue>
            <fpage>1967</fpage>
            <lpage>1974</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">524421</pubid>
                  <pubid idtype="pmpid" link="fulltext">15466295</pubid>
                  <pubid idtype="doi">10.1101/gr.2589004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>HeliCis website</p>
            </title>
            <url>http://lymphomics.wall.gu.se/helicis</url>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Boguski</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Neuwald</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Wootton</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1993</pubdate>
            <volume>262</volume>
            <issue>5131</issue>
            <fpage>208</fpage>
            <lpage>214</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.8211139</pubid>
                  <pubid idtype="pmpid" link="fulltext">8211139</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The collapsed Gibbs sampler and other issues: with applications to a protein binding problem</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Research Report No R-426, Dept Statistics, Harvard Univ</source>
            <publisher> Harvard Univesity Press</publisher>
            <pubdate>1992</pubdate>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Gibbs Recursive Sampler: finding transcription factor binding sites</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Rouchka</snm>
                  <fnm>EC</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>CE</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>13</issue>
            <fpage>3580</fpage>
            <lpage>3585</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">169014</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824370</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg608</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Recognition of regulatory regions in genomic sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Wingender</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>J Biotechnol</source>
            <pubdate>1994</pubdate>
            <volume>35</volume>
            <issue>2-3</issue>
            <fpage>273</fpage>
            <lpage>280</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0168-1656(94)90041-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">7765063</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Myocardin and ternary complex factors compete for SRF to control smooth muscle gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>DZ</fnm>
               </au>
               <au>
                  <snm>Hockemeyer</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>McAnally</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nordheim</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Olson</snm>
                  <fnm>EN</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>428</volume>
            <issue>6979</issue>
            <fpage>185</fpage>
            <lpage>189</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02382</pubid>
                  <pubid idtype="pmpid" link="fulltext">15014501</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Fitting a mixture model by expectation maximization to discover motifs in biopolymers</p>
            </title>
            <aug>
               <au>
                  <snm>Bailey</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Elkan</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Proc Int Conf Intell Syst Mol Biol</source>
            <pubdate>1994</pubdate>
            <volume>2</volume>
            <fpage>28</fpage>
            <lpage>36</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7584402</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>BioProspector: discovering conserved DNA motifs in upstream regulatory regions of co-expressed genes</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Brutlag</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Pac Symp Biocomput</source>
            <pubdate>2001</pubdate>
            <fpage>127</fpage>
            <lpage>138</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11262934</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Gibbs motif sampling: detection of bacterial outer membrane protein repeats</p>
            </title>
            <aug>
               <au>
                  <snm>Neuwald</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>CE</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>1995</pubdate>
            <volume>4</volume>
            <issue>8</issue>
            <fpage>1618</fpage>
            <lpage>1632</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2143180</pubid>
                  <pubid idtype="pmpid" link="fulltext">8520488</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
