<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>1471-2105-13-222</ui>
	<ji>1471-2105</ji>
	<fm>
		<dochead>Research article</dochead>
		<bibl>
			<title>
				<p>Wavelet-based detection of transcriptional activity on a novel <it>Staphylococcus aureus</it> tiling microarray</p>
			</title>
			<aug>
				<au id="A1" ca="yes"><snm>Segura</snm><fnm>V&#237;ctor</fnm><insr iid="I1"/><email>vsegura@unav.es</email></au>
				<au id="A2"><snm>Toledo-Arana</snm><fnm>Alejandro</fnm><insr iid="I2"/><email>alejandro.toledo@unavarra.es</email></au>
				<au id="A3"><snm>Uzqueda</snm><fnm>Maite</fnm><insr iid="I1"/><email>muzqueda@unav.es</email></au>
				<au id="A4"><snm>Lasa</snm><fnm>I&#241;igo</fnm><insr iid="I2"/><email>ilasa@unavarra.es</email></au>
				<au id="A5"><snm>Mu&#241;oz-Barrutia</snm><fnm>Arrate</fnm><insr iid="I3"/><email>arrmunoz@unav.es</email></au>
			</aug>
			<insg>
				<ins id="I1"><p>Genomics, Proteomics and Bioinformatics Unit, Center for Applied Medical Research, University of Navarra, Pamplona, Spain</p></ins>
				<ins id="I2"><p>Laboratory of Microbial Biofilms, Instituto de Agrobiotecnolog&#237;a, Universidad P&#250;blica de Navarra-Consejo Superior de Investigaciones Cient&#237;ficas-Gobierno de Navarra, Pamplona 31006, Spain</p></ins>
				<ins id="I3"><p>Cancer Imaging Laboratory, Center for Applied Medical Research, University of Navarra, Pamplona, Spain</p></ins>
			</insg>
			<source>BMC Bioinformatics</source>
			<section><title><p>Transcriptome analysis</p></title></section><issn>1471-2105</issn>
			<pubdate>2012</pubdate>
			<volume>13</volume>
			<issue>1</issue>
			<fpage>222</fpage>
			<url>http://www.biomedcentral.com/1471-2105/13/222</url>
			<xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-13-222</pubid><pubid idtype="pmpid">22950634</pubid></pubidlist></xrefbib>
		</bibl>
		<history><rec><date><day>27</day><month>6</month><year>2012</year></date></rec><acc><date><day>16</day><month>8</month><year>2012</year></date></acc><pub><date><day>5</day><month>9</month><year>2012</year></date></pub></history>
		<cpyrt><year>2012</year><collab>Segura et al.; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st><p>High-density oligonucleotide microarray is an appropriate technology for genomic analysis, and is particulary useful in the generation of transcriptional maps, ChIP-on-chip studies and re-sequencing of the genome.Transcriptome analysis of tiling microarray data facilitates the discovery of novel transcripts and the assessment of differential expression in diverse experimental conditions. Although new technologies such as next-generation sequencing have appeared, microarrays might still be useful for the study of small genomes or for the analysis of genomic regions with custom microarrays due to their lower price and good accuracy in expression quantification.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st><p>Here, we propose a novel wavelet-based method, named ZCL (zero-crossing lines), for the combined denoising and segmentation of tiling signals. The denoising is performed with the classical SUREshrink method and the detection of transcriptionally active regions is based on the computation of the Continuous Wavelet Transform (CWT). In particular, the detection of the transitions is implemented as the thresholding of the zero-crossing lines. The algorithm described has been applied to the public <it>Saccharomyces cerevisiae</it> dataset and it has been compared with two well-known algorithms: pseudo-median sliding window (PMSW) and the structural change model (SCM). As a proof-of-principle, we applied the ZCL algorithm to the analysis of the custom tiling microarray hybridization results of a <it>S. aureus</it> mutant deficient in the sigma B transcription factor. The challenge was to identify those transcripts whose expression decreases in the absence of sigma B.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusions</p>
					</st><p>The proposed method archives the best performance in terms of positive predictive value (PPV) while its sensitivity is similar to the other algorithms used for the comparison. The computation time needed to process the transcriptional signals is low as compared with model-based methods and in the same range to those based on the use of filters. Automatic parameter selection has been incorporated and moreover, it can be easily adapted to a parallel implementation. We can conclude that the proposed method is well suited for the analysis of tiling signals, in which transcriptional activity is often hidden in the noise. Finally, the quantification and differential expression analysis of <it>S. aureus</it> dataset have demonstrated the valuable utility of this novel device to the biological analysis of the <it>S. aureus</it> transcriptome.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st><p>The complete deciphering of the information contained in the genome would be helpful to improve our understanding of the biological processes occurring in living organisms. High-density oligonucleotide-based whole-genome microarray is an extensively used technology to detect the expression of all RNA species including protein coding RNAs and non-coding RNAs. It is particularly suitable for the analysis of whole small-sized genomes such as those corresponding to bacteria. For these organisms high resolution can be achieved with the microarrays currently provided by the manufactures.</p><p>Applications of tiling array technology include the generation of transcriptional maps and annotations of genomes, the identification of transcription factor binding sites, the analysis of alternative splicing events, the analysis of methylation states, the discovery of genotyping and polymorphism, and the re-sequentation of genomes <abbrgrp>
					<abbr bid="B1">1</abbr>
				</abbrgrp>.</p><p>The emerging high-throughput next generation DNA sequencing (NGS) technologies <abbrgrp>
					<abbr bid="B2">2</abbr>
				</abbrgrp> have revolutionized transcriptomics by allowing RNA analysis through cDNA sequencing on a massive scale (RNA-seq). Several limitations inherent to microarray technologies are overcome by NGS technologies, in particular, it is not necessary to design appropriate probes and the experimental reproducibility is guaranteed. However, the microarray design presented in <abbrgrp>
					<abbr bid="B3">3</abbr>
				</abbrgrp> allowed a comprehensive examination of gene expression and genome-wide identification of alternative splicing as well as detection of coding and noncoding transcripts. This microarray (Affymetrix GG-H array) was compared with RNA-seq in <abbrgrp>
					<abbr bid="B3">3</abbr>
				</abbrgrp>. The reproducibility in the estimation of gene and exon abundance was high and even more sensitive than RNA-seq at the exon level. This microarray design contains as targets near 50000 highly transcribed fragments of unknown functions from Affymetrix tiling microarray data <abbrgrp>
					<abbr bid="B4">4</abbr>
				</abbrgrp>. The NGS experiments highlighted that 49% of these fragments had uniquely mapped reads, revealing a high degree of concordance between both technologies.</p><p>The analysis of a tiling microarray experiment starts with a two-step process that generates a discrete signal. First, the DNA or RNA samples are hybridized in the custom designed tiling array. Second, for each probe, the raw intensities are converted to a score <abbrgrp>
					<abbr bid="B5">5</abbr>
				</abbrgrp>. The result is a discrete intensity signal with a value per probe.</p><p>The workflow shown in Figure <figr fid="F1">1</figr> summarizes the methodology. It consists of four basic blocks: (1) signal pre-processing (DNA normalization, non-uniform to uniform resampling and de-noising); (2) segmentation to detect abrupt intensity changes; (3) definition of transcriptionally active regions (TARs) and (4) biological knowledge extraction (for example, differential expression analysis of genes).</p>
			<fig id="F1"><title><p>Figure 1</p></title><caption><p>Wavelet-based processing of tiling signals</p></caption><text>
   <p><b>Wavelet-based processing of tiling signals.</b> Workflow for the analysis of the tiling signal based on the computation of the Continuous Wavelet Transform (CWT).</p>
</text><graphic file="1471-2105-13-222-1"/></fig><p>Transcriptome analysis refers to the detection of segments where the noisy tiling signal is constant. The start and end points of these segments correspond to transcript start and end sites. Several approaches have been deployed in the segmentation of tiling signals: pseudo-median or Hodges-Lehmann estimator <abbrgrp>
					<abbr bid="B6">6</abbr>
					<abbr bid="B7">7</abbr>
				</abbrgrp>, local non parametric smoothing <abbrgrp>
					<abbr bid="B8">8</abbr>
					<abbr bid="B9">9</abbr>
				</abbrgrp>, hidden Markov models <abbrgrp>
					<abbr bid="B10">10</abbr>
					<abbr bid="B11">11</abbr>
					<abbr bid="B12">12</abbr>
					<abbr bid="B13">13</abbr>
				</abbrgrp>, circular binary segmentation <abbrgrp>
					<abbr bid="B14">14</abbr>
				</abbrgrp> and structural change model <abbrgrp>
					<abbr bid="B15">15</abbr>
					<abbr bid="B16">16</abbr>
				</abbrgrp>.</p><p>Wavelet analysis using the Discrete Wavelet Transform (DWT) <abbrgrp>
					<abbr bid="B17">17</abbr>
				</abbrgrp> has demonstrated excellent performance in the analysis of ChIP-chip experiments using tiling array technology <abbrgrp>
					<abbr bid="B18">18</abbr>
					<abbr bid="B19">19</abbr>
				</abbrgrp>. In this paper, we propose a wavelet transform based method for the identification of TARs in tiling signals (ZCL). We have chosen the SUREShrink algorithm for denoising and a method based on the computation of the Continuous Wavelet Transform (CWT) for detection of transcription start and end sites. In particular, the sharp transitions of the tiling signal are identified as the zero-crossing lines of a multiresolution decomposition using as the mother wavelet the second derivative of a Gaussian <abbrgrp>
					<abbr bid="B20">20</abbr>
				</abbrgrp>. We applied the proposed analysis to the public <it>Saccharomyces Cerevisiae</it> dataset to validate our analytical approach. The novel identification algorithm was compared with two well-known methods: pseudo-median sliding window (PMSW) and structural change model (SCM). The absence of a biologically validated ground truth to evaluate the resulting segmentations prevent the use of specificity and sensitivity as performance metrics. Consequently, the evaluation has been made in terms of positive predictive value (PPV), sensitivity and computation time. We also evaluated the segmentation quality resulting from the combination of the TARs detected by several of the methods under study.</p><p>We also used this algorithm for the identification of the subset of transcripts whose expression decreases in a <it>S. aureus</it> strain deficient in the sigma B transcription factor. SigB has been shown to be involved in the stress response to different stimuli, the regulation of <it>sarA</it>, <it>sarH1</it>, and <it>agr</it> that control a wide array of virulence factors, biofilm formation, the ability of <it>S. aureus</it> to bind to various host-cell matrix proteins such as fibrinogen and fibronectin, and in the development of resistance to the antibiotics methicillin and teicoplanin <abbrgrp>
					<abbr bid="B21">21</abbr>
					<abbr bid="B22">22</abbr>
					<abbr bid="B23">23</abbr>
					<abbr bid="B24">24</abbr>
					<abbr bid="B25">25</abbr>
					<abbr bid="B26">26</abbr>
				</abbrgrp>.</p><p>We applied the segmentation methods to this high quality dataset and we have demonstrated its usefulness for the analysis of the tiling array derived transcriptome map. The results demonstrate that ZCL not only allows a rapid identification of the transcripst based on the segmentation procedure but also a more accurate estimation of the expression level of each transcript.</p>
		</sec>
		<sec>
			<st>
				<p>Results and discussion</p>
			</st><p>All the steps needed to obtain a trancriptional map from the raw data (read the CEL files, normalize, denoise and segment the tiling signal) have been implemented using the statistical language <it>R/Bioconductor</it>
				<abbrgrp>
					<abbr bid="B27">27</abbr>
				</abbrgrp>. The CRAN packages <it>Rwave</it>, <it>wavethresh</it> and <it>wmtsa</it> have been used for wavelet analysis. All the R functions described are available as Supplementary Material (Additional file <supplr sid="S1">1</supplr>, Additional file <supplr sid="S2">2</supplr> and Additional file <supplr sid="S3">3</supplr>). The R code to perform the example analyses and the generation of the figures included in the paper can also be found as Supplementary Material. The results show that wavelets compare well with the rest of methods in terms of segmentation accuracy and time consumed in the analysis.</p>
			<suppl id="S1">
				<title>
					<p>Additional file 1</p>
				</title>
				<text>
					<p><b>R code: Segmentation and visualization functions.</b> Implemented functions in R language to perform PMSW and SCM segmentation and the proposed wavelet-based method for denoising and segmentation. In addition, functions are provided for proper visualization of data, integration of analysis results and evaluation of the obtained transcriptional maps.</p>
				</text>
				<file name="1471-2105-13-222-S1.r">
   <p>Click here for file</p>
</file>
			</suppl>
			<suppl id="S2">
				<title>
					<p>Additional file 2</p>
				</title>
				<text>
					<p><b>R code: Segmentation analysis of </b><b><it>S. cerevisiae.</it></b> R script for segmentation of the <it>S. cerevisiae</it> dataset and the generation of the figures included in the manuscript.</p>
				</text>
				<file name="1471-2105-13-222-S2.r">
   <p>Click here for file</p>
</file>
			</suppl>
			<suppl id="S3">
				<title>
					<p>Additional file 3</p>
				</title>
				<text>
					<p><b>R code: Segmentation analysis of </b><b><it>S. aureus</it></b><b>).</b> R script for the segmentation of the <it>S. aureus</it> dataset and the generation of the figures included in the manuscript.</p>
				</text>
				<file name="1471-2105-13-222-S3.r">
   <p>Click here for file</p>
</file>
			</suppl>
			<sec>
				<st>
					<p>Experimental datasets</p>
				</st>
				<sec>
					<st>
						<p>Saccharomyces cerevisiae dataset</p>
					</st><p>The dataset is described in <abbrgrp>
							<abbr bid="B16">16</abbr>
						</abbrgrp>. An oligonucleotide array for <it>S. cerevisiae</it> was developed. It contains 6.5 million probes and interrogates both strands of the full genomic sequence with 25-mer probes tiled at an average of 8 nucleotide intervals on each strand and 4 nucleotide tile offset between strands. The first-strand cDNA was synthesized using random primers from poly(A) and total RNA. A set of genomic DNA was also hybridized for normalization purposes <abbrgrp>
							<abbr bid="B15">15</abbr>
						</abbrgrp>. Their analysis of the transcription map identified the transcript boundaries, its structure and the intensity level of coding and non-coding transcripts <abbrgrp>
							<abbr bid="B16">16</abbr>
						</abbrgrp>. All data (CEL files, bmap files for both strands and annotation file) was deposited in ArrayExpress database with accession number E-TABM-14.</p>
				</sec>
				<sec>
					<st>
						<p>Staphylococcus aureus dataset</p>
					</st><p>The <it>Staphylococcus aureus</it> custom tiling microarray (NA-Staph-b520729F) was designed in collaboration with Affymetrix (Santa Clara, CA, USA). Specifically, the microarray (format 49-7875 with 11 <it>&#956;</it>m features) contains a total of 522,406 probes, divided into two parts. The first part corresponds to the tiling array containing a total of 384,932 probes (25-mer), which are further divided into eight sets. The set used in our analysis covers both strands of the <it>S. aureus</it> NCTC 8325 genome (2,821,347 bp covered by 363,127 probes). Each 25-mer probe was tiled each 14-nt across the whole genome, resulting in 11-nt overlaps and a 7-nt tile offset between strands. The microarray design has been deposited in the ArrayExpress Archive at EMBL-EBI (<url>http://www.ebi.ac.uk/microarray-as/ae/</url>), ArrayExpress accesion no. A-AFFY-165.</p><p>Before cDNA synthesis, RNA integrity from each sample was confirmed on Agilent RNA Nano LabChips (Agilent Technologies). 10 <it>&#956;</it>g of RNAs extracted from bacterial strains grown until exponential phase (OD600nm = 0.8) were reverse transcribed using SuperScript II reverse transcriptase (Invitrogen Life Technologies). They were processed following the protocol of the Affymetrix GeneChip Expression Analysis Technical Manual (P/N 702232 Rev. 2) in the presence of 6 ng/ml Actinomycin D to avoid spurious second-strand cDNA synthesis during the reverse transcription reaction <abbrgrp>
							<abbr bid="B28">28</abbr>
						</abbrgrp>. Sense RNA corresponding to B. subtilis poly-A <it>lys</it>, <it>phe</it>, <it>thr</it>, <it>trp</it>, <it>dap</it> genes were spiked into sample RNA as a control for the labeling and hybridization steps. cDNA was digested by DNase I (PIERCE) in 10X DNAse I buffer (USB-Affymetrix) and the size of digestion products was analyzed in the Agilent Bioanalyser 2100 using RNA Nano LabChips to ensure that the fragmentation resulted in a majority of products in the range of 50 to 200 base-pairs. The fragmented cDNA were then biotinylated using terminal deoxynucleotidyl transferase (Promega) and the GeneChip DNA labeling reagent (Affymetrix) following the manufacturer&#8217;s recommendations. Biotinylated cDNA (5 <it>&#956;</it>g per array) were hybridized for 16 hours according to the Affymetrix protocol in a total volume of 200 <it>&#956;</it>l per hybridization chamber. Following incubation, the arrays were washed and stained in the Fluidics station 450 (Affymetrix) using the protocol FS450_0005. The arrays were then scanned using the GeneChip scanner 3000 (Affymetrix). The intensity signals of each probe cell were computed by the GeneChip operating software (GCOS) and stored in cell intensity files (.CEL extension) before preprocessing and analysis. All microarray data described in this study have been deposited in the ArrayExpress Archive at EMBL-EBI (<url>http://www.ebi.ac.uk/microarray-as/ae/</url>), ArrayExpress accesion no. E-MEXP-2778.</p>
				</sec>
			</sec>
			<sec>
				<st>
					<p>Probe annotation and normalization</p>
				</st><p>The annotation of the PM probe sequences was obtained with the alignment to the genome sequence of <it>S. cerevisiae</it> strain S288c (SGD of August 7, 2005) as provided in the package <it>davidTiling</it> of Bioconductor. Available data correspond to 3 replicates of poly(A), 2 replicates of total RNA and 3 replicates of genomic DNA. The CEL files were read and the normalized signals (poly(A) and total RNA) were obtained using Equation 4. The analysis steps (denoising, segmentation and detection of TARs) were performed on the poly(A) signal as it showed an improved hybridization quality <abbrgrp>
						<abbr bid="B16">16</abbr>
					</abbrgrp>. Once the signal is constructed from CEL and annotation files we used <it>tilingArray</it> package functions to obtain equally-spaced samples. Other resampling methods can be applied without loss of generality.</p><p>The annotation files for <it>S. aureus</it> microarray are provided in the ArrayExpress database (A-AFFY-165). The microarrays of the experiment correspond to three replicates of genomic DNA, three replicates of RNA of the 15981 wild-type strain, and three replicates of the <it>sigB</it> deletion. All the preprocessing steps were performed as previosly described for <it>S. cerevisiae</it> dataset.</p>
			</sec>
			<sec>
				<st>
					<p>Denoising</p>
				</st><p>The denoising was evaluated using the signal to noise ratio (SNR), a quantitative measure of its performance. In order to compare the results obtained with those from Huber et al. <abbrgrp>
						<abbr bid="B15">15</abbr>
					</abbrgrp> based on a variance stabilization and normalization transformation, the same definition of SNR was used. We looked at a set of control regions, two positive control regions (pos) within the ORFs of RPN2 and SER33 at coordinates 217860&#8722;220697 and 221078&#8722;222487and two negative control regions (neg) in the background at coordinates 216800&#8722;217700 and 222800&#8722;227000 of <it>S. cerevisiae</it> (see Figure <figr fid="F2">2</figr>). We assumed (as in <abbrgrp>
						<abbr bid="B15">15</abbr>
					</abbrgrp>) that the differences between positive and negative controls give an estimation of the signal level, whereas variations from the mean intensity within each region are due to noise. </p>
				<fig id="F2"><title><p>Figure 2</p></title><caption><p>Signal to noise ratio of different filtering methods</p></caption><text>
   <p><b>Signal to noise ratio of different filtering methods.</b> Portion of the tiling signal used to evaluate the Signal to Noise Ratio (SNR). We consider two signal regions (S) and two noise regions (N). <b>(a)</b> Normalized signal. <b>(b)</b> Denoised signal using Donoho&#8217;s threshold. <b>(c)</b> Denoised signal using the SUREShrink threshold.</p>
</text><graphic file="1471-2105-13-222-2"/></fig><p>The SNR was computed as </p><p>
					<display-formula id="M1">
						<m:math name="1471-2105-13-222-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mtext>SNR</m:mtext>
<m:mo>=</m:mo>
<m:mfrac>
   <m:mrow>
      <m:mi>&#916;&#956;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>&#963;</m:mi>
   </m:mrow>
</m:mfrac>
<m:mo>=</m:mo>
<m:mfrac>
   <m:mrow>
      <m:mn>1</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mi>&#963;</m:mi>
   </m:mrow>
</m:mfrac>
<m:mfenced separators="" open="(" close=")">
   <m:mrow>
      <m:munder>
         <m:mrow>
            <m:mo>&#8721;</m:mo>
         </m:mrow>
         <m:mrow>
            <m:mi>r</m:mi>
            <m:mo>&#8712;</m:mo>
            <m:mtext>pos</m:mtext>
         </m:mrow>
      </m:munder>
      <m:mfrac>
         <m:mrow>
            <m:msub>
               <m:mrow>
                  <m:mi>&#956;</m:mi>
               </m:mrow>
               <m:mrow>
                  <m:mi>r</m:mi>
               </m:mrow>
            </m:msub>
         </m:mrow>
         <m:mrow>
            <m:mo>|</m:mo>
            <m:mtext>pos</m:mtext>
            <m:mo>|</m:mo>
         </m:mrow>
      </m:mfrac>
      <m:mo>&#8722;</m:mo>
      <m:munder>
         <m:mrow>
            <m:mo>&#8721;</m:mo>
         </m:mrow>
         <m:mrow>
            <m:mi>r</m:mi>
            <m:mo>&#8712;</m:mo>
            <m:mtext>neg</m:mtext>
         </m:mrow>
      </m:munder>
      <m:mfrac>
         <m:mrow>
            <m:msub>
               <m:mrow>
                  <m:mi>&#956;</m:mi>
               </m:mrow>
               <m:mrow>
                  <m:mi>r</m:mi>
               </m:mrow>
            </m:msub>
         </m:mrow>
         <m:mrow>
            <m:mo>|</m:mo>
            <m:mtext>neg</m:mtext>
            <m:mo>|</m:mo>
         </m:mrow>
      </m:mfrac>
   </m:mrow>
</m:mfenced>
<m:mo>,</m:mo>
</m:math>
					</display-formula>
				</p><p>with the noise standard deviation <it>&#963;</it>calculated as the average of the differences between 0.975 and 0.025 quantiles of the data within each of the control regions. Namely, </p><p>
					<display-formula id="M2">
						<m:math name="1471-2105-13-222-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mi>&#963;</m:mi>
<m:mo>=</m:mo>
<m:mfrac>
   <m:mrow>
      <m:munder>
         <m:mrow>
            <m:mo>&#8721;</m:mo>
         </m:mrow>
         <m:mrow>
            <m:mi>r</m:mi>
            <m:mo>&#8712;</m:mo>
            <m:mtext>pos</m:mtext>
            <m:mo>,</m:mo>
            <m:mtext>neg</m:mtext>
         </m:mrow>
      </m:munder>
      <m:mo>(</m:mo>
      <m:msubsup>
         <m:mrow>
            <m:mi>Q</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>r</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mn>0</m:mn>
            <m:mi>.</m:mi>
            <m:mn>975</m:mn>
         </m:mrow>
      </m:msubsup>
      <m:mo>&#8722;</m:mo>
      <m:msubsup>
         <m:mrow>
            <m:mi>Q</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>r</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mn>0</m:mn>
            <m:mi>.</m:mi>
            <m:mn>025</m:mn>
         </m:mrow>
      </m:msubsup>
      <m:mo>)</m:mo>
   </m:mrow>
   <m:mrow>
      <m:mo>(</m:mo>
      <m:msubsup>
         <m:mrow>
            <m:mi>Q</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>N</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mn>0</m:mn>
            <m:mi>.</m:mi>
            <m:mn>975</m:mn>
         </m:mrow>
      </m:msubsup>
      <m:mo>&#8722;</m:mo>
      <m:msubsup>
         <m:mrow>
            <m:mi>Q</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>N</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mn>0</m:mn>
            <m:mi>.</m:mi>
            <m:mn>025</m:mn>
         </m:mrow>
      </m:msubsup>
      <m:mo>)</m:mo>
      <m:mo>(</m:mo>
      <m:mo>|</m:mo>
      <m:mtext>pos</m:mtext>
      <m:mo>|</m:mo>
      <m:mo>+</m:mo>
      <m:mo>|</m:mo>
      <m:mtext>neg</m:mtext>
      <m:mo>|</m:mo>
      <m:mo>)</m:mo>
   </m:mrow>
</m:mfrac>
</m:math>
					</display-formula>
				</p><p>where the symbol <it>r</it> counts over the different regions and <it>Q</it>
					<sub>
						<it>N</it>
					</sub> refers to the standard normal distribution <inline-formula>
						<m:math name="1471-2105-13-222-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mi mathvariant="bold-script">N</m:mi>
<m:mo>(</m:mo>
<m:mn>0</m:mn>
<m:mo>,</m:mo>
<m:mn>1</m:mn>
<m:mo>)</m:mo>
</m:math>
					</inline-formula>. Table <tblr tid="T1">1</tblr> shows the SNR of the normalized signal and the wavelet-based denoised signal using Donoho&#8217;s method <abbrgrp>
						<abbr bid="B29">29</abbr>
					</abbrgrp> and the SUREShrink approach <abbrgrp>
						<abbr bid="B30">30</abbr>
					</abbrgrp> in relation to the best SNR obtained in <abbrgrp>
						<abbr bid="B15">15</abbr>
					</abbrgrp>. We observe that the use of wavelets for denoising results in a large increase in the SNR (18.94<it>%</it> with Donoho&#8217;s method and 30.63<it>%</it>with SUREShrink approach), especially when the SUREShrink denoising is applied. This could be due to the elimination of most part of the non-Gaussian noise component and the consequent reduction in the estimated variance. In the rest of the paper, the SUREShrink is the method applied for denoising. </p>
				<table id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>
							<b>Estimated SNR values of the tiling signal shown in Figure</b>
							<figr fid="F3">3</figr>
						</p>
					</caption>
					<tgroup align="left" cols="2">
						<colspec align="left" colname="c1" colnum="1" colwidth="50*"/>
						<colspec align="center" colname="c2" colnum="2" colwidth="50*"/>
						<thead valign="top">
							<row rowsep="1">
								<entry align="center" colname="c1" nameend="c2" namest="c1">
									<p>
										<b>SNR results</b>
									</p>
								</entry>
							</row>
						</thead>
						<tfoot>
							<p>Estimated SNR values of the tiling signal. The normalized and the wavelet denoised signal using Donoho&#8217;s and SUREShrink on which the calculation was performed are shown in Figure <figr fid="F3">3</figr>.</p>
						</tfoot>
						<tbody valign="top">
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>Signal</p>
								</entry>
								<entry align="center" colname="c2">
									<p>SNR</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>Best SNR in <abbrgrp>
											<abbr bid="B15">15</abbr>
										</abbrgrp>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>4.58</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>Normalized signal</p>
								</entry>
								<entry align="center" colname="c2">
									<p>4.28</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>Wavelet denoising (Donoho&#8217;s)</p>
								</entry>
								<entry align="center" colname="c2">
									<p>5.28</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>Wavelet denoising (SUREShrink)</p>
								</entry>
								<entry align="center" colname="c2">
									<p>
										<b>6.17</b>
									</p>
								</entry>
							</row>
						</tbody>
					</tgroup>
				</table>
			</sec>
			<sec>
				<st>
					<p>Segmentation</p>
				</st><p>A descriptive example of the denoising and segmentation for <it>S. cerevisiae</it> is shown in Figure <figr fid="F3">3</figr>. The analysis corresponds to a 140 Kb segment of chromosome 1 from position 20000 to position 160000. The results are given for the three algorithms compared (SCM, PMSW, ZCL). The CWT computation of ZCL used as mother wavelet the second derivative of a Gaussian with 100 scales. Zero-crossing lines were calculated and only those with a length greater than a pre-defined threshold were considered to correspond to signal transitions.</p>
				<fig id="F3"><title><p>Figure 3</p></title><caption><p>Wavelet-based segmentation of <it>S. cerevisiae</it> tiling signal</p></caption><text>
   <p><b>Wavelet-based segmentation of </b><b><it>S. cerevisiae </it></b><b>tiling signal.</b> Visualization of <it>S. cerevisiae</it> tiling microarray signal along 140 kb of chromosome 1. Each dot corresponds to a probe of the forward strand (top) and reverse strand (bottom). The superimposed pulse signal represents the segmentation obtained using the different methods (PMSW, SCM, ZCL). The parameters of the different analysis are described in the Results and discussion section of the manuscript.</p>
</text><graphic file="1471-2105-13-222-3"/></fig><p>The TAR start and end positions were defined as the transition locations for which the difference between the mean intensity of neighboring segments is greater than 10% of the dynamic range of the tiling signal. Moreover, the inspection of the intensity histogram of chromosome 1 forward strand was used to set the minimum normalized transcription level value to &#8722;2. The same parameters were adopted to process the other strands of the organism. The R function <it>segmentZCL</it> (provided as Supplementary Material) implements the whole segmentation procedure.</p><p>Another representative example of segmentation results is given in Figure <figr fid="F4">4</figr>. In this case, the <it>S. aureus</it> signal of 15981 and <it>sigB</it> mutant were segmented using ZCL and fixing only the number of scales to 100. No denoising was applied prior to the segmentation. The figure represent a fragment of the signals from position 2.1 Mb to 2.3 Mb. In Additional file <supplr sid="S4">4</supplr>: Figure S1 and Additional file <supplr sid="S5">5</supplr>: Figure S2, the results for the PMSW algorithm are shown. In Additional file <supplr sid="S6">6</supplr>: Figure S3 and Additional file <supplr sid="S7">7</supplr>: Figure S4, the results for the SCM algorithm are presented.</p>
				<suppl id="S4">
					<title>
						<p>Additional file 4</p>
					</title>
					<text>
						<p><b>Figure S1.</b> Visualization of <it>S. aureus</it> tiling microarray signal along 200 kb of NCTC8325 genome. Each dot corresponds to a probe of the forward strand (top) and reverse strand (bottom) for NCTC8325 wild-type normalized signal. Superimposed pulse signal represents the segmentation obtained using PMSW method. The parameters of the analysis are described in the Results and discussion section of the manuscript.</p>
					</text>
					<file name="1471-2105-13-222-S4.pdf">
   <p>Click here for file</p>
</file>
				</suppl>
				<suppl id="S5">
					<title>
						<p>Additional file 5</p>
					</title>
					<text>
						<p><b>Figure S2.</b> Visualization of <it>S. aureus</it> tiling microarray signal along 200 kb of NCTC8325 genome. Each dot corresponds to a probe of the forward strand (top) and reverse strand (bottom) for sigmaB mutant normalized signal. Superimposed pulse signal represents the segmentation obtained using PMSW method. The parameters of the analysis are described in the Results and discussion section of the manuscript.</p>
					</text>
					<file name="1471-2105-13-222-S5.pdf">
   <p>Click here for file</p>
</file>
				</suppl>
				<suppl id="S6">
					<title>
						<p>Additional file 6</p>
					</title>
					<text>
						<p><b>Figure S3.</b> Visualization of <it>S. aureus</it> tiling microarray signal along 200 kb of NCTC8325 genome. Each dot corresponds to a probe of the forward strand (top) and reverse strand (bottom) for NCTC8325 wild-type normalized signal. Superimposed pulse signal represents the segmentation obtained using SCM method. The parameters of the analysis are described in the Results and discussion section of the manuscript.</p>
					</text>
					<file name="1471-2105-13-222-S6.pdf">
   <p>Click here for file</p>
</file>
				</suppl>
				<suppl id="S7">
					<title>
						<p>Additional file 7</p>
					</title>
					<text>
						<p><b>Figure S4.</b> Visualization of <it>S. aureus</it> tiling microarray signal along 200 kb of NCTC8325 genome. Each dot corresponds to a probe of the forward strand (top) and reverse strand (bottom) for sigmaB mutant normalized signal. Superimposed pulse signal represents the segmentation obtained using SCM method. The parameters of the analysis are described in the Results and discussion section of the manuscript.</p>
					</text>
					<file name="1471-2105-13-222-S7.pdf">
   <p>Click here for file</p>
</file>
				</suppl>
				<fig id="F4"><title><p>Figure 4</p></title><caption><p>Wavelet-based segmentation of <it>S. aureus</it> tiling signal</p></caption><text>
   <p><b>Wavelet-based segmentation of </b><b><it>S. aureus </it></b><b>tiling signal.</b> Visualization of <it>S. aureus</it> tiling microarray signal along 200 kb of NCTC8325 genome. Each dot corresponds to a probe of the forward strand (top) and reverse strand (bottom) for <it>sigB</it> mutant and 15981 wild-type normalized signals. The superimposed pulse signal represents the segmentation obtained using ZCL. The parameters of the analysis are described in the Results and discussion section of the manuscript.</p>
</text><graphic file="1471-2105-13-222-4"/></fig>
			</sec>
			<sec>
				<st>
					<p>Segmentations comparison using <it>S. cerevisiae</it> dataset</p>
				</st><p>The results from the ZCL segmentation were compared to those obtained with PSMW <abbrgrp>
						<abbr bid="B6">6</abbr>
					</abbrgrp> and SCM <abbrgrp>
						<abbr bid="B15">15</abbr>
					</abbrgrp>. The robust PMSW method is based on the calculation of a pseudo-median within a sliding window. The local expression level is computed with the Hodges-Lehmann estimator <abbrgrp>
						<abbr bid="B31">31</abbr>
					</abbrgrp> on the RNA normalized signal. To be able to do this, the <it>Tilescope</it> pipeline <abbrgrp>
						<abbr bid="B7">7</abbr>
					</abbrgrp> was implemented. Once the candidate transcript regions were determined, the TARs were assembled by the combination of a normalization intensity threshold and a max-gap and min-run criteria. The former is defined as the maximum distance below which two adjacent transcribed probes are included in the same TAR. The later as the minimum length of a feature to be classified as a transcribed region.</p><p>Huber&#8217;s method is based on the structural change model (SCM). The SCM model <abbrgrp>
						<abbr bid="B15">15</abbr>
						<abbr bid="B16">16</abbr>
					</abbrgrp> is used in econometrics for the modeling of sharp transitions in financial time series. It has been applied to the segmentation of comparative genomics hybridization (CGH) data <abbrgrp>
						<abbr bid="B32">32</abbr>
					</abbrgrp>. The signal is modeled as a piecewise constant function of chromosomal coordinates described using the segment boundaries, the maximum number of segments and the mean signal value for each segment. The method is applied independently to each chromosome and, if the signal is strand-specific, to each of its two strands. A dynamic programming algorithm part of the <it>tilingArray</it> package of Bioconductor computes a globally optimal set of parameters for segmentations of increasing number of segments.</p><p>Due to the lack of a biologically validated ground truth to evaluate the outputs, we compared the methods in terms of two metrics, sensitivity and positive predictive value (PPV) at probe-level. We define sensitivity as the number of probes in the detected TARs that overlap with annotated regions (true positives, <it>TP</it>) divided by the total number of probes in the annotated regions (sum of true positives and false negatives, <it>TP</it> + <it>FN</it>): Sensitivity=<it>TP</it>/(<it>TP</it> + <it>FN</it>). The PPV is defined as the number of probes in the detected TARs that overlap with annotated regions (<it>TP</it>) divided by the total number of probes in the detected TARs (sum of true and false positives, <it>TP</it> + <it>FP</it>): <it>PPV</it>=<it>TP</it>/(<it>TP</it> + <it>FP</it>). A sensitivity of 100% is not expected since in any given tissue or cell line at any given experimental condition, not all known genes will be expressed. Also, a PPV of 100% is not expected since an accurate and complete gene annotation is not available <abbrgrp>
						<abbr bid="B5">5</abbr>
						<abbr bid="B33">33</abbr>
					</abbrgrp>.</p><p>The PMSW and SCM methods were applied to the <it>S. cerevisiae</it> using the parameters previously reported in the literature <abbrgrp>
						<abbr bid="B6">6</abbr>
						<abbr bid="B15">15</abbr>
					</abbrgrp>. In particular, the PMSW method used a bandwidth size (BW) of 3, a normalized intensity threshold equal to &#8722;2, a separation between the probes in a TAR (maxgap) of 10 and a minimum acceptable TAR size (minrun) of 90. The maximum number of segments for the SCM method was fixed to 1500. In the case of ZCL we selected the following parameters: 100 wavelet scales, minimum TAR size of 10 and minimum value of normalized intensity transcription equal to &#8722;2. The zero-crossing line length threshold was computed based on the histogram of line lengths.</p><p>The graphical representation of the results obtained after processing the <it>S. cerevisiae</it> tiling signal are shown in Figure <figr fid="F5">5</figr>. For each chromosome we calculated the number of detected TARs, PPV, sensitivity and computation time for the forward and reverse strands. Table <tblr tid="T2">2</tblr> presents the performance metrics mean value. The segmentation with PMSW includes a larger number of detected TARs. The highest PPV values are obtained with the ZCL method (with or without denoising) at the cost of a slightly reduced sensitivity. PMSW and ZCL outperform SCM in term of computation time. The best sensitivity value corresponds to PMSW.</p>
				<fig id="F5"><title><p>Figure 5</p></title><caption><p>Results for the identification of TARs</p></caption><text>
   <p><b>Results for the identification of TARs.</b> Number of detected TARs, probe-level PPV and sensitivity, and computation time for the proposed (solid line), PMSW (dashed line) and SCM (dotted line) methods. The analysis was performed for the forward (left) and the reverse (right) strands of all chromosomes of <it>S. cerevisiae</it> tiling microarray data.</p>
</text><graphic file="1471-2105-13-222-5"/></fig>
				<table id="T2">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>
							<b>Evaluation metrics for </b><b>
								<it>S. cerevisiae </it>
							</b><b>dataset</b>
						</p>
					</caption>
					<tgroup align="left" cols="5">
						<colspec align="left" colname="c1" colnum="1" colwidth="1*"/>
						<colspec align="center" colname="c2" colnum="2" colwidth="1*"/>
						<colspec align="center" colname="c3" colnum="3" colwidth="1*"/>
						<colspec align="center" colname="c4" colnum="4" colwidth="1*"/>
						<colspec align="center" colname="c5" colnum="5" colwidth="1*"/>
						<thead valign="top">
							<row rowsep="1">
								<entry align="center" colname="c1" nameend="c5" namest="c1">
									<p>
										<b>Evaluation metrics (</b><b><it>S. cerevisiae</it></b><b>)</b>
									</p>
								</entry>
							</row>
						</thead>
						<tfoot>
							<p>Mean number of detected TARs, probe-level PPV, probe-level sensitivity and computational time for PMSW, SCM and ZCL methods (all chomosome and strands of <it>S. cerevisiae</it>).</p>
						</tfoot>
						<tbody valign="top">
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>
										<b>Method</b>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>
										<b>TARs</b>
									</p>
								</entry>
								<entry align="center" colname="c3">
									<p>
										<b>PPV</b>
									</p>
								</entry>
								<entry align="center" colname="c4">
									<p>
										<b>Sensitivity</b>
									</p>
								</entry>
								<entry align="center" colname="c5">
									<p>
										<b>Time (min)</b>
									</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>PMSW</p>
								</entry>
								<entry align="center" colname="c2">
									<p>22114</p>
								</entry>
								<entry align="center" colname="c3">
									<p>0.7416</p>
								</entry>
								<entry align="center" colname="c4">
									<p>
										<b>0.4700</b>
									</p>
								</entry>
								<entry align="center" colname="c5">
									<p>2.88</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>SCM</p>
								</entry>
								<entry align="center" colname="c2">
									<p>11246</p>
								</entry>
								<entry align="center" colname="c3">
									<p>0.7847</p>
								</entry>
								<entry align="center" colname="c4">
									<p>0.3904</p>
								</entry>
								<entry align="center" colname="c5">
									<p>79.09</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>ZCL</p>
								</entry>
								<entry align="center" colname="c2">
									<p>18209</p>
								</entry>
								<entry align="center" colname="c3">
									<p>0.8486</p>
								</entry>
								<entry align="center" colname="c4">
									<p>0.3760</p>
								</entry>
								<entry align="center" colname="c5">
									<p>13.02</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>ZCLSure</p>
								</entry>
								<entry align="center" colname="c2">
									<p>22513</p>
								</entry>
								<entry align="center" colname="c3">
									<p>
										<b>0.8547</b>
									</p>
								</entry>
								<entry align="center" colname="c4">
									<p>0.3686</p>
								</entry>
								<entry align="center" colname="c5">
									<p>10.70</p>
								</entry>
							</row>
						</tbody>
					</tgroup>
				</table><p>In-depth analysis of chromosome 1 gives interesting insights into concerning the relationship between methods. In the forward strand, the number of probes annotated as genes is 12796 representing 19.35% of the total number of probes. 65.16% of probes are correctly classified by the three algorithms (12.21% of gene probes and 52.95% of non-gene probes). From the annotated probes, 63.12% are detected by all methods, while only 11.85% of the probes are not detected by either of them. This means that 88.15% of the annotated probes are detected by at least one of the methods. The reverse strand contains 11866 annotated probes (17.90% of probes located in this strand), from which 19.74% are considered part of a TAR by all the methods and 28.35% are true negative probes. In this strand, 51.41% of the annotated probes are included in a TAR by any method while only 14.35% are never detected. In other words, 85.65% of the probes in the strand are detected by at least one algorithm. In light of this outcome, we considered it worthwhile to evaluate if the combination of results computed with the different methods would improve the performance of the segmentation.</p>
				<sec>
					<st>
						<p>Combination of TAR probes candidates</p>
					</st><p>We evaluated the improvement in performance obtained by the combination of the different segmentations. We chose different strategies to define the sets (intersection of two or three methods and majority voting system). After a decision is taken on the candidates, TARs are constructed to create the transcriptional map. In Table <tblr tid="T3">3</tblr>, we give evaluation metrics (PPV and sensitivity) for both strands of <it>S. cerevisiae</it> chromosome 1. As individual methods, ZCL gives the best PPV and the best sensitivity for the reverse strand. The best performing combination considering a compromise between PPV and sensitivity is given by the majority voting system.</p>
					<table id="T3">
						<title>
							<p>Table 3</p>
						</title>
						<caption>
							<p>
								<b>Evaluation of segmentation combinations for both strand of chromosome 1</b>
							</p>
						</caption>
						<tgroup align="left" cols="5">
							<colspec align="left" colname="c1" colnum="1" colwidth="1*"/>
							<colspec align="center" colname="c2" colnum="2" colwidth="1*"/>
							<colspec align="center" colname="c3" colnum="3" colwidth="1*"/>
							<colspec align="center" colname="c4" colnum="4" colwidth="1*"/>
							<colspec align="center" colname="c5" colnum="5" colwidth="1*"/>
							<thead valign="top">
								<row rowsep="1">
									<entry align="center" colname="c1" nameend="c5" namest="c1">
										<p>
											<b>Integrative transcriptional analysis</b>
										</p>
									</entry>
								</row>
							</thead>
							<tfoot>
								<p>PPV and sensitivity for both strand of chromosome 1 using individual TAR detection algorithms and the combination of their results.</p>
							</tfoot>
							<tbody valign="top">
								<row rowsep="1">
									<entry align="left" colname="c1">
										<p>
											<b>Method</b>
										</p>
									</entry>
									<entry align="center" colname="c2">
										<p>
											<b>PPV Forward</b>
										</p>
									</entry>
									<entry align="center" colname="c3">
										<p>
											<b>Sensitivity Forward</b>
										</p>
									</entry>
									<entry align="center" colname="c4">
										<p>
											<b>PPV Reverse</b>
										</p>
									</entry>
									<entry align="center" colname="c5">
										<p>
											<b>Sensitivity Reverse</b>
										</p>
									</entry>
								</row>
								<row rowsep="1">
									<entry align="left" colname="c1">
										<p>PMSW</p>
									</entry>
									<entry align="center" colname="c2">
										<p>0.6511</p>
									</entry>
									<entry align="center" colname="c3">
										<p>0.5873</p>
									</entry>
									<entry align="center" colname="c4">
										<p>0.5811</p>
									</entry>
									<entry align="center" colname="c5">
										<p>0.2073</p>
									</entry>
								</row>
								<row rowsep="1">
									<entry align="left" colname="c1">
										<p>SCM</p>
									</entry>
									<entry align="center" colname="c2">
										<p>0.7188</p>
									</entry>
									<entry align="center" colname="c3">
										<p>0.4390</p>
									</entry>
									<entry align="center" colname="c4">
										<p>0.6968</p>
									</entry>
									<entry align="center" colname="c5">
										<p>0.2146</p>
									</entry>
								</row>
								<row rowsep="1">
									<entry align="left" colname="c1">
										<p>ZCL</p>
									</entry>
									<entry align="center" colname="c2">
										<p>
											<b>0.8675</b>
										</p>
									</entry>
									<entry align="center" colname="c3">
										<p>0.3821</p>
									</entry>
									<entry align="center" colname="c4">
										<p>
											<b>0.8220</b>
										</p>
									</entry>
									<entry align="center" colname="c5">
										<p>
											<b>0.2208</b>
										</p>
									</entry>
								</row>
								<row rowsep="1">
									<entry align="left" colname="c1">
										<p>PMSW &#8898; SCM &#8898; ZCL</p>
									</entry>
									<entry align="center" colname="c2">
										<p>0.6312</p>
									</entry>
									<entry align="center" colname="c3">
										<p>
											<b>0.5984</b>
										</p>
									</entry>
									<entry align="center" colname="c4">
										<p>0.5441</p>
									</entry>
									<entry align="center" colname="c5">
										<p>0.2030</p>
									</entry>
								</row>
								<row rowsep="1">
									<entry align="left" colname="c1">
										<p>PMSW &#8898; ZCL</p>
									</entry>
									<entry align="center" colname="c2">
										<p>0.6448</p>
									</entry>
									<entry align="center" colname="c3">
										<p>0.5921</p>
									</entry>
									<entry align="center" colname="c4">
										<p>0.5744</p>
									</entry>
									<entry align="center" colname="c5">
										<p>0.2076</p>
									</entry>
								</row>
								<row rowsep="1">
									<entry align="left" colname="c1">
										<p>PMSW &#8898; SCM</p>
									</entry>
									<entry align="center" colname="c2">
										<p>0.6370</p>
									</entry>
									<entry align="center" colname="c3">
										<p>0.5957</p>
									</entry>
									<entry align="center" colname="c4">
										<p>0.5504</p>
									</entry>
									<entry align="center" colname="c5">
										<p>0.2043</p>
									</entry>
								</row>
								<row rowsep="1">
									<entry align="left" colname="c1">
										<p>SCM &#8898; ZCL</p>
									</entry>
									<entry align="center" colname="c2">
										<p>0.7053</p>
									</entry>
									<entry align="center" colname="c3">
										<p>0.4409</p>
									</entry>
									<entry align="center" colname="c4">
										<p>0.6626</p>
									</entry>
									<entry align="center" colname="c5">
										<p>0.2116</p>
									</entry>
								</row>
								<row rowsep="1">
									<entry align="left" colname="c1">
										<p>Majority voting</p>
									</entry>
									<entry align="center" colname="c2">
										<p>0.7247</p>
									</entry>
									<entry align="center" colname="c3">
										<p>0.4396</p>
									</entry>
									<entry align="center" colname="c4">
										<p>0.6993</p>
									</entry>
									<entry align="center" colname="c5">
										<p>0.2164</p>
									</entry>
								</row>
							</tbody>
						</tgroup>
					</table>
				</sec>
				<sec>
					<st>
						<p>Computational performance</p>
					</st><p>
						<it>S. cerevisiae</it> analyses were executed in an Intel(R) Xeon(R) processor server (64 bits, 4 cores, 2 GHz) with 32 Gb installed memory running Red Hat Enterprise Linux AS release 4 and R 2.13.0. Computing times needed to process each chromosome strand with the described methods are shown in Figure <figr fid="F5">5</figr>. For the same signal length, longer computation time is required for SCM, while comparable times are needed for PMSW and ZCL. The mean time to segment the whole transcriptome is 2.88 mins for PMSW, 13.02 mins for ZCL and 79.09 mins for SCM.</p>
				</sec>
			</sec>
			<sec>
				<st>
					<p>Differential expression analysis of <it>S. aureus sigmaB</it> mutant</p>
				</st><p>Comparative segmentation analysis using ZCL and PMSW and SCM algorithms was applied for the hybridization data obtained with a custom designed Affymetrix tiling array of <it>S. aureus</it>. Segmentation results for <it>S. aureus</it> are summarized in Table <tblr tid="T4">4</tblr>. In this case, the performance measures are almost identical for all methods. These results suggest that the performance of the methods depends on the quality of the signals, decreasing for PMSW and SCM algorithms as the SNR of the signal get worse. In spite of this, other advantages such as computation time, automatic selection of parameters and the possibility of parallel computation makes ZCL our preferred option to segment tiling signals.</p>
				<table id="T4">
					<title>
						<p>Table 4</p>
					</title>
					<caption>
						<p>
							<b>Evaluation metrics for S. aureus dataset</b>
						</p>
					</caption>
					<tgroup align="left" cols="6">
						<colspec align="left" colname="c1" colnum="1" colwidth="1*"/>
						<colspec align="center" colname="c2" colnum="2" colwidth="1*"/>
						<colspec align="center" colname="c3" colnum="3" colwidth="1*"/>
						<colspec align="center" colname="c4" colnum="4" colwidth="1*"/>
						<colspec align="center" colname="c5" colnum="5" colwidth="1*"/>
						<colspec align="center" colname="c6" colnum="6" colwidth="1*"/>
						<thead valign="top">
							<row rowsep="1">
								<entry align="center" colname="c1" nameend="c6" namest="c1">
									<p>
										<b>Evaluation metrics (S. aureus)</b>
									</p>
								</entry>
							</row>
						</thead>
						<tfoot>
							<p>Mean number of detected TARs, probe-level PPV, probe-level sensitivity and computational time for PMSW, SCM and ZCL methods.</p>
						</tfoot>
						<tbody valign="top">
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>
										<b>Tiling Signal</b>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>
										<b>Metric</b>
									</p>
								</entry>
								<entry align="center" colname="c3">
									<p>
										<b>PMSW</b>
									</p>
								</entry>
								<entry align="center" colname="c4">
									<p>
										<b>SCM</b>
									</p>
								</entry>
								<entry align="center" colname="c5">
									<p>
										<b>ZCL </b>
									</p>
								</entry>
								<entry align="center" colname="c6">
									<p>
										<b>ZCLSure </b>
									</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>WT Forward</p>
								</entry>
								<entry align="center" colname="c2">
									<p>PPV</p>
								</entry>
								<entry align="center" colname="c3">
									<p>0.6298</p>
								</entry>
								<entry align="center" colname="c4">
									<p>
										<b>0.6498</b>
									</p>
								</entry>
								<entry align="center" colname="c5">
									<p>0.6248</p>
								</entry>
								<entry align="center" colname="c6">
									<p>0.6407</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>WT Forward</p>
								</entry>
								<entry align="center" colname="c2">
									<p>Sens</p>
								</entry>
								<entry align="center" colname="c3">
									<p>0.8657</p>
								</entry>
								<entry align="center" colname="c4">
									<p>
										<b>0.8766</b>
									</p>
								</entry>
								<entry align="center" colname="c5">
									<p>0.8715</p>
								</entry>
								<entry align="center" colname="c6">
									<p>0.8719</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>WT Reverse</p>
								</entry>
								<entry align="center" colname="c2">
									<p>PPV</p>
								</entry>
								<entry align="center" colname="c3">
									<p>0.6867</p>
								</entry>
								<entry align="center" colname="c4">
									<p>0.6993</p>
								</entry>
								<entry align="center" colname="c5">
									<p>
										<b>0.7050</b>
									</p>
								</entry>
								<entry align="center" colname="c6">
									<p>0.6989</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>WT Reverse</p>
								</entry>
								<entry align="center" colname="c2">
									<p>Sens</p>
								</entry>
								<entry align="center" colname="c3">
									<p>0.8506</p>
								</entry>
								<entry align="center" colname="c4">
									<p>
										<b>0.8560</b>
									</p>
								</entry>
								<entry align="center" colname="c5">
									<p>0.8388</p>
								</entry>
								<entry align="center" colname="c6">
									<p>0.8535</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>
										<it>&#916;</it>sigB Forward</p>
								</entry>
								<entry align="center" colname="c2">
									<p>PPV</p>
								</entry>
								<entry align="center" colname="c3">
									<p>0.6238</p>
								</entry>
								<entry align="center" colname="c4">
									<p>
										<b>0.6388</b>
									</p>
								</entry>
								<entry align="center" colname="c5">
									<p>0.6227</p>
								</entry>
								<entry align="center" colname="c6">
									<p>0.6308</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>
										<it>&#916;</it>sigB Forward</p>
								</entry>
								<entry align="center" colname="c2">
									<p>Sens</p>
								</entry>
								<entry align="center" colname="c3">
									<p>
										<b>0.9054</b>
									</p>
								</entry>
								<entry align="center" colname="c4">
									<p>0.9035</p>
								</entry>
								<entry align="center" colname="c5">
									<p>0.9027</p>
								</entry>
								<entry align="center" colname="c6">
									<p>0.9036</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>
										<it>&#916;</it>sigB Reverse</p>
								</entry>
								<entry align="center" colname="c2">
									<p>PPV</p>
								</entry>
								<entry align="center" colname="c3">
									<p>0.6664</p>
								</entry>
								<entry align="center" colname="c4">
									<p>
										<b>0.6815</b>
									</p>
								</entry>
								<entry align="center" colname="c5">
									<p>0.6765</p>
								</entry>
								<entry align="center" colname="c6">
									<p>0.6748</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="left" colname="c1">
									<p>
										<it>&#916;</it>sigB Reverse</p>
								</entry>
								<entry align="center" colname="c2">
									<p>Sens</p>
								</entry>
								<entry align="center" colname="c3">
									<p>
										<b>0.8697</b>
									</p>
								</entry>
								<entry align="center" colname="c4">
									<p>0.8684</p>
								</entry>
								<entry align="center" colname="c5">
									<p>0.8667</p>
								</entry>
								<entry align="center" colname="c6">
									<p>0.8515</p>
								</entry>
							</row>
						</tbody>
					</tgroup>
				</table><p>The most frequent transcriptional analysis is the detection of genes that have changed their expression in the conditions under study (differential expression analysis). As sigma B affects the expression of more than one hundred genes, we decided to test whether it is possible to use the intensity of all the probes included in each detected TAR with the ZCL segmentation procedure to calculate the expression level of the transcript in a particular environmental condition. In order to carry out this analysis using tiling microarrays we need to compress the intensity of all the probes included in each detected TAR into one value. Standard methods for microarray normalization can be applied, for example RMA (Robust Multichip Average) algorithm in the case of Affymetrix microarrays <abbrgrp>
						<abbr bid="B34">34</abbr>
					</abbrgrp>. This processing can be performed using the packages <it>affxparser</it>, <it>affy</it> and <it>limma</it> of <it>Bioconductor</it> for CDF (chip definition file) generation, normalization and differential expression analysis.</p><p>We introduced a simple analytical tool to be used independently of the microarray platform to measure the gene expression level based on the median value of the TAR probe intensities. We calculated this value for each wild-type and sigmaB mutant sample. We applied a statistical analysis (t-test) to obtained the p-value associated with the expression change taking into account the biological variability of the samples. Considering well-defined TARs in the <it>S. aureus</it> annotation, we found previously described alterations in several genes <abbrgrp>
						<abbr bid="B21">21</abbr>
					</abbrgrp>. In Figure <figr fid="F6">6</figr>, we show the boxplots that represents these expression level changes. We confirmed the down-regulation of sigB and other <it>&#963;</it>
					<sup>
						<it>B</it>
					</sup>-regulated genes, as the alkaline shock protein 23 (<it>asp23</it>) <abbrgrp>
						<abbr bid="B22">22</abbr>
						<abbr bid="B23">23</abbr>
					</abbrgrp> and lysine-specific permease (<it>lysP</it>) <abbrgrp>
						<abbr bid="B21">21</abbr>
					</abbrgrp>, although the latter is not statistically significant (p &gt; 0.05). We also found genes up-regulated in the sigmaB mutant, as the staphylococcal nuclease (<it>nuc</it>) <abbrgrp>
						<abbr bid="B23">23</abbr>
						<abbr bid="B24">24</abbr>
					</abbrgrp>, the zinc metalloprotease aureolysin (<it>aur</it>) <abbrgrp>
						<abbr bid="B24">24</abbr>
						<abbr bid="B25">25</abbr>
					</abbrgrp> and the <it>&#945;</it>-hemolysin (<it>hla</it>) <abbrgrp>
						<abbr bid="B24">24</abbr>
						<abbr bid="B26">26</abbr>
					</abbrgrp>, the latter without a statistically significant p-value. </p>
				<fig id="F6"><title><p>Figure 6</p></title><caption><p>Differential expression analysis of <it>S. aureus sigB</it> mutant</p></caption><text>
   <p><b>Differential expression analysis of </b><b><it>S. aureus sigB </it></b><b>mutant.</b> Boxplots of median gene expression intensities. The expression of the selected genes has been previously reported to change in response to sigB repression.</p>
</text><graphic file="1471-2105-13-222-6"/></fig>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Conclusions</p>
			</st><p>Transcriptomics is a powerful technology for the study of gene structures and RNA-based regulation in any organism. Genome-wide transcriptome analysis of prokaryotes can be carried out with any of these two techniques: RNA-seq and genomic tiling arrays <abbrgrp>
					<abbr bid="B35">35</abbr>
				</abbrgrp>. High-resolution tiling arrays have been used, among others, to study the transcriptomes of <it>Caulobacter crescentus</it>
				<abbrgrp>
					<abbr bid="B36">36</abbr>
				</abbrgrp>, <it>Escherichia coli</it> and <abbrgrp>
					<abbr bid="B37">37</abbr>
				</abbrgrp>, <it>Listeria monocytogenes</it>
				<abbrgrp>
					<abbr bid="B38">38</abbr>
				</abbrgrp>.</p><p>In this paper, we propose a combined WT-based method for the denoising and segmentation of tiling signals. For illustrative and evaluative purposes, we applied the proposed analysis to the public <it>S. cerevisiae</it>. Our denoising results show an increase in the SNR of the filtered signal with respect to Huber&#8217;s method <abbrgrp>
					<abbr bid="B15">15</abbr>
				</abbrgrp>. We believe it is advisable to properly denoise the tiling signal before segmentation as the number of false positives induced by signal variability is thus reduced. Even when constructing a manual segmentation, it seems an advisable choice to mark the transitions on the denoised signal, as its improved quality could help the expert to better discriminate between low expression transcripts and noise.</p><p>Our segmentation algorithm (ZCL) calculates all the possible break points based on the zero-crossing lines of the second derivative of the Gaussian wavelet. The results show that our method achieves the best compromise between accuracy (evaluated in terms of PPV and sensitivity) and computation time. The R code provided can be used to apply our algorithm as well as to combine the resulting segmentation with other methods as PMSW and SCM.</p><p>We also designed a new tiling microarray for the analysis of <it>S. aureus</it> genome, publicly available in the ArrayExpress database (accession number A-AFFY-165). This platform has been used for the comparison of the gene expression pattern of the <it>S. aureus</it> 15981 wild type and its isogenic <it>sigB</it> mutant. We selected this mutant because it is one of the most study staphylococcal regulatory factors and consequenctly it was a useful gold standard to compare the accuracy of our algorithms. The relevance of the segmentation results comes from the fact that a correct analysis of the tiling signals could improve the matching between the probes and the corresponding transcriptional units. In particular, it could help to more precisely localize the start and end transcription sites or even, include units that are not annotated in the current genome definition.</p><p>Once the TARs are properly detected, differentially expressed transcripts can be identified by well-known methods (such as Linear Models for Microarray Data (LIMMA) <abbrgrp>
					<abbr bid="B39">39</abbr>
				</abbrgrp>) with a previous probe summarization algorithm to generate the transcript annotation (using, for example, Robust Microarray Analysis (RMA) <abbrgrp>
					<abbr bid="B34">34</abbr>
				</abbrgrp>). In practice, this means that differential gene expression analysis could benefit from an enhanced analysis of tiling signals such as the one proposed here. To confirm the accuracy of the proposed method, we introduced a simple measure based on the median of TAR probe intensity. Using this approach, known up-regulated (<it>nuc</it>, <it>aur</it>, <it>hla</it>) and down-regulated (<it>asp23</it>, <it>lysP</it>) genes in sigB mutant were verified.</p><p>In conclusion, we present a novel method for denoising and segmentation of tiling microarray signals based on wavelet multiresolution analysis that outperforms previous methods in terms of SNR, positive predictive value and computation time. The R code that implements the method is given as supplementary material and can be easily adapted to a parallel computing schema. Also, we have introduced the possibility of combining the results of ZCL with those obtained with other two well-known approaches (PMSW and SCM) for the segmentation of tiling signals.</p>
		</sec>
		<sec>
			<st>
				<p>Methods</p>
			</st>
			<sec>
				<st>
					<p>WT-based analysis</p>
				</st><p>The CWT of a continuous signal <it>s</it>(<it>x</it>)is defined as <abbrgrp>
						<abbr bid="B40">40</abbr>
					</abbrgrp>
				</p><p>
					<display-formula id="M3">
						<m:math name="1471-2105-13-222-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mi>CWT</m:mi>
<m:mo>(</m:mo>
<m:mi>a</m:mi>
<m:mo>,</m:mo>
<m:mi>b</m:mi>
<m:mo>)</m:mo>
<m:mo>=</m:mo>
<m:mfrac>
   <m:mrow>
      <m:mn>1</m:mn>
   </m:mrow>
   <m:mrow>
      <m:msqrt>
         <m:mrow>
            <m:mi>a</m:mi>
         </m:mrow>
      </m:msqrt>
   </m:mrow>
</m:mfrac>
<m:munderover>
   <m:mrow>
      <m:mo>&#8747;</m:mo>
   </m:mrow>
   <m:mrow>
      <m:mo>&#8722;</m:mo>
      <m:mi>&#8734;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mo>+</m:mo>
      <m:mi>&#8734;</m:mi>
   </m:mrow>
</m:munderover>
<m:mi>s</m:mi>
<m:mo>(</m:mo>
<m:mi>x</m:mi>
<m:mo>)</m:mo>
<m:msup>
   <m:mrow>
      <m:mi>&#968;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mo>&#8727;</m:mo>
   </m:mrow>
</m:msup>
<m:mfenced separators="" open="(" close=")">
   <m:mrow>
      <m:mfrac>
         <m:mrow>
            <m:mi>b</m:mi>
            <m:mo>&#8722;</m:mo>
            <m:mi>x</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>a</m:mi>
         </m:mrow>
      </m:mfrac>
   </m:mrow>
</m:mfenced>
<m:mi>dx</m:mi>
</m:math>
					</display-formula>
				</p><p>where <inline-formula>
						<m:math name="1471-2105-13-222-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mi>a</m:mi>
<m:mo>&#8712;</m:mo>
<m:msup>
   <m:mrow>
      <m:mi mathvariant="double-struck">R</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mo>+</m:mo>
   </m:mrow>
</m:msup>
<m:mo>&#8722;</m:mo>
<m:mo>{</m:mo>
<m:mn>0</m:mn>
<m:mo>}</m:mo>
</m:math>
					</inline-formula> is the scale, <inline-formula>
						<m:math name="1471-2105-13-222-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mi>b</m:mi>
<m:mo>&#8712;</m:mo>
<m:mi mathvariant="double-struck">R</m:mi>
</m:math>
					</inline-formula> is the translation, <it>&#968;</it>(<it>x</it>) is the mother wavelet, <it>&#968;</it>
					<sup>&#8727;</sup>((<it>b</it>&#8722;<it>x</it>)/<it>a</it>) is the complex conjugated, scaled and translated wavelet and <it>CWT</it> is the 2D matrix of wavelet coefficients. The continuous input signal <it>s</it>(<it>x</it>) interpolates the discrete input samples <it>s</it>
					<it>k</it>,<it>k</it>=1,&#8230;,<it>n</it>where <it>n</it> is the length of the signal.</p><p>The CWT can be interpreted as the correlation of the input signal with a position reversed version of <it>&#968;</it> rescaled by a factor <it>a</it>. For an 1D input signal, the result is a 2D description of the signal with respect to the position <it>b</it> and scale <it>a</it> and shifted by <it>b</it>. The scale <it>a</it> is inversely proportional to the central frequency of the dilated wavelet <it>&#968;</it>
					<sub>
						<it>a</it>
					</sub>=<it>&#968;</it>(<it>x</it>/<it>a</it>), which is typically a bandpass function; <it>b</it> represents the position location at which we analyze the signal. The larger the scale <it>a</it>, the wider the analyzing function <it>&#968;</it>
					<sub>
						<it>a</it>
					</sub>, and hence the smaller the corresponding analyzed frequency. The output value is maximized when the frequency of the signal matches that of the corresponding dilated wavelet. The CWT computation for arbitrary scales can be easily adapted to a parallel implementation with a linear computational complexity <abbrgrp>
						<abbr bid="B41">41</abbr>
					</abbrgrp>.</p><p>Mallat&#8217;s fast wavelet algorithm <abbrgrp>
						<abbr bid="B42">42</abbr>
					</abbrgrp> uses the multiresolution properties of the wavelet to compute the CWT at dyadic scales <it>a</it>=2<sup>
						<it>i</it>
					</sup> and time shifts <it>b</it>=2<sup>
						<it>i</it>
					</sup>
					<it>k</it>, <inline-formula>
						<m:math name="1471-2105-13-222-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mi>k</m:mi>
<m:mo>&#8712;</m:mo>
<m:mo>,</m:mo>
<m:mi mathvariant="double-struck">Z</m:mi>
</m:math>
					</inline-formula>, resulting in what is known as DWT. For additional information about the wavelet transform and its properties the reader is referred to <abbrgrp>
						<abbr bid="B17">17</abbr>
					</abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Normalization of tiling microarray data</p>
				</st><p>The analysis starts with background correction and quantile normalization as describe by the RMA algorithm <abbrgrp>
						<abbr bid="B34">34</abbr>
					</abbrgrp>. Next, we calculate the geometric mean of the RNA intensities and the geometric mean of the DNA replicates to get a signal score <it>s</it>
					<it>k</it> at position <it>k</it> proportional to the transcription level in the reference genome <abbrgrp>
						<abbr bid="B16">16</abbr>
					</abbrgrp>
				</p><p>
					<display-formula id="M4">
						<m:math name="1471-2105-13-222-i8" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mi>s</m:mi>
<m:mo>[</m:mo>
<m:mi>k</m:mi>
<m:mo>]</m:mo>
<m:mo>=</m:mo>
<m:mfrac>
   <m:mrow>
      <m:munderover>
         <m:mrow>
            <m:mo>&#8721;</m:mo>
         </m:mrow>
         <m:mrow>
            <m:mi>j</m:mi>
            <m:mo>=</m:mo>
            <m:mn>1</m:mn>
         </m:mrow>
         <m:mrow>
            <m:mi>n</m:mi>
         </m:mrow>
      </m:munderover>
      <m:mo>log</m:mo>
      <m:msub>
         <m:mrow>
            <m:mtext>RNA</m:mtext>
         </m:mrow>
         <m:mrow>
            <m:mi>j</m:mi>
         </m:mrow>
      </m:msub>
      <m:mo>[</m:mo>
      <m:mi>k</m:mi>
      <m:mo>]</m:mo>
   </m:mrow>
   <m:mrow>
      <m:munderover>
         <m:mrow>
            <m:mo>&#8721;</m:mo>
         </m:mrow>
         <m:mrow>
            <m:mi>j</m:mi>
            <m:mo>=</m:mo>
            <m:mn>1</m:mn>
         </m:mrow>
         <m:mrow>
            <m:mi>m</m:mi>
         </m:mrow>
      </m:munderover>
      <m:mo>log</m:mo>
      <m:msub>
         <m:mrow>
            <m:mtext>DNA</m:mtext>
         </m:mrow>
         <m:mrow>
            <m:mi>j</m:mi>
         </m:mrow>
      </m:msub>
      <m:mo>[</m:mo>
      <m:mi>k</m:mi>
      <m:mo>]</m:mo>
   </m:mrow>
</m:mfrac>
<m:mo>,</m:mo>
</m:math>
					</display-formula>
				</p><p>where <it>n</it> is the number of RNA samples and <it>m</it> is the number of DNA samples.</p>
			</sec>
			<sec>
				<st>
					<p>WT-based denoising</p>
				</st><p>One of the most established methods of wavelet-based denoising was proposed by Donoho and Johnstone <abbrgrp>
						<abbr bid="B29">29</abbr>
					</abbrgrp> and it is based on the thresholding of the DWT coefficients at scale <it>a</it>=2. This method is composed of three steps: (i) calculate the DWT of the tiling signal <it>s</it>
					<it>k</it> at scale <it>a</it>=2; (ii) threshold the wavelet coefficients; (iii) compute the inverse wavelet transform of the thresholded coefficients. A universal threshold, <it>T</it>, is proposed <abbrgrp>
						<abbr bid="B29">29</abbr>
					</abbrgrp> to remove white noise which it is given by </p><p>
					<display-formula id="M5">
						<m:math name="1471-2105-13-222-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mi>T</m:mi>
<m:mo>=</m:mo>
<m:mi>&#963;</m:mi>
<m:msqrt>
   <m:mrow>
      <m:mn>2</m:mn>
      <m:mo>log</m:mo>
      <m:mo>(</m:mo>
      <m:mi>n</m:mi>
      <m:mo>)</m:mo>
   </m:mrow>
</m:msqrt>
<m:mspace width="2.77695pt"/>
<m:mspace width="2.77695pt"/>
<m:mtext>with</m:mtext>
<m:mspace width="2.77695pt"/>
<m:mspace width="1em"/>
<m:mi>&#963;</m:mi>
<m:mo>=</m:mo>
<m:mtext>MAD</m:mtext>
<m:mo>/</m:mo>
<m:mn>0</m:mn>
<m:mi>.</m:mi>
<m:mn>6745</m:mn>
<m:mo>,</m:mo>
</m:math>
					</display-formula>
				</p><p>where <it>n</it> is the length of <it>s</it>, <it>&#963;</it>is the noise level and MAD is the estimated median absolute deviation in the first scale. An important issue is the selection of a suitable wavelet function. As the signal can be roughly approximated to a zero-order polynomial, a boxcar-like function such as the Haar wavelet gives a reasonable level of correlation (i.e., a good pattern matching) with the target signal.</p><p>Another well established method of wavelet shrinkage is SUREShrink <abbrgrp>
						<abbr bid="B30">30</abbr>
					</abbrgrp>. This is based on Stein&#8217;s Unbiased Estimator for Risk (SURE). A subband adaptive threshold is applied. If the wavelet coefficients in the <it>jth</it> subband are {<it>x</it>
					<sub>
						<it>i</it>
					</sub>:<it>i</it>=1,&#8230;,<it>n</it>}, we consider a soft thresholding procedure and apply Stein&#8217;s result. The quantity </p><p>
					<display-formula id="M6">
						<m:math name="1471-2105-13-222-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mtext>SURE</m:mtext>
<m:mo>(</m:mo>
<m:mi>T</m:mi>
<m:mo>;</m:mo>
<m:mi>x</m:mi>
<m:mo>)</m:mo>
<m:mo>=</m:mo>
<m:mi>n</m:mi>
<m:mo>&#8722;</m:mo>
<m:mn>2</m:mn>
<m:mo>&#183;</m:mo>
<m:mi>&#9839;</m:mi>
<m:mo>{</m:mo>
<m:mi>i</m:mi>
<m:mo>:</m:mo>
<m:mo>|</m:mo>
<m:msub>
   <m:mrow>
      <m:mi>x</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>i</m:mi>
   </m:mrow>
</m:msub>
<m:mo>|</m:mo>
<m:mo>&#8804;</m:mo>
<m:mi>T</m:mi>
<m:mo>}</m:mo>
<m:mo>+</m:mo>
<m:munderover>
   <m:mrow>
      <m:mo>&#8721;</m:mo>
   </m:mrow>
   <m:mrow>
      <m:mi>i</m:mi>
      <m:mo>=</m:mo>
      <m:mn>1</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mi>n</m:mi>
   </m:mrow>
</m:munderover>
<m:msup>
   <m:mrow>
      <m:mo>(</m:mo>
      <m:msub>
         <m:mrow>
            <m:mi>x</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>i</m:mi>
         </m:mrow>
      </m:msub>
      <m:mo>&#8743;</m:mo>
      <m:mi>T</m:mi>
      <m:mo>)</m:mo>
   </m:mrow>
   <m:mrow>
      <m:mn>2</m:mn>
   </m:mrow>
</m:msup>
</m:math>
					</display-formula>
				</p><p>is an unbiased estimate of risk, where <it>T</it> is the threshold and <it>x</it>
					<sub>
						<it>i</it>
					</sub>&#8743;<it>t</it>=min(<it>x</it>
					<sub>
						<it>i</it>
					</sub>
					<it>T</it>). This estimator can be used to select a threshold: </p><p>
					<display-formula id="M7">
						<m:math name="1471-2105-13-222-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:msub>
   <m:mrow>
      <m:mi>T</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mtext>SURE</m:mtext>
   </m:mrow>
</m:msub>
<m:mo>=</m:mo>
<m:mi>argmi</m:mi>
<m:msub>
   <m:mrow>
      <m:mi>n</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>0</m:mn>
      <m:mo>&#8804;</m:mo>
      <m:mi>T</m:mi>
      <m:mo>&#8804;</m:mo>
      <m:msqrt>
         <m:mrow>
            <m:mn>2</m:mn>
            <m:mi>logn</m:mi>
         </m:mrow>
      </m:msqrt>
   </m:mrow>
</m:msub>
<m:mtext>SURE</m:mtext>
<m:mo>(</m:mo>
<m:mi>T</m:mi>
<m:mo>;</m:mo>
<m:mi>x</m:mi>
<m:mo>)</m:mo>
</m:math>
					</display-formula>
				</p><p>For a large dimension <it>n</it> the law of large numbers will ensure that <it>T</it>
					<sub>SURE</sub> will be almost the optimal threshold <abbrgrp>
						<abbr bid="B30">30</abbr>
					</abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>WT-based segmentation</p>
				</st><p>An important issue in signal processing is to define an appropriate representation able to compress most of the signal information into few representative features. Sharp variations in amplitude (i.e., transitions and peaks) are among the most meaningful features of a signal. For that reason, many segmentation algorithms rely on their detection. Previous studies have detected the peaks in mass spectrometry data using either the ridge lines <abbrgrp>
						<abbr bid="B43">43</abbr>
					</abbrgrp> or the zero-crossing lines <abbrgrp>
						<abbr bid="B44">44</abbr>
					</abbrgrp> in a multi-scale decomposition of the signal. Zero-crossing lines seems a more consistent description as they belong to connected curves, are more robust to noise and easier to detect that ridge lines <abbrgrp>
						<abbr bid="B44">44</abbr>
					</abbrgrp>.</p><p>It has been previously shown that the position of multiscale sharp transitions can be obtained from the zero-crossings of the signal convolved with the Laplacian of a Gaussian <abbrgrp>
						<abbr bid="B45">45</abbr>
					</abbrgrp>. We define a wavelet at scale <it>a</it> as </p><p>
					<display-formula id="M8">
						<m:math name="1471-2105-13-222-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mi>&#968;</m:mi>
<m:mo>(</m:mo>
<m:mi>x</m:mi>
<m:mo>)</m:mo>
<m:mo>=</m:mo>
<m:mfrac>
   <m:mrow>
      <m:msup>
         <m:mrow>
            <m:mi>d</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mn>2</m:mn>
         </m:mrow>
      </m:msup>
      <m:msub>
         <m:mrow>
            <m:mi>&#952;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>a</m:mi>
         </m:mrow>
      </m:msub>
      <m:mo>(</m:mo>
      <m:mi>x</m:mi>
      <m:mo>)</m:mo>
   </m:mrow>
   <m:mrow>
      <m:mi>d</m:mi>
      <m:msup>
         <m:mrow>
            <m:mi>x</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mn>2</m:mn>
         </m:mrow>
      </m:msup>
   </m:mrow>
</m:mfrac>
</m:math>
					</display-formula>
				</p><p>where <it>&#952;</it>
					<sub>
						<it>a</it>
					</sub> is a Gaussian function dilated by a factor <it>a</it>. Since the wavelet transform can be represented as </p><p>
					<display-formula id="M9">
						<m:math name="1471-2105-13-222-i13" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mi>CWT</m:mi>
<m:mo>(</m:mo>
<m:mi>a</m:mi>
<m:mo>,</m:mo>
<m:mi>b</m:mi>
<m:mo>)</m:mo>
<m:mo>=</m:mo>
<m:mo>(</m:mo>
<m:mi>s</m:mi>
<m:mo>&#8727;</m:mo>
<m:msub>
   <m:mrow>
      <m:mi>&#968;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>a</m:mi>
   </m:mrow>
</m:msub>
<m:mo>)</m:mo>
<m:mo>(</m:mo>
<m:mi>x</m:mi>
<m:mo>)</m:mo>
<m:msub>
   <m:mrow>
      <m:mo>|</m:mo>
   </m:mrow>
   <m:mrow>
      <m:mi>x</m:mi>
      <m:mo>=</m:mo>
      <m:mi>b</m:mi>
   </m:mrow>
</m:msub>
</m:math>
					</display-formula>
				</p><p>we derive that </p><p>
					<display-formula id="M10">
						<m:math name="1471-2105-13-222-i14" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mtable columnalign="left">
   <m:mtr>
      <m:mtd>
         <m:mi>CWT</m:mi>
         <m:mo>(</m:mo>
         <m:mi>a</m:mi>
         <m:mo>,</m:mo>
         <m:mi>b</m:mi>
         <m:mo>)</m:mo>
         <m:mo>=</m:mo>
         <m:mfenced separators="" open="[" close="]">
            <m:mrow>
               <m:mi>s</m:mi>
               <m:mo>&#8727;</m:mo>
               <m:mfenced separators="" open="(" close=")">
                  <m:mrow>
                     <m:msup>
                        <m:mrow>
                           <m:mi>a</m:mi>
                        </m:mrow>
                        <m:mrow>
                           <m:mn>2</m:mn>
                        </m:mrow>
                     </m:msup>
                     <m:mfrac>
                        <m:mrow>
                           <m:msup>
                              <m:mrow>
                                 <m:mi>d</m:mi>
                              </m:mrow>
                              <m:mrow>
                                 <m:mn>2</m:mn>
                              </m:mrow>
                           </m:msup>
                           <m:msub>
                              <m:mrow>
                                 <m:mi>&#952;</m:mi>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>a</m:mi>
                              </m:mrow>
                           </m:msub>
                        </m:mrow>
                        <m:mrow>
                           <m:mi>d</m:mi>
                           <m:msup>
                              <m:mrow>
                                 <m:mi>x</m:mi>
                              </m:mrow>
                              <m:mrow>
                                 <m:mn>2</m:mn>
                              </m:mrow>
                           </m:msup>
                        </m:mrow>
                     </m:mfrac>
                  </m:mrow>
               </m:mfenced>
            </m:mrow>
         </m:mfenced>
         <m:mo>(</m:mo>
         <m:mi>x</m:mi>
         <m:mo>)</m:mo>
         <m:msub>
            <m:mrow>
               <m:mo>|</m:mo>
            </m:mrow>
            <m:mrow>
               <m:mi>x</m:mi>
               <m:mo>=</m:mo>
               <m:mi>b</m:mi>
            </m:mrow>
         </m:msub>
      </m:mtd>
   </m:mtr>
   <m:mtr>
      <m:mtd>
         <m:mspace width="5em"/>
         <m:mo>=</m:mo>
         <m:msup>
            <m:mrow>
               <m:mi>a</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mn>2</m:mn>
            </m:mrow>
         </m:msup>
         <m:mfrac>
            <m:mrow>
               <m:msup>
                  <m:mrow>
                     <m:mi>d</m:mi>
                  </m:mrow>
                  <m:mrow>
                     <m:mn>2</m:mn>
                  </m:mrow>
               </m:msup>
            </m:mrow>
            <m:mrow>
               <m:mi>d</m:mi>
               <m:msup>
                  <m:mrow>
                     <m:mi>x</m:mi>
                  </m:mrow>
                  <m:mrow>
                     <m:mn>2</m:mn>
                  </m:mrow>
               </m:msup>
            </m:mrow>
         </m:mfrac>
         <m:mo>(</m:mo>
         <m:mi>s</m:mi>
         <m:mo>&#8727;</m:mo>
         <m:msub>
            <m:mrow>
               <m:mi>&#952;</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi>a</m:mi>
            </m:mrow>
         </m:msub>
         <m:mo>)</m:mo>
         <m:mo>(</m:mo>
         <m:mi>x</m:mi>
         <m:mo>)</m:mo>
         <m:msub>
            <m:mrow>
               <m:mo>|</m:mo>
            </m:mrow>
            <m:mrow>
               <m:mi>x</m:mi>
               <m:mo>=</m:mo>
               <m:mi>b</m:mi>
            </m:mrow>
         </m:msub>
      </m:mtd>
   </m:mtr>
</m:mtable>
</m:math>
					</display-formula>
				</p><p>Hence, the wavelet transform of <it>s</it>(<it>x</it>) is proportional to the second derivative of <it>s</it>(<it>x</it>)smoothed by <it>&#952;</it>
					<sub>
						<it>a</it>
					</sub>(<it>x</it>). The zero-crossings of <it>CWT</it>(<it>a</it>
					<it>b</it>) correspond to the inflection points of <it>s</it>&#8727;<it>&#952;</it>
					<sub>
						<it>a</it>
					</sub>. The identification of transcript start and end sites is achieved by computation of the redundant CWT over a wide scale range followed by zero-crossing line detection and length thresholding. The chosen mother wavelet is the second derivative of a Gaussian. The redundancy of the CWT yields enhanced information on the position-scale localization of the features of interest (in this case, the transitions) <abbrgrp>
						<abbr bid="B46">46</abbr>
					</abbrgrp>.</p><p>An illustrative example is given in Figure <figr fid="F7">7</figr>. We generated a simulated transcriptional unit with a rectangular pulse signal of 721 samples with additive Gaussian noise (mean 0 and standard deviation 0.5) (see Figure <figr fid="F7">7</figr>(a)). The absolute values of the wavelet transform coefficients and the zero crossing lines are shown in Figure <figr fid="F7">7</figr>(b) and (c), respectively. Observe how the position of these lines corresponds to abrupt intensity transitions in the noisy signal and the longest connected curves identify the start and end points of the rectangular pulse. The R functions provided as supplementary material detect the zero-crossing lines and identify them as transcription start sites (TSS) and transcription end sites (TES) depending on the slope sign.</p>
				<fig id="F7"><title><p>Figure 7</p></title><caption><p>Zero-crossing lines of the second derivative Gaussian wavelet</p></caption><text>
   <p><b>Zero-crossing lines of the second derivative Gaussian wavelet.</b> An illustration of zero crossing lines detection. <b>(a)</b> Box signal contaminated with additive Gaussian noise (standard deviation = 0.5). <b>(b)</b> Absolute values of the CWT coefficients. The second derivative of the Gaussian was used as the mother wavelet. <b>(c)</b> All zero-crossing lines are shown. Note how the two longest lines correspond to the two sharp transitions of the box signal.</p>
</text><graphic file="1471-2105-13-222-7"/></fig>
			</sec>
			<sec>
				<st>
					<p>Identification of transcriptional active regions</p>
				</st><p>The candidates start and end sites detected as described in the previous section, are filtered to remove incorrect assignments. The purpose of this procedure is to filter those transitions that do not correspond to variations in signal intensity. For the generation of TARs we considered the signal transitions in which variation in intensity is at least 10% of the dynamic range of the analyzed signal. We also eliminate from the list of detected TAR all the start and end points that are not correctly paired off. We use the sign of the zero-crossing lines to separate start and end points and we match each start site with its corresponding end site. Finally, we define the minimum normalized intensity threshold required for the segments to be considered as transcriptional active regions. This value is calculated as the median of the signal intensity distribution, but this threshold can also be user-defined. In order to improve the definition of TARs, we cluster together consecutive segments for which the mean normalized intensity value is over the threshold.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Competing interests</p>
			</st><p>The authors declare that they have no competing interests.</p>
		</sec>
		<sec>
			<st>
				<p>Authors&#8217; contributions</p>
			</st><p>VS and AMB conceived the idea, developed the methods and implemented the software. ATA and IL design the NA-Staph-b520729F microarray and MU and ATA carried out the processing and hybridization of samples. ATA, IL and VS made the biological interpretation of the results. All authors participated in writing and revising the manuscript.</p>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st><p>We thank Prof. Fernando J. Corrales and Lourdes Ortiz (Genomics Core Facility) for technical support and all the useful comments about the manuscript. This work was supported by the spanish Torres-Quevedo fellowship [PTQ-08-03-07769] to VS. ATA and AMB were supported by Spanish Ministry of Science and Innovation &#8216;Ram&#243;n y Cajal&#8217; contracts. This work was supported by the Spanish Ministry of Science and Innovation Grants BIO2008-05284-C02-01, BFU2011-23222, ERA-NET Pathogenomics PIM2010EPA-00606 and the agreement between &#8216;Fundaci&#243;n para la Investigaci&#243;n m&#233;dica aplicada&#8217; (FIMA) and the &#8217;UTE project CIMA&#8217;.</p>
			</sec>
		</ack>
		<refgrp><bibl id="B1"><title><p>Applications of DNA tiling arrays for whole-genome analysis</p></title><aug><au><snm>Mockler</snm><fnm>TC</fnm></au><au><snm>Chan</snm><fnm>S</fnm></au><au><snm>Sundaresan</snm><fnm>A</fnm></au><au><snm>Chen</snm><fnm>H</fnm></au><au><snm>Jacobsen</snm><fnm>SE</fnm></au><au><snm>Ecker</snm><fnm>JR</fnm></au></aug><source>Genomics</source><pubdate>2005</pubdate><volume>85</volume><fpage>1</fpage><lpage>15</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ygeno.2004.10.005</pubid><pubid idtype="pmpid" link="fulltext">15607417</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>RNA sequencing: advances, challenges and opportunities</p></title><aug><au><snm>Ozsolak</snm><fnm>F</fnm></au><au><snm>Milos</snm><fnm>PM</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2011</pubdate><volume>12</volume><issue>2</issue><fpage>87</fpage><lpage>98</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg2934</pubid><pubid idtype="pmcid">3031867</pubid><pubid idtype="pmpid" link="fulltext">21191423</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Human transcriptome array for high-throughput clinical studies</p></title><aug><au><snm>Xu</snm><fnm>W</fnm></au><au><snm>Seok</snm><fnm>J</fnm></au><au><snm>Mindrinos</snm><fnm>MN</fnm></au><au><snm>Schweitzer</snm><fnm>AC</fnm></au><au><snm>Jiang</snm><fnm>H</fnm></au><au><snm>Wilhelmy</snm><fnm>J</fnm></au><au><snm>Clark</snm><fnm>TA</fnm></au><au><snm>Kapur</snm><fnm>K</fnm></au><au><snm>Xing</snm><fnm>Y</fnm></au><au><snm>Faham</snm><fnm>M</fnm></au><au><snm>Storey</snm><fnm>JD</fnm></au><au><snm>Moldawer</snm><fnm>LL</fnm></au><au><snm>Maier</snm><fnm>RV</fnm></au><au><snm>Tompkins</snm><fnm>RG</fnm></au><au><snm>Wong</snm><fnm>WH</fnm></au><au><snm>Davis</snm><fnm>RW</fnm></au><au><snm>Xiao</snm><fnm>W</fnm></au><au><cnm>the Inflammation and Host Response to Injury Large-Scale Collaborative Research Program</cnm></au></aug><source>Proc Natl Acad Sci U S A</source><pubdate>2011</pubdate><volume>108</volume><issue>9</issue><fpage>3707</fpage><lpage>3712</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.1019753108</pubid><pubid idtype="pmcid">3048146</pubid><pubid idtype="pmpid" link="fulltext">21317363</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>RNA maps reveal new RNA classes and a possible function for pervasive transcription</p></title><aug><au><snm>Kapranov</snm><fnm>P</fnm></au><au><snm>Cheng</snm><fnm>J</fnm></au><au><snm>Dike</snm><fnm>S</fnm></au><au><snm>Nix</snm><fnm>DA</fnm></au><au><snm>Duttagupta</snm><fnm>R</fnm></au><au><snm>Willingham</snm><fnm>AT</fnm></au><au><snm>Stadler</snm><fnm>PF</fnm></au><au><snm>Hertel</snm><fnm>J</fnm></au><au><snm>Hackerm&#252;ller</snm><fnm>J</fnm></au><au><snm>Hofacker</snm><fnm>IL</fnm></au><au><snm>Bel</snm><fnm>lI</fnm></au><au><snm>Cheung</snm><fnm>E</fnm></au><au><snm>Drenkow</snm><fnm>J</fnm></au><au><snm>Dumais</snm><fnm>E</fnm></au><au><snm>Patel</snm><fnm>S</fnm></au><au><snm>Helt</snm><fnm>G</fnm></au><au><snm>Ganesh</snm><fnm>M</fnm></au><au><snm>Ghosh</snm><fnm>S</fnm></au><au><snm>Piccolboni</snm><fnm>A</fnm></au><au><snm>Sementchenko</snm><fnm>V</fnm></au><au><snm>Tammana</snm><fnm>H</fnm></au><au><snm>Gingeras</snm><fnm>TR</fnm></au></aug><source>Science</source><pubdate>2007</pubdate><volume>316</volume><issue>5830</issue><fpage>1484</fpage><lpage>1488</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1138341</pubid><pubid idtype="pmpid" link="fulltext">17510325</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Assessing the performance of different high-density tiling microarray strategies for mapping transcribed regions of the human genome</p></title><aug><au><snm>Emanuelsson</snm><fnm>O</fnm></au><au><snm>Nagalakshmi</snm><fnm>U</fnm></au><au><snm>Zheng</snm><fnm>D</fnm></au><au><snm>Rozowsky</snm><fnm>JS</fnm></au><au><snm>Urban</snm><fnm>AE</fnm></au><au><snm>Du</snm><fnm>J</fnm></au><au><snm>Lian</snm><fnm>Z</fnm></au><au><snm>Stolc</snm><fnm>V</fnm></au><au><snm>Weissman</snm><fnm>S</fnm></au><au><snm>Snyder</snm><fnm>M</fnm></au><au><snm>Gerstein</snm><fnm>MB</fnm></au></aug><source>Genome Res</source><pubdate>2007</pubdate><volume>17</volume><issue>6</issue><fpage>886</fpage><lpage>897</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.5014606</pubid><pubid idtype="pmcid">1891347</pubid><pubid idtype="pmpid" link="fulltext">17119069</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Novel RNAs identified from an in-depth analysis of the transcriptome of human chromosomes 21 and 22</p></title><aug><au><snm>Kampa</snm><fnm>D</fnm></au><au><snm>Cheng</snm><fnm>J</fnm></au><au><snm>Kapranov</snm><fnm>P</fnm></au><au><snm>Yamanaka</snm><fnm>M</fnm></au><au><snm>Brubaker</snm><fnm>S</fnm></au><au><snm>Cawley</snm><fnm>S</fnm></au><au><snm>Drenkow</snm><fnm>J</fnm></au><au><snm>Piccolboni</snm><fnm>A</fnm></au><au><snm>Bekiranov</snm><fnm>S</fnm></au><au><snm>Helt</snm><fnm>G</fnm></au><au><snm>Tammana</snm><fnm>H</fnm></au><au><snm>Gingeras</snm><fnm>TR</fnm></au></aug><source>Genome Res</source><pubdate>2004</pubdate><volume>14</volume><issue>3</issue><fpage>331</fpage><lpage>342</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.2094104</pubid><pubid idtype="pmcid">353210</pubid><pubid idtype="pmpid" link="fulltext">14993201</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Tilescope: online analysis pipeline for high-density tiling microarray data</p></title><aug><au><snm>Zhang</snm><fnm>ZD</fnm></au><au><snm>Rozowsky</snm><fnm>J</fnm></au><au><snm>Lam</snm><fnm>HY</fnm></au><au><snm>Du</snm><fnm>J</fnm></au><au><snm>Snyder</snm><fnm>M</fnm></au><au><snm>Gerstein</snm><fnm>M</fnm></au></aug><source>Genome Biol</source><pubdate>2007</pubdate><volume>8</volume><issue>5</issue></bibl><bibl id="B8"><title><p>An efficient pseudomedian filter for tiling microrrays</p></title><aug><au><snm>Royce</snm><fnm>TE</fnm></au><au><snm>Carriero</snm><fnm>NJ</fnm></au><au><snm>Gerstein</snm><fnm>MB</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2007</pubdate><volume>8</volume><fpage>186</fpage><lpage>186</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-8-186</pubid><pubid idtype="pmcid">1913926</pubid><pubid idtype="pmpid" link="fulltext">17555595</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>MSB: a mean-shift-based approach for the analysis of structural variation in the genome</p></title><aug><au><snm>Wang</snm><fnm>LY</fnm></au><au><snm>Abyzov</snm><fnm>A</fnm></au><au><snm>Korbel</snm><fnm>JO</fnm></au><au><snm>Snyder</snm><fnm>M</fnm></au><au><snm>Gerstein</snm><fnm>M</fnm></au></aug><source>Genome Res</source><pubdate>2009</pubdate><volume>19</volume><fpage>106</fpage><lpage>117</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2612956</pubid><pubid idtype="pmpid" link="fulltext">19037015</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>TileMap: create chromosomal map of tiling array hybridizations</p></title><aug><au><snm>Ji</snm><fnm>H</fnm></au><au><snm>Wong</snm><fnm>WH</fnm></au></aug><source>Bioinformatics</source><pubdate>2005</pubdate><volume>21</volume><issue>18</issue><fpage>3629</fpage><lpage>3636</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bti593</pubid><pubid idtype="pmpid" link="fulltext">16046496</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>A supervised hidden markov model framework for efficiently segmenting tiling array data in transcriptional and chIP-chip experiments: systematically incorporating validated biological knowledge</p></title><aug><au><snm>Du</snm><fnm>J</fnm></au><au><snm>Rozowsky</snm><fnm>JS</fnm></au><au><snm>Korbel</snm><fnm>JO</fnm></au><au><snm>Zhang</snm><fnm>ZD</fnm></au><au><snm>Royce</snm><fnm>TE</fnm></au><au><snm>Schultz</snm><fnm>MH</fnm></au><au><snm>Snyder</snm><fnm>M</fnm></au><au><snm>Gerstein</snm><fnm>M</fnm></au></aug><source>Bioinformatics</source><pubdate>2006</pubdate><volume>22</volume><issue>24</issue><fpage>3016</fpage><lpage>3024</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btl515</pubid><pubid idtype="pmpid" link="fulltext">17038339</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Tiling array-driven elucidation of transcriptional structures based on maximum-likelihood and Markov models</p></title><aug><au><snm>Toyoda</snm><fnm>T</fnm></au><au><snm>Shinozaki</snm><fnm>K</fnm></au></aug><source>Plant J</source><pubdate>2005</pubdate><volume>43</volume><issue>4</issue><fpage>611</fpage><lpage>621</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1365-313X.2005.02470.x</pubid><pubid idtype="pmpid" link="fulltext">16098113</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Transcriptional landscape estimation from tiling array data using a model of signal shift and drift</p></title><aug><au><snm>Nicolas</snm><fnm>P</fnm></au><au><snm>Leduc</snm><fnm>A</fnm></au><au><snm>Robin</snm><fnm>S</fnm></au><au><snm>Rasmussen</snm><fnm>S</fnm></au><au><snm>Jarmer</snm><fnm>H</fnm></au><au><snm>Bessi&#232;res</snm><fnm>P</fnm></au></aug><source>Bioinformatics</source><pubdate>2009</pubdate><volume>25</volume><issue>18</issue><fpage>2341</fpage><lpage>2347</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btp395</pubid><pubid idtype="pmcid">2735659</pubid><pubid idtype="pmpid" link="fulltext">19561016</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Circular binary segmentation for the analysis of array-based DNA copy number data</p></title><aug><au><snm>Olshen</snm><fnm>AB</fnm></au><au><snm>Venkatraman</snm><fnm>ES</fnm></au><au><snm>Lucito</snm><fnm>R</fnm></au><au><snm>Wigler</snm><fnm>M</fnm></au></aug><source>Biostatistics</source><pubdate>2004</pubdate><volume>5</volume><issue>4</issue><fpage>557</fpage><lpage>572</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/biostatistics/kxh008</pubid><pubid idtype="pmpid" link="fulltext">15475419</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Transcript mapping with high-density oligonucleotide tiling arrays</p></title><aug><au><snm>Huber</snm><fnm>W</fnm></au><au><snm>Toedling</snm><fnm>J</fnm></au><au><snm>Steinmetz</snm><fnm>LM</fnm></au></aug><source>Bioinformatics</source><pubdate>2006</pubdate><volume>22</volume><issue>16</issue><fpage>1963</fpage><lpage>1970</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btl289</pubid><pubid idtype="pmpid" link="fulltext">16787969</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>A high-resolution map of transcription in the yeast genome</p></title><aug><au><snm>David</snm><fnm>L</fnm></au><au><snm>Huber</snm><fnm>W</fnm></au><au><snm>Granovskaia</snm><fnm>M</fnm></au><au><snm>Toedling</snm><fnm>J</fnm></au><au><snm>Palm</snm><fnm>CJ</fnm></au><au><snm>Bofkin</snm><fnm>L</fnm></au><au><snm>Jones</snm><fnm>T</fnm></au><au><snm>Davis</snm><fnm>RW</fnm></au><au><snm>Steinmetz</snm><fnm>LM</fnm></au></aug><source>Proc Natl Acad Sci U S A</source><pubdate>2006</pubdate><volume>103</volume><issue>14</issue><fpage>5320</fpage><lpage>5325</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0601091103</pubid><pubid idtype="pmcid">1414796</pubid><pubid idtype="pmpid" link="fulltext">16569694</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><aug><au><snm>Mallat</snm><fnm>S</fnm></au></aug><source>A Wavelet Tour of Signal Processing</source><publisher>San Diego: Academic Press</publisher><pubdate>1998</pubdate></bibl><bibl id="B18"><title><p>Tiling array data analysis: a multiscale approach using wavelets</p></title><aug><au><snm>Karpikov</snm><fnm>A</fnm></au><au><snm>Rozowsky</snm><fnm>J</fnm></au><au><snm>Gerstein</snm><fnm>M</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2011</pubdate><volume>12</volume><fpage>57</fpage><lpage>57</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-12-57</pubid><pubid idtype="pmcid">3055839</pubid><pubid idtype="pmpid" link="fulltext">21338513</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>ChIP-chip: considerations for the design, analysis, and application of genome-wide chromatin immunoprecipitation experiments</p></title><aug><au><snm>Buck</snm><fnm>MJ</fnm></au><au><snm>Lieb</snm><fnm>JD</fnm></au></aug><source>Genomics</source><pubdate>2004</pubdate><volume>83</volume><issue>3</issue><fpage>349</fpage><lpage>360</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ygeno.2003.11.004</pubid><pubid idtype="pmpid">14986705</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Zero-crossings of a wavelet transform</p></title><aug><au><snm>Mallat</snm><fnm>S</fnm></au></aug><source>IEEE Trans Info Th</source><pubdate>1991</pubdate><volume>37</volume><issue>4</issue><fpage>1019</fpage><lpage>1033</lpage><xrefbib><pubid idtype="doi">10.1109/18.86995</pubid></xrefbib></bibl><bibl id="B21"><title><p>Microarray-based analysis of the Staphylococcus aureus sigmaB regulon</p></title><aug><au><snm>Bischoff</snm><fnm>M</fnm></au><au><snm>Dunman</snm><fnm>P</fnm></au><au><snm>Kormanec</snm><fnm>J</fnm></au><au><snm>Macapagal</snm><fnm>D</fnm></au><au><snm>Murphy</snm><fnm>E</fnm></au><au><snm>Mounts</snm><fnm>W</fnm></au><au><snm>Berger-B&#228;chi</snm><fnm>B</fnm></au><au><snm>Projan</snm><fnm>S</fnm></au></aug><source>J Bacteriol</source><pubdate>2004</pubdate><volume>186</volume><issue>13</issue><fpage>4085</fpage><lpage>4099</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JB.186.13.4085-4099.2004</pubid><pubid idtype="pmcid">421609</pubid><pubid idtype="pmpid" link="fulltext">15205410</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Regulation of sigmaB-dependent transcription of sigB and asp23 in two different Staphylococcus aureus strains</p></title><aug><au><snm>Gertz</snm><fnm>S</fnm></au><au><snm>Engelmann</snm><fnm>S</fnm></au><au><snm>Schmid</snm><fnm>R</fnm></au><au><snm>Ohlsen</snm><fnm>K</fnm></au><au><snm>Hacker</snm><fnm>J</fnm></au><au><snm>Hecker</snm><fnm>M</fnm></au></aug><source>Mol Gen Genet</source><pubdate>1999</pubdate><volume>261</volume><issue>3</issue><fpage>558</fpage><lpage>566</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s004380051001</pubid><pubid idtype="pmpid">10323238</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Deletion of the alternative sigma factor sigmaB in Staphylococcus aureus reveals its function as a global regulator of virulence genes</p></title><aug><au><snm>Kullik</snm><fnm>I</fnm></au><au><snm>Giachino</snm><fnm>P</fnm></au><au><snm>Fuchs</snm><fnm>T</fnm></au></aug><source>J Bacteriol</source><pubdate>1998</pubdate><volume>180</volume><issue>18</issue><fpage>4814</fpage><lpage>4820</lpage><xrefbib><pubidlist><pubid idtype="pmcid">107504</pubid><pubid idtype="pmpid" link="fulltext">9733682</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Extracellular proteins of Staphylococcus aureus and the role of SarA and sigma B</p></title><aug><au><snm>Ziebandt</snm><fnm>AK</fnm></au><au><snm>Weber</snm><fnm>H</fnm></au><au><snm>Rudolph</snm><fnm>J</fnm></au><au><snm>Schmid</snm><fnm>R</fnm></au><au><snm>H&#246;per</snm><fnm>D</fnm></au><au><snm>Engelmann</snm><fnm>S</fnm></au><au><snm>Hecker</snm><fnm>M</fnm></au></aug><source>Proteomics</source><pubdate>2001</pubdate><volume>1</volume><issue>4</issue><fpage>480</fpage><lpage>493</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/1615-9861(200104)1:4&lt;480::AID-PROT480&gt;3.0.CO;2-O</pubid><pubid idtype="pmpid">11681202</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Variation in extracellular protease production among clinical isolates of Staphylococcus aureus due to different levels of expression of the protease repressor sarA</p></title><aug><au><snm>Karlsson</snm><fnm>A</fnm></au><au><snm>Arvidson</snm><fnm>S</fnm></au></aug><source>Infect Immun</source><pubdate>2002</pubdate><volume>70</volume><issue>8</issue><fpage>4239</fpage><lpage>4246</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/IAI.70.8.4239-4246.2002</pubid><pubid idtype="pmcid">128181</pubid><pubid idtype="pmpid" link="fulltext">12117932</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Sigma(B) activity depends on RsbU in Staphylococcus aureus</p></title><aug><au><snm>Giachino</snm><fnm>P</fnm></au><au><snm>Engelmann</snm><fnm>S</fnm></au><au><snm>Bischoff</snm><fnm>M</fnm></au></aug><source>J Bacteriol</source><pubdate>2001</pubdate><volume>183</volume><issue>6</issue><fpage>1843</fpage><lpage>1852</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JB.183.6.1843-1852.2001</pubid><pubid idtype="pmcid">95078</pubid><pubid idtype="pmpid" link="fulltext">11222581</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><aug><au><snm>Gentleman</snm><fnm>R</fnm></au><au><snm>Carey</snm><fnm>V</fnm></au><au><snm>Huber</snm><fnm>W</fnm></au><au><snm>Irizarry</snm><fnm>R</fnm></au><au><snm>Dudoit</snm><fnm>S</fnm></au></aug><source>Bioinformatics and Computational Biology Solutions Using R and Bioconductor (Statistics for Biology and Health)</source><publisher>Secaucus, NJ, USA: Springer-Verlag New York, Inc.</publisher><pubdate>2005</pubdate></bibl><bibl id="B28"><title><p>Antisense artifacts in transcriptome microarray experiments are resolved by actinomycin D</p></title><aug><au><snm>Perocchi</snm><fnm>F</fnm></au><au><snm>Xu</snm><fnm>Z</fnm></au><au><snm>Clauder-M&#252;nster</snm><fnm>S</fnm></au><au><snm>Steinmetz</snm><fnm>LM</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2007</pubdate><volume>35</volume><issue>19</issue><fpage>e128</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkm683</pubid><pubid idtype="pmcid">2095812</pubid><pubid idtype="pmpid" link="fulltext">17897965</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Ideal Spatial Adaptation by Wavelet Shrinkage</p></title><aug><au><snm>Donoho</snm><fnm>D</fnm></au><au><snm>Johnstone</snm><fnm>I</fnm></au><au><snm>Johnstone</snm><fnm>IM</fnm></au></aug><source>Biometrika</source><pubdate>1993</pubdate><volume>81</volume><fpage>425</fpage><lpage>455</lpage></bibl><bibl id="B30"><title><p>Adapting to Unknown Smoothness via Wavelet Shrinkage</p></title><aug><au><snm>Donoho</snm><fnm>D</fnm></au><au><snm>Johnstone</snm><fnm>IM</fnm></au></aug><source>J Am Stat Assoc</source><pubdate>1995</pubdate><volume>90</volume><issue>432</issue><fpage>1200</fpage><lpage>1224</lpage><xrefbib><pubid idtype="doi">10.1080/01621459.1995.10476626</pubid></xrefbib></bibl><bibl id="B31"><aug><au><snm>Hollander</snm><fnm>M</fnm></au><au><snm>Wolfe</snm><fnm>DA</fnm></au></aug><source>Nonparametric Statistical Methods, 2nd Edition</source><publisher>Wiley-Interscience</publisher><pubdate>1999</pubdate></bibl><bibl id="B32"><title><p>A statistical approach for array CGH data analysis</p></title><aug><au><snm>Picard</snm><fnm>F</fnm></au><au><snm>Robin</snm><fnm>S</fnm></au><au><snm>Lavielle</snm><fnm>M</fnm></au><au><snm>Vaisse</snm><fnm>C</fnm></au><au><snm>Daudin</snm><fnm>JJ</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2005</pubdate><volume>6</volume><fpage>27</fpage><lpage>27</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-6-27</pubid><pubid idtype="pmcid">549559</pubid><pubid idtype="pmpid" link="fulltext">15705208</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Double error shrinkage method for identifying protein binding sites observed by tiling arrays with limited replication</p></title><aug><au><snm>Kim</snm><fnm>Y</fnm></au><au><snm>Bekiranov</snm><fnm>S</fnm></au><au><snm>Lee</snm><fnm>JK</fnm></au><au><snm>Park</snm><fnm>T</fnm></au></aug><source>Bioinformatics</source><pubdate>2009</pubdate><volume>25</volume><issue>19</issue><fpage>2486</fpage><lpage>2491</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btp471</pubid><pubid idtype="pmcid">2800349</pubid><pubid idtype="pmpid" link="fulltext">19667080</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Summaries of Affymetrix GeneChip probe level data</p></title><aug><au><snm>Irizarry</snm><fnm>RA</fnm></au><au><snm>Bolstad</snm><fnm>BM</fnm></au><au><snm>Collin</snm><fnm>F</fnm></au><au><snm>Cope</snm><fnm>LM</fnm></au><au><snm>Hobbs</snm><fnm>B</fnm></au><au><snm>Speed</snm><fnm>TP</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2003</pubdate><volume>31</volume><issue>4</issue><fpage>e15</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gng015</pubid><pubid idtype="pmcid">150247</pubid><pubid idtype="pmpid" link="fulltext">12582260</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Prokaryotic transcriptomics: a new view on regulation, physiology and pathogenicity</p></title><aug><au><snm>Sorek</snm><fnm>R</fnm></au><au><snm>Cossart</snm><fnm>P</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2010</pubdate><volume>11</volume><fpage>9</fpage><lpage>16</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">19935729</pubid></xrefbib></bibl><bibl id="B36"><title><p>Small non-coding RNAs in Caulobacter crescentus</p></title><aug><au><snm>Landt</snm><fnm>SG</fnm></au><au><snm>Abeliuk</snm><fnm>E</fnm></au><au><snm>McGrath</snm><fnm>PT</fnm></au><au><snm>Lesley</snm><fnm>JA</fnm></au><au><snm>McAdams</snm><fnm>HH</fnm></au><au><snm>Shapiro</snm><fnm>L</fnm></au></aug><source>Mol Microbiol</source><pubdate>2008</pubdate><volume>68</volume><issue>3</issue><fpage>600</fpage><lpage>614</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1365-2958.2008.06172.x</pubid><pubid idtype="pmpid" link="fulltext">18373523</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>The transcription unit architecture of the Escherichia coli genome</p></title><aug><au><snm>Cho</snm><fnm>BK</fnm></au><au><snm>Zengler</snm><fnm>K</fnm></au><au><snm>Qiu</snm><fnm>Y</fnm></au><au><snm>Park</snm><fnm>YS</fnm></au><au><snm>Knight</snm><fnm>EM</fnm></au><au><snm>Barrett</snm><fnm>CL</fnm></au><au><snm>Gao</snm><fnm>Y</fnm></au><au><snm>Palsson</snm><fnm>B&#216;</fnm></au></aug><source>Nat Biotechnol</source><pubdate>2009</pubdate><volume>27</volume><issue>11</issue><fpage>1043</fpage><lpage>1049</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nbt.1582</pubid><pubid idtype="pmpid" link="fulltext">19881496</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>The Listeria transcriptional landscape from saprophytism to virulence</p></title><aug><au><snm>Toledo-Arana</snm><fnm>A</fnm></au><au><snm>Dussurget</snm><fnm>O</fnm></au><au><snm>Nikitas</snm><fnm>G</fnm></au><au><snm>Sesto</snm><fnm>N</fnm></au><au><snm>Guet-Revillet</snm><fnm>H</fnm></au><au><snm>Balestrino</snm><fnm>D</fnm></au><au><snm>Loh</snm><fnm>E</fnm></au><au><snm>Gripenland</snm><fnm>J</fnm></au><au><snm>Tiensuu</snm><fnm>T</fnm></au><au><snm>Vaitkevicius</snm><fnm>K</fnm></au><au><snm>Barthelemy</snm><fnm>M</fnm></au><au><snm>Vergassola</snm><fnm>M</fnm></au><au><snm>Nahori</snm><fnm>MA</fnm></au><au><snm>Soubigou</snm><fnm>G</fnm></au><au><snm>R&#233;gnault</snm><fnm>B</fnm></au><au><snm>Copp&#233;e</snm><fnm>JY</fnm></au><au><snm>Lecuit</snm><fnm>M</fnm></au><au><snm>Johansson</snm><fnm>J</fnm></au><au><snm>Cossart</snm><fnm>P</fnm></au></aug><source>Nature</source><pubdate>2009</pubdate><volume>459</volume><issue>7249</issue><fpage>950</fpage><lpage>956</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature08080</pubid><pubid idtype="pmpid" link="fulltext">19448609</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>Linear models and empirical bayes methods for assessing differential expression in microarray experiments</p></title><aug><au><snm>Smyth</snm><fnm>GK</fnm></au></aug><source>Stat Appl Genet Mol Biol</source><pubdate>2004</pubdate><volume>3</volume><fpage>Article3</fpage><xrefbib><pubid idtype="pmpid" link="fulltext">16646809</pubid></xrefbib></bibl><bibl id="B40"><aug><au><snm>Daubechies</snm><fnm>I</fnm></au></aug><source>Ten Lectures on Wavelets (CBMS-NSF Regional Conference Series in Applied Mathematics)</source><publisher>1 edition. SIAM: Society for Industrial and Applied Mathematics</publisher><pubdate>1992</pubdate></bibl><bibl id="B41"><title><p>Continuous Wavelet Transform with Arbitrary Scales and <inline-formula><m:math name="1471-2105-13-222-i15" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mi mathvariant="bold-script">O</m:mi>
<m:mo mathvariant="bold">(</m:mo>
<m:mi mathvariant="bold">N</m:mi>
<m:mo mathvariant="bold">)</m:mo>
</m:math></inline-formula> Complexity</p></title><aug><au><snm>Munoz Barrutia</snm><fnm>A</fnm></au><au><snm>Ertle</snm><fnm>R</fnm></au><au><snm>Unser</snm><fnm>M</fnm></au></aug><source>Signal Processing</source><pubdate>2002</pubdate><volume>82</volume><issue>5</issue><fpage>749</fpage><lpage>757</lpage><xrefbib><pubid idtype="doi">10.1016/S0165-1684(02)00140-8</pubid></xrefbib></bibl><bibl id="B42"><title><p>A theory for multiresolution signal decomposition: the wavelet representation</p></title><aug><au><snm>Mallat</snm><fnm>SG</fnm></au></aug><source>Pattern Anal and Machine Intelligence, IEEE Trans on</source><pubdate>1989</pubdate><volume>11</volume><issue>7</issue><fpage>674</fpage><lpage>693</lpage><xrefbib><pubid idtype="doi">10.1109/34.192463</pubid></xrefbib></bibl><bibl id="B43"><title><p>Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching</p></title><aug><au><snm>Du</snm><fnm>P</fnm></au><au><snm>Kibbe</snm><fnm>WA</fnm></au><au><snm>Lin</snm><fnm>SM</fnm></au></aug><source>Bioinformatics</source><pubdate>2006</pubdate><volume>22</volume><issue>17</issue><fpage>2059</fpage><lpage>2065</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btl355</pubid><pubid idtype="pmpid" link="fulltext">16820428</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Mass spectrometry data processing using zero-crossing lines in multi-scale of Gaussian derivative wavelet</p></title><aug><au><snm>Nguyen</snm><fnm>N</fnm></au><au><snm>Huang</snm><fnm>H</fnm></au><au><snm>Oraintara</snm><fnm>S</fnm></au><au><snm>Vo</snm><fnm>A</fnm></au></aug><source>Bioinformatics</source><pubdate>2010</pubdate><volume>26</volume><issue>18</issue><fpage>659</fpage><lpage>665</lpage><xrefbib><pubid idtype="doi">10.1093/bioinformatics/btq397</pubid></xrefbib></bibl><bibl id="B45"><title><p>Theory of Edge Detection</p></title><aug><au><snm>Marr</snm><fnm>D</fnm></au><au><snm>Hildreth</snm><fnm>E</fnm></au></aug><source>Proc R Soc London Ser B</source><pubdate>1980</pubdate><volume>207</volume><fpage>187</fpage><lpage>217</lpage><xrefbib><pubid idtype="doi">10.1098/rspb.1980.0020</pubid></xrefbib></bibl><bibl id="B46"><title><p>Wavelets and signal processing</p></title><aug><au><snm>Rioul</snm><fnm>O</fnm></au><au><snm>Vetterli</snm><fnm>M</fnm></au></aug><source>IEEE Signal Process Mag</source><pubdate>1991</pubdate><volume>8</volume><issue>4</issue><fpage>14</fpage><lpage>38</lpage></bibl></refgrp>
	</bm>
</art>