<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>1471-2105-8-S6-S3</ui>
	<ji>1471-2105</ji>
	<fm>
		<dochead>Review</dochead>
		<bibl>
			<title>
				<p>Computational analyses of eukaryotic promoters</p>
			</title>
			<aug>
				<au id="A1" ca="yes">
					<snm>Zhang</snm>
					<mi>Q</mi>
					<fnm>Michael</fnm>
					<insr iid="I1"/>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA</p>
				</ins>
			</insg>
			<source>BMC Bioinformatics</source>
			<supplement>
				<title>
					<p>Otto Warburg International Summer School and Workshop on Networks and Regulation</p>
				</title>
				<editor>Peter F Arndt and Martin Vingron</editor>
				<note>Reviews</note>
			</supplement>
			<issn>1471-2105</issn>
			<pubdate>2007</pubdate>
			<volume>8</volume>
			<issue>Suppl 6</issue>
			<fpage>S3</fpage>
			<url>http://www.biomedcentral.com/1471-2105/8/S6/S3</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">17903284</pubid><pubid idtype="doi">10.1186/1471-2105-8-S6-S3</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<pub>
				<date>
					<day>27</day>
					<month>9</month>
					<year>2007</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2007</year>
			<collab>Zhang; licensee BioMed Central Ltd.</collab>
			<note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<p>Computational analysis of eukaryotic promoters is one of the most difficult problems in computational genomics and is essential for understanding gene expression profiles and reverse-engineering gene regulation network circuits. Here I give a basic introduction of the problem and recent update on both experimental and computational approaches. More details may be found in the extended references. This review is based on a summer lecture given at Max Planck Institute at Berlin in 2005.</p>
			</sec>
		</abs>
	</fm>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>The promoter of a gene is defined as the <it>cis</it>-regulatory DNA region at a specific location (the transcription start site, or TSS) that can drive the transcription of its target gene in response to environmental signals. Computationally, it is often conveniently divided into three regions: the core-promoter (~80&#8211;100 bp surrounding the TSS), the proximal-promoter (~250&#8211;1000 bp upstream of the core-promoter) and the distal-promoter (further upstream, normally excluding enhancer or other regulatory regions whose influences are position/orientation independent). The core-promoter is minimally required for the assembly of the preinitiation complex (PIC) and can drive a reporter gene at a basal level from the TSS. The proximal-promoter often contains major <it>cis</it>-regulatory elements for driving activated reporter gene expression with some tissue-specificity. However, the distal-promoter together with distal enhancers/silencers and insulators are often necessary for accurately reproducing the endogenous gene expression patterns <it>in vivo</it>, especially for early developmental genes. Distal <it>cis</it>-regulatory elements also occur in the introns and the downstream regions, and therefore computational studies of these regions have been difficult and often limited to only the conserved sub-regions and/or regions in which functional <it>cis</it>-regulatory elements form clusters. Most of our work has been focused on 1 kb proximal-promoters (defined as -700 to +300 with respect to the TSS). We have shown that DNA motifs in this region can predict tissue-specific gene expression <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Computational promoter analyses usually face two related problems: the localization of the core-promoter (TSS prediction) and the identification of <it>cis</it>-regulatory elements (motif discovery). Basic computational methods have been reviewed previously <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, here I emphasize some recent developments.</p>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<sec>
				<st>
					<p>New experimental developments</p>
				</st>
				<p>One recent surprise, revealed after more detailed biochemistry studies of promoter activation, is that people have underestimated the diversity and complexity of core-promoter architecture and regulation. I refer readers to the recent comprehensive review on "the general transcription machinery and general cofactors" <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
				<p>Although several core-promoter elements have been identified (Figure <figr fid="F1">1</figr>), with each element being short and degenerate and not every element occurring in a given core-promoter, the combinatorial regulatory code within core-promoters remains elusive. Their predictive value has also been very limited, despite some weak statistical correlations among certain subsets of the elements which were uncovered recently <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Further biochemical characterization of core-promoter binding factors under various functional conditions is necessary before a reliable computational classification of core-promoters becomes possible. An example of the type of question that must be answered is how CK2 phosphorylation of TAF1 may switch TFIID binding specificity from a DCE to DPE function <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> (Figure <figr fid="F1">1</figr>).</p>
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>Regulation of core-promoter elements by TFIID and TFIIB (adapted from Fig. 2 of Thomas &amp; Chiang 2006 [3])</p>
					</caption>
					<text>
						<p>Regulation of core-promoter elements by TFIID and TFIIB (adapted from Fig. 2 of Thomas &amp; Chiang 2006 [3]).</p>
					</text>
					<graphic file="1471-2105-8-S6-S3-1"/>
				</fig>
				<p>The most significant advance comes from the new sequencing and microarray technologies that, for the first time, can provide ample and accurate 5'UTR sequence and core-promoter/TFBS location data. In particular, large-scale 5'RACE technology at Tokyo University and 5'CAGE tag technology at Riken have provided DBTSS (Database of Transcriptional Start Sites, mainly human) <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> and Fantom (Functional Annotation of Mouse) <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp> with an order of magnitude more promoter sequences derived from full-length 5'UTRs/cDNAs than were present in the traditional part of EPD (Eukaryotic Promoter Database) <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. These sequences serve as the best training data for all current computational studies in promoter recognition. Many of the surprising new statistical features of the core-promoter have come from the recent analyses of such data (see <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> for a nice updated summary). One particularly interesting point made in this reference is that "Contrary to expectations, only a small fraction of RNAP II promoters appear to contain a TATA box. In contrast, a large proportion of RNAP II promoters in metazoan genomes appear to contain an INR element. Finally, about 25% of human promoters appear to lack known core promoter elements. This may point to the existence of additional core promoter sequence elements that remain to be identified and functionally characterized.". More mammalian promoter statistics are discussed in <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> which presents a comprehensive study of Fantom3 data.</p>
				<p>In addition to sequence data, ChIP-chip technologies (e.g. see review <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>) provide genome-wide <it>in vivo </it>mapping of protein-DNA binding regions which provide the best experimental data for all current computational studies in <it>cis</it>-regulatory motif discovery. Most of the important data for promoter prediction has come from the ChIP-chip localization of PIC at active core-promoters in the whole genome at sub-100 bp resolution <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. When more such data are produced for different tissues/cells and development stages, it will transform the field of computational promoter prediction and genome regulation networks (further discussed below).</p>
			</sec>
			<sec>
				<st>
					<p>Advances in motif discovery</p>
				</st>
				<p>The traditional approach for finding <it>cis</it>-elements is to collect a set of (target gene) promoter sequences believed to be enriched by some common TFBS motifs. They may either be collected from the literature or from systematic experiments (such as SELEX, <it>etc</it>.). There are many <it>de novo </it>TFBS motif finding algorithms available. For a recent review on computational TFBS finding methods, see <it>e.g</it>., <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. For a recent benchmark of some popular motif finders, see <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. In addition to the classical alignment-based motif finding algorithms, such as CONSENSUS <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, EM <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>/MEME <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> and the Gibbs sampler <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> which have been reviewed previously <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, most modern approaches have tried to extend either to the discovery of motif combinations (called <it>cis</it>-regulatory modules or CRMs), the use of evolutionary conservation information (with either phylogenetic footprinting or shadowing approaches), or a combination of both approaches. One can also increase specificity by incorporating structural information, for example, if the protein binds as a homodimer, one could restrict the search to only the palindromic motifs.</p>
				<p>More powerful and flexible motif finders can take the advantage of a separate sequence set called a background set, serving as a negative control. The goal is to search only for motifs that are most discriminating, <it>i.e</it>. only those enriched in the foreground set relative to the background set. Examples of such motif finders, called discriminant motif finders, include: ANN-Spec <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, DMOTIFS <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, DWE <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> and DME <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. DME is particularly novel and powerful; it can enumerate all possible (discretized) weight matrices above user-defined minimum information content. A newer version (called DME-B <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>) of DME can optimize the classification ability of the identified motifs based on whether or not the sequence contains at least one occurrence of the motif. This technology has been used to systematically catalog of mammalian tissue-specific TFBS motifs <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>.</p>
				<p>The most powerful generalization of this idea would be to turn motif finding into a feature selection problem in regression analysis by asking what is the set of features <b><it>X </it></b>(some functions of the motifs or CRMs) that can best explain the microarray data <it>Y </it>(<it>e.g</it>. expression scores). This is very similar to the general problem in genetics: <it>Y </it>represents the phenotype (mRNA expression) and <b><it>X </it></b>represents the genotype (promoter DNA elements). One would like to learn a model (function <it>f</it>) so that <it>f(</it><b><it>X</it></b><it>) </it>can best predict <it>Y</it>. When "best" is measured by the average squared error based on the distribution <it>Pr(</it><b><it>X</it></b><it>, Y)</it>, the solution is the conditional expectation (also known as the regression function, see, <it>e.g</it>. <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>): <it>f(</it><b><it>X</it></b><it>) = <it>E (Y</it>| </it><b><it>X </it></b>= <b><it>x</it></b><it>)</it>. REDUCE was the first successful motif selection algorithm based on linear regression <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. It has now been generalized to include cross-interaction terms <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, to use nucleotide weight matrices discovered by MDscan (Motif Regressor <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>), to apply logistic regression <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> and to a nonlinear model based on regression trees called MARSMotif <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>. The matrix version of REDUCE (called MatrixREDUCE <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>) and of MARSMotif (called MARSMotif-M <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>) are becoming important motif discovery tools for mammalian promoter analyses. Almost all the tools developed for analyzing expression microarray data can also be easily applied to the analysis of localization data, such as ChIP-chip data. Although ChIP-chip is a global measurement for <it>in vivo </it>binding of proteins to chromatin DNA and hence is potentially capable of revealing direct target genes (most targets identified in expression arrays are not direct targets); due to the current resolution and to non-specific or non-functional cross-links, not all putative targets are functional or possess functional <it>cis</it>-elements. ChIP-chip data have also been used to further refine motifs found by expression data (<it>e.g</it>. using a boosting approach <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>).</p>
			</sec>
			<sec>
				<st>
					<p>Better promoter prediction</p>
				</st>
				<p>A number of statistical and machine learning approaches that can discriminate between the known promoter and some non-promoter sequences have been applied to TSS prediction. In a recent large scale comparison <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>, eight prediction algorithms were compared. Among the most successful algorithms were Eponine <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> (which trains Relevant Vector Machines to recognize a TATA-box motif in a G+C rich domain and uses Monte Carlo sampling), McPromoter <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> (based on Neural Networks, interpolated Markov models and physical properties of promoter regions), FirstEF <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> (based on quadratic discriminant analysis of promoters, first exons and the first donor site) and DragonGSF <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B39">39</abbr></abbrgrp> (based on artificial neural networks). However, DragonGSF is not publicly available and uses additional binding site information based on the TRANSFAC database <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, exploiting specific information that is typically not available for unknown promoters.</p>
				<p>Two new de novo promoter prediction algorithms have emerged that further improve in accuracy. One is ARTS <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, which is based on Support Vector Machines with multiple sophisticated sequence kernels. It claims to find about 35% true positives at a false positive rate of 1/1000, where the above mentioned methods find only about half as many true positives (18%). ARTS uses only downstream genic sequences as the negative set (non-promoters), and therefore it may get more false-positives from upstream non-genic regions. Furthermore, ARTS does not distinquish if a promoter is CpG-island related or not and it is not clear how ARTS may peform on non-CpG-island related promoters. Another novel TSS prediction algorithm is CoreBoost <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> which is based on simple LogitBoosting with stumps. It has a false positive rate of 1/5000 at the same sensitivity level (Zhao, personal communication). CoreBoost uses both immediate upstream and downstream fragments as negative sets and trains separate classifiers for each before combining the two. The training sample is 300 bp fragments (-250, +50), hence it is more localized than ARTS which has training sample of 2 kb fragments (-1 kb, +1 kb). The ideal application of TSS prediction algorithms is to combine them with gene prediction algorithms <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> and/or with the ChIP-chip PIC mapping data <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Future direction: epigenetics and chromatin states</p>
			</st>
			<p>Although much progress has been made in promoter prediction and <it>cis</it>-regulatory motif discovery, false-positives are still the main problem when scanning through the whole genome. Fundamentally this is because the information about chromatin structure is still missing in all our models! Protein-DNA binding specificity is partly determined by the energetics and partly determined by "entropy", which depends on how much of the genome is accessible to the DNA binding protein <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> Without knowing which regions of chromatin are open or closed (and to what degree), researchers have to assume the whole genome is accessible for binding, which is obviously wrong and will lead to more false positives (and false negatives because of the extra noise). This is clearly shown by recent genome-wide ChIP-chip data as well as DNase I Hypersensitivity mapping data. There is a necessity for higher order prediction algorithms that are capable of predicting chromatin states based upon, perhaps, genome-wide epigenetic measurements, CpG-islands and repeat characteristics in addition to genomic sequences. It is fortunate that such kinds of data are rapidly being generated <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp> and the corresponding analysis tools <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp> are also coming along. The days of more realistic dynamic modeling of chromatin structure and its relation to expression and regulation are finally coming.</p>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>I would like to thank Dr. Dustin Schones for his careful proof-reading of the manuscript. Work in my lab is partially supported by grants from NSF, NIH and Dart NeuroGenomics Alliance.</p>
				<p>This article has been published as part of <it>BMC Bioinformatics</it> Volume 8 Supplement 6, 2007: Otto Warburg International Summer School and Workshop on Networks and Regulation. The full contents of the supplement are available online at <url>http://www.biomedcentral.com/1471-2105/8?issue=S6</url></p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>DNA motifs in human and mouse proximal promoters predict tissue-specific expression</p>
				</title>
				<aug>
					<au>
						<snm>Smith</snm>
						<fnm>AD</fnm>
					</au>
					<au>
						<snm>Sumazin</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Xuan</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2006</pubdate>
				<volume>103</volume>
				<fpage>6275</fpage>
				<lpage>6280</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1458868</pubid>
						<pubid idtype="pmpid" link="fulltext">16606849</pubid>
						<pubid idtype="doi">10.1073/pnas.0508169103</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Computational Methods for Promoter Recognition</p>
				</title>
				<aug>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
				</aug>
				<publisher>MIT Press, Cambridge, Massachusetts</publisher>
				<editor>Jiang T, Xu Y, Zhang MQ</editor>
				<fpage>249</fpage>
				<lpage>268</lpage>
			</bibl>
			<bibl id="B3">
				<title>
					<p>The general transcription machinery and general cofactors</p>
				</title>
				<aug>
					<au>
						<snm>Thomas</snm>
						<fnm>MC</fnm>
					</au>
					<au>
						<snm>Chiang</snm>
						<fnm>CM</fnm>
					</au>
				</aug>
				<source>Crit Rev Biochem Mol Biol</source>
				<pubdate>2006</pubdate>
				<volume>41</volume>
				<fpage>105</fpage>
				<lpage>78</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1080/10409230600648736</pubid>
						<pubid idtype="pmpid">16858867</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Genome-wide analysis of core promoter elements from conserved human and mouse orthologous pairs</p>
				</title>
				<aug>
					<au>
						<snm>Jin</snm>
						<fnm>VX</fnm>
					</au>
					<au>
						<snm>Singer</snm>
						<fnm>GA</fnm>
					</au>
					<au>
						<snm>Agosto-Perez</snm>
						<fnm>FJ</fnm>
					</au>
					<au>
						<snm>Liyanarachchi</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Davuluri</snm>
						<fnm>RV</fnm>
					</au>
				</aug>
				<source>BMC Bioinformatics</source>
				<pubdate>2006</pubdate>
				<volume>7</volume>
				<fpage>114</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1475891</pubid>
						<pubid idtype="pmpid" link="fulltext">16522199</pubid>
						<pubid idtype="doi">10.1186/1471-2105-7-114</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>The features of Drosophila core promoters revealed by statistical analysis</p>
				</title>
				<aug>
					<au>
						<snm>Gershenzon</snm>
						<fnm>NI</fnm>
					</au>
					<au>
						<snm>Trifonov</snm>
						<fnm>EN</fnm>
					</au>
					<au>
						<snm>Ioshikhes</snm>
						<fnm>IP</fnm>
					</au>
				</aug>
				<source>BMC Genomics</source>
				<pubdate>2006</pubdate>
				<volume>7</volume>
				<fpage>161</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1538597</pubid>
						<pubid idtype="pmpid" link="fulltext">16790048</pubid>
						<pubid idtype="doi">10.1186/1471-2164-7-161</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Functional characterization of core promoter elements: DPE-specific transcription requires the protein kinase CK2 and the PC4 coactivator</p>
				</title>
				<aug>
					<au>
						<snm>Lewis</snm>
						<fnm>BA</fnm>
					</au>
					<au>
						<snm>Sims</snm>
						<fnm>RJ</fnm>
						<suf>3rd</suf>
					</au>
					<au>
						<snm>Lane</snm>
						<fnm>WS</fnm>
					</au>
					<au>
						<snm>Reinberg</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Mol Cell</source>
				<pubdate>2005</pubdate>
				<volume>18</volume>
				<fpage>471</fpage>
				<lpage>481</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.molcel.2005.04.005</pubid>
						<pubid idtype="pmpid" link="fulltext">15893730</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>DBTSS, DataBase of Transcriptional Start Sites: Progress Report 2004</p>
				</title>
				<aug>
					<au>
						<snm>Suzuki</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Yamashita</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Sugano</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Nakai</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2004</pubdate>
				<volume>32</volume>
				<fpage>D78</fpage>
				<lpage>81</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">308810</pubid>
						<pubid idtype="pmpid" link="fulltext">14681363</pubid>
						<pubid idtype="doi">10.1093/nar/gkh076</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Transcript annotation in FANTOM3: mouse gene catalog based on physical cDNAs</p>
				</title>
				<aug>
					<au>
						<snm>Maeda</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Kasukawa</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Oyama</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Gough</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Frith</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Engstrom</snm>
						<fnm>PG</fnm>
					</au>
					<au>
						<snm>Lenhard</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Aturaliya</snm>
						<fnm>RN</fnm>
					</au>
					<au>
						<snm>Batalov</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Beisel</snm>
						<fnm>KW</fnm>
					</au>
					<au>
						<snm>Bult</snm>
						<fnm>CJ</fnm>
					</au>
					<au>
						<snm>Fletcher</snm>
						<fnm>CF</fnm>
					</au>
					<au>
						<snm>Forrest</snm>
						<fnm>AR</fnm>
					</au>
					<au>
						<snm>Furuno</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Hill</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Itoh</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Kanamori-Katayama</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Katayama</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Katoh</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Kawashima</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Quackenbush</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Ravasi</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Ring</snm>
						<fnm>BZ</fnm>
					</au>
					<au>
						<snm>Shibata</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Sugiura</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Takenaka</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Teasdale</snm>
						<fnm>RD</fnm>
					</au>
					<au>
						<snm>Wells</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Zhu</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Kai</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Kawai</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Hume</snm>
						<fnm>DA</fnm>
					</au>
					<au>
						<snm>Carninci</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Hayashizaki</snm>
						<fnm>Y</fnm>
					</au>
				</aug>
				<source>PLoS Genet</source>
				<pubdate>2006</pubdate>
				<volume>2</volume>
				<fpage>e62</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1449903</pubid>
						<pubid idtype="pmpid" link="fulltext">16683036</pubid>
						<pubid idtype="doi">10.1371/journal.pgen.0020062</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Genomewide analysis of mammalian promoter architecture and evolution</p>
				</title>
				<aug>
					<au>
						<snm>Carninci</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Sandelin</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Lenhard</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Katayama</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Shimokawa</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Ponjavic</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Semple</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Taylor</snm>
						<fnm>MS</fnm>
					</au>
					<au>
						<snm>Engstrom</snm>
						<fnm>PG</fnm>
					</au>
					<au>
						<snm>Frith</snm>
						<fnm>MC</fnm>
					</au>
					<au>
						<snm>Forrest</snm>
						<fnm>AR</fnm>
					</au>
					<au>
						<snm>Alkema</snm>
						<fnm>WB</fnm>
					</au>
					<au>
						<snm>Tan</snm>
						<fnm>SL</fnm>
					</au>
					<au>
						<snm>Plessy</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Kodzius</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Ravasi</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Kasukawa</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Fukuda</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Kanamori-Katayama</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Kitazume</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Kawaji</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Kai</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Nakamura</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Konno</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Nakano</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Mottagui-Tabar</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Arner</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Chesi</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Gustincich</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Persichetti</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Suzuki</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Grimmond</snm>
						<fnm>SM</fnm>
					</au>
					<au>
						<snm>Wells</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Orlando</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Wahlestedt</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>ET</fnm>
					</au>
					<au>
						<snm>Harbers</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Kawai</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Bajic</snm>
						<fnm>VB</fnm>
					</au>
					<au>
						<snm>Hume</snm>
						<fnm>DA</fnm>
					</au>
					<au>
						<snm>Hayashizaki</snm>
						<fnm>Y</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2006</pubdate>
				<volume>38</volume>
				<fpage>626</fpage>
				<lpage>635</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/ng1789</pubid>
						<pubid idtype="pmpid" link="fulltext">16645617</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>EPD in its twentieth year: towards complete promoter coverage of selected model organisms</p>
				</title>
				<aug>
					<au>
						<snm>Schmid</snm>
						<fnm>CD</fnm>
					</au>
					<au>
						<snm>Perier</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Praz</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Bucher</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2006</pubdate>
				<volume>34</volume>
				<fpage>D82</fpage>
				<lpage>5</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1347508</pubid>
						<pubid idtype="pmpid" link="fulltext">16381980</pubid>
						<pubid idtype="doi">10.1093/nar/gkj146</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Core promoter-selective RNA polymerase II transcription</p>
				</title>
				<aug>
					<au>
						<snm>Gross</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Oelgeschlager</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Biochem Soc Symp</source>
				<pubdate>2006</pubdate>
				<volume>73</volume>
				<fpage>225</fpage>
				<lpage>36</lpage>
				<xrefbib>
					<pubid idtype="pmpid">16626302</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Mice and men: their promoter properties</p>
				</title>
				<aug>
					<au>
						<snm>Bajic</snm>
						<fnm>VB</fnm>
					</au>
					<au>
						<snm>Tan</snm>
						<fnm>SL</fnm>
					</au>
					<au>
						<snm>Christoffels</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Schonbach</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Lipovich</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Yang</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Hofmann</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Kruger</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Hide</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Kai</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Kawai</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Hume</snm>
						<fnm>DA</fnm>
					</au>
					<au>
						<snm>Carninci</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Hayashizaki</snm>
						<fnm>Y</fnm>
					</au>
				</aug>
				<source>PLoS Genet</source>
				<pubdate>2006</pubdate>
				<volume>2</volume>
				<fpage>e54</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1449896</pubid>
						<pubid idtype="pmpid" link="fulltext">16683032</pubid>
						<pubid idtype="doi">10.1371/journal.pgen.0020054</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>Genome-Wide Analysis of Protein-DNA Interactions</p>
				</title>
				<aug>
					<au>
						<snm>Kim</snm>
						<fnm>TH</fnm>
					</au>
					<au>
						<snm>Ren</snm>
						<fnm>B</fnm>
					</au>
				</aug>
				<source>Annu Rev Genomics Hum Genet</source>
				<pubdate>2006</pubdate>
				<volume>7</volume>
				<fpage>81</fpage>
				<lpage>102</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1146/annurev.genom.7.080505.115634</pubid>
						<pubid idtype="pmpid" link="fulltext">16722805</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>A high-resolution map of active promoters in the human genome</p>
				</title>
				<aug>
					<au>
						<snm>Kim</snm>
						<fnm>TH</fnm>
					</au>
					<au>
						<snm>Barrera</snm>
						<fnm>LO</fnm>
					</au>
					<au>
						<snm>Zheng</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Qu</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Singer</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Richmond</snm>
						<fnm>TA</fnm>
					</au>
					<au>
						<snm>Wu</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Green</snm>
						<fnm>RD</fnm>
					</au>
					<au>
						<snm>Ren</snm>
						<fnm>B</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2005</pubdate>
				<volume>436</volume>
				<fpage>876</fpage>
				<lpage>80</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature03877</pubid>
						<pubid idtype="pmpid" link="fulltext">15988478</pubid>
						<pubid idtype="pmcid">1895599</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Applied bioinformatics for the identification of regulatory elements</p>
				</title>
				<aug>
					<au>
						<snm>Wasserman</snm>
						<fnm>WW</fnm>
					</au>
					<au>
						<snm>Sandelin</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Nat Rev Genet</source>
				<pubdate>2004</pubdate>
				<volume>5</volume>
				<fpage>276</fpage>
				<lpage>87</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrg1315</pubid>
						<pubid idtype="pmpid" link="fulltext">15131651</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Assessing computational tools for the discovery of transcription factor binding sites</p>
				</title>
				<aug>
					<au>
						<snm>Tompa</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Bailey</snm>
						<fnm>TL</fnm>
					</au>
					<au>
						<snm>Church</snm>
						<fnm>GM</fnm>
					</au>
					<au>
						<snm>De Moor</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Eskin</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Favorov</snm>
						<fnm>AV</fnm>
					</au>
					<au>
						<snm>Frith</snm>
						<fnm>MC</fnm>
					</au>
					<au>
						<snm>Fu</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Kent</snm>
						<fnm>WJ</fnm>
					</au>
					<au>
						<snm>Makeev</snm>
						<fnm>VJ</fnm>
					</au>
					<au>
						<snm>Mironov</snm>
						<fnm>AA</fnm>
					</au>
					<au>
						<snm>Noble</snm>
						<fnm>WS</fnm>
					</au>
					<au>
						<snm>Pavesi</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Pesole</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Regnier</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Simonis</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Sinha</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Thijs</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>van Helden</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Vandenbogaert</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Weng</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Workman</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Ye</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Zhu</snm>
						<fnm>Z</fnm>
					</au>
				</aug>
				<source>Nat Biotechnol</source>
				<pubdate>2005</pubdate>
				<volume>23</volume>
				<fpage>137</fpage>
				<lpage>44</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nbt1053</pubid>
						<pubid idtype="pmpid" link="fulltext">15637633</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Identification of consensus patterns in unaligned DNA sequences known to be functionally related</p>
				</title>
				<aug>
					<au>
						<snm>Hertz</snm>
						<fnm>GZ</fnm>
					</au>
					<au>
						<snm>Hartzell</snm>
						<fnm>GW</fnm>
						<suf>3rd</suf>
					</au>
					<au>
						<snm>Stormo</snm>
						<fnm>GD</fnm>
					</au>
				</aug>
				<source>Comput Appl Biosci</source>
				<pubdate>1990</pubdate>
				<volume>6</volume>
				<fpage>81</fpage>
				<lpage>92</lpage>
				<xrefbib>
					<pubid idtype="pmpid">2193692</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>An expectation maximization (EM) algorithm for the identification and characterization of common sites in unaligned biopolymer sequences</p>
				</title>
				<aug>
					<au>
						<snm>Lawrence</snm>
						<fnm>CE</fnm>
					</au>
					<au>
						<snm>Reilly</snm>
						<fnm>AA</fnm>
					</au>
				</aug>
				<source>Proteins</source>
				<pubdate>1990</pubdate>
				<volume>7</volume>
				<fpage>41</fpage>
				<lpage>51</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/prot.340070105</pubid>
						<pubid idtype="pmpid">2184437</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>The value of prior knowledge in discovering motifs with MEME</p>
				</title>
				<aug>
					<au>
						<snm>Bailey</snm>
						<fnm>TL</fnm>
					</au>
					<au>
						<snm>Elkan</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Proceedings of the International Conference on Intelligent Systems for Molecular Biology</source>
				<pubdate>1995</pubdate>
				<volume>3</volume>
				<fpage>21</fpage>
				<lpage>9</lpage>
			</bibl>
			<bibl id="B20">
				<title>
					<p>Detecting subtle sequence signals: A Gibbs sampling strategy for multiple alignment</p>
				</title>
				<aug>
					<au>
						<snm>Lawrence</snm>
						<fnm>CE</fnm>
					</au>
					<au>
						<snm>Altschul</snm>
						<fnm>SF</fnm>
					</au>
					<au>
						<snm>Boguski</snm>
						<fnm>MS</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>Neuwald</snm>
						<fnm>AF</fnm>
					</au>
					<au>
						<snm>Wootton</snm>
						<fnm>JC</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>1993</pubdate>
				<volume>262</volume>
				<fpage>208</fpage>
				<lpage>14</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.8211139</pubid>
						<pubid idtype="pmpid" link="fulltext">8211139</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Computational Prediction of Eukaryotic Protein-Coding Genes</p>
				</title>
				<aug>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
				</aug>
				<source>Nat Rev Genet</source>
				<pubdate>2002</pubdate>
				<volume>3</volume>
				<issue>9</issue>
				<fpage>698</fpage>
				<lpage>709</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrg890</pubid>
						<pubid idtype="pmpid" link="fulltext">12209144</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>ANN-Spec: A method for discovering transcription factor binding sites with improved specificity</p>
				</title>
				<aug>
					<au>
						<snm>Workman</snm>
						<fnm>CT</fnm>
					</au>
					<au>
						<snm>Stormo</snm>
						<fnm>GD</fnm>
					</au>
				</aug>
				<source>Pacific Symposium on Biocomputing</source>
				<pubdate>2002</pubdate>
				<fpage>467</fpage>
				<lpage>78</lpage>
			</bibl>
			<bibl id="B23">
				<title>
					<p>Discriminative motifs</p>
				</title>
				<aug>
					<au>
						<snm>Sinha</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>J Comput Biol</source>
				<pubdate>2003</pubdate>
				<volume>10</volume>
				<fpage>599</fpage>
				<lpage>615</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1089/10665270360688219</pubid>
						<pubid idtype="pmpid" link="fulltext">12935347</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>DWE: Discriminating word enumerator</p>
				</title>
				<aug>
					<au>
						<snm>Sumazin</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Chen</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Hata</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>AD</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<fpage>31</fpage>
				<lpage>8</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bth471</pubid>
						<pubid idtype="pmpid" link="fulltext">15333453</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>Identifying tissue-selective transcription factor binding sites in vertebrate promoters</p>
				</title>
				<aug>
					<au>
						<snm>Smith</snm>
						<fnm>AD</fnm>
					</au>
					<au>
						<snm>Sumazin</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2005</pubdate>
				<volume>102</volume>
				<fpage>1560</fpage>
				<lpage>5</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">547828</pubid>
						<pubid idtype="pmpid" link="fulltext">15668401</pubid>
						<pubid idtype="doi">10.1073/pnas.0406123102</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B26">
				<title>
					<p>Mining ChIP-chip data for transcription factor and cofactor binding sites</p>
				</title>
				<aug>
					<au>
						<snm>Smith</snm>
						<fnm>AD</fnm>
					</au>
					<au>
						<snm>Sumazin</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Das</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>Suppl 1</issue>
				<fpage>i403</fpage>
				<lpage>12</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti1043</pubid>
						<pubid idtype="pmpid" link="fulltext">15961485</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>Computational prediction of novel components of lung transcriptional networks</p>
				</title>
				<aug>
					<au>
						<snm>Martinez</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>AD</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
					<au>
						<snm>Harrod</snm>
						<fnm>KS</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2007</pubdate>
				<volume>23</volume>
				<fpage>21</fpage>
				<lpage>29</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/btl531</pubid>
						<pubid idtype="pmpid" link="fulltext">17050569</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Tissue-specific regulatory elements in mammalian promoters</p>
				</title>
				<aug>
					<au>
						<snm>Smith</snm>
						<fnm>AD</fnm>
					</au>
					<au>
						<snm>Sumazin</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
				</aug>
				<source>Mol Syst Biol</source>
				<pubdate>2007</pubdate>
				<volume>3</volume>
				<fpage>73</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1800356</pubid>
						<pubid idtype="pmpid" link="fulltext">17224917</pubid>
						<pubid idtype="doi">10.1038/msb4100114</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<aug>
					<au>
						<snm>Hastie</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Tibshirani</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Friedman</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>The Elements of Statistical Learning: Data Mining, Inference, and Prediction</source>
				<publisher>Springer, New York</publisher>
				<pubdate>2001</pubdate>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Regulatory element detection using correlation with expression</p>
				</title>
				<aug>
					<au>
						<snm>Bussemaker</snm>
						<fnm>HJ</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Siggia</snm>
						<fnm>ED</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2001</pubdate>
				<volume>27</volume>
				<fpage>167</fpage>
				<lpage>71</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/84792</pubid>
						<pubid idtype="pmpid" link="fulltext">11175784</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>Identification of regulatory elements using a feature selection method</p>
				</title>
				<aug>
					<au>
						<snm>Keles</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>van der Laan</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Eisen</snm>
						<fnm>MB</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2002</pubdate>
				<volume>18</volume>
				<fpage>1167</fpage>
				<lpage>75</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/18.9.1167</pubid>
						<pubid idtype="pmpid" link="fulltext">12217908</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Integrating regulatory motif discovery and genome-wide expression analysis</p>
				</title>
				<aug>
					<au>
						<snm>Conlon</snm>
						<fnm>EM</fnm>
					</au>
					<au>
						<snm>XS</snm>
						<fnm>Liu</fnm>
					</au>
					<au>
						<snm>JD</snm>
						<fnm>Lieb</fnm>
					</au>
					<au>
						<snm>JS</snm>
						<fnm>Liu</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2003</pubdate>
				<volume>100</volume>
				<fpage>3339</fpage>
				<lpage>44</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">152294</pubid>
						<pubid idtype="pmpid" link="fulltext">12626739</pubid>
						<pubid idtype="doi">10.1073/pnas.0630591100</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>Regulatory motif finding by logic regression</p>
				</title>
				<aug>
					<au>
						<snm>Keles</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>van der Laan</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Vulpe</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<fpage>2799</fpage>
				<lpage>811</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bth333</pubid>
						<pubid idtype="pmpid" link="fulltext">15166027</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>Multivariate adaptive regression splines</p>
				</title>
				<aug>
					<au>
						<snm>Friedman</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Ann Stat</source>
				<pubdate>1991</pubdate>
				<volume>19</volume>
				<fpage>1</fpage>
				<lpage>141</lpage>
			</bibl>
			<bibl id="B35">
				<title>
					<p>Interacting models of cooperative gene regulation</p>
				</title>
				<aug>
					<au>
						<snm>Das</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Banerjee</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2004</pubdate>
				<volume>101</volume>
				<fpage>16234</fpage>
				<lpage>9</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">528978</pubid>
						<pubid idtype="pmpid" link="fulltext">15534222</pubid>
						<pubid idtype="doi">10.1073/pnas.0407365101</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B36">
				<title>
					<p>Statistical mechanical modeling of genome-wide transcription factor occupancy data by MatrixREDUCE</p>
				</title>
				<aug>
					<au>
						<snm>Foat</snm>
						<fnm>BC</fnm>
					</au>
					<au>
						<snm>Morozov</snm>
						<fnm>AV</fnm>
					</au>
					<au>
						<snm>Bussemaker</snm>
						<fnm>HJ</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2006</pubdate>
				<volume>22</volume>
				<fpage>e141</fpage>
				<lpage>9</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/btl223</pubid>
						<pubid idtype="pmpid" link="fulltext">16873464</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B37">
				<title>
					<p>Adaptively inferring human transcriptional subnetworks</p>
				</title>
				<aug>
					<au>
						<snm>Das</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Nahle</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
				</aug>
				<source>Mol Syst Biol</source>
				<pubdate>2006</pubdate>
				<volume>2</volume>
				<note>2006.0029. Epub Jun 6</note>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1681499</pubid>
						<pubid idtype="pmpid" link="fulltext">16760900</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B38">
				<title>
					<p>A boosting approach for motif modeling using ChIP-chip data</p>
				</title>
				<aug>
					<au>
						<snm>Hong</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>XS</fnm>
					</au>
					<au>
						<snm>Zhou</snm>
						<fnm>Q</fnm>
					</au>
					<au>
						<snm>Lu</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>Wong</snm>
						<fnm>WH</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<fpage>2636</fpage>
				<lpage>43</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti402</pubid>
						<pubid idtype="pmpid" link="fulltext">15817698</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B39">
				<title>
					<p>Promoter prediction analysis on the whole human genome</p>
				</title>
				<aug>
					<au>
						<snm>Bajic</snm>
						<fnm>VB</fnm>
					</au>
					<au>
						<snm>Tan</snm>
						<fnm>SL</fnm>
					</au>
					<au>
						<snm>Suzuki</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Sugano</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Nat Biotechnol</source>
				<pubdate>2004</pubdate>
				<volume>22</volume>
				<fpage>1467</fpage>
				<lpage>73</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nbt1032</pubid>
						<pubid idtype="pmpid" link="fulltext">15529174</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B40">
				<title>
					<p>Computational detection and location of transcription start sites in mammalian genomic DNA</p>
				</title>
				<aug>
					<au>
						<snm>Down</snm>
						<fnm>TA</fnm>
					</au>
					<au>
						<snm>Hubbard</snm>
						<fnm>TJ</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2002</pubdate>
				<volume>12</volume>
				<fpage>458</fpage>
				<lpage>61</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">155284</pubid>
						<pubid idtype="pmpid" link="fulltext">11875034</pubid>
						<pubid idtype="doi">10.1101/gr.216102</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B41">
				<title>
					<p>Computational analysis of core promoters in the Drosophila genome</p>
				</title>
				<aug>
					<au>
						<snm>Ohler</snm>
						<fnm>U</fnm>
					</au>
					<au>
						<snm>Liao</snm>
						<fnm>GC</fnm>
					</au>
					<au>
						<snm>Niemann</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Rubin</snm>
						<fnm>GM</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2002</pubdate>
				<volume>3</volume>
				<note>RESEARCH0087. Epub 2002 Dec 20</note>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">151189</pubid>
						<pubid idtype="pmpid" link="fulltext">12537576</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B42">
				<title>
					<p>Computational identification of promoters and first exons in the human genome</p>
				</title>
				<aug>
					<au>
						<snm>Davuluri</snm>
						<fnm>RV</fnm>
					</au>
					<au>
						<snm>Grosse</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2001</pubdate>
				<volume>29</volume>
				<issue>4</issue>
				<fpage>412</fpage>
				<lpage>417</lpage>
				<note>Erratum: <it>Nat Genet </it>2002, <b>32</b>(3):459.</note>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/ng780</pubid>
						<pubid idtype="pmpid" link="fulltext">11726928</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B43">
				<title>
					<p>Dragon Gene Start Finder identifies approximate locations of the 5' ends of genes</p>
				</title>
				<aug>
					<au>
						<snm>Bajic</snm>
						<fnm>VB</fnm>
					</au>
					<au>
						<snm>Seah</snm>
						<fnm>SH</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2003</pubdate>
				<volume>31</volume>
				<fpage>3560</fpage>
				<lpage>3</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">168976</pubid>
						<pubid idtype="pmpid" link="fulltext">12824365</pubid>
						<pubid idtype="doi">10.1093/nar/gkg570</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B44">
				<title>
					<p>TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes</p>
				</title>
				<aug>
					<au>
						<snm>Matys</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Kel-Margoulis</snm>
						<fnm>OV</fnm>
					</au>
					<au>
						<snm>Fricke</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Liebich</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Land</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Barre-Dirrie</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Reuter</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Chekmenev</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Krull</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Hornischer</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Voss</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Stegmaier</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Lewicki-Potapov</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Saxel</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Kel</snm>
						<fnm>AE</fnm>
					</au>
					<au>
						<snm>Wingender</snm>
						<fnm>E</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2006</pubdate>
				<volume>34</volume>
				<fpage>D108</fpage>
				<lpage>10</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1347505</pubid>
						<pubid idtype="pmpid" link="fulltext">16381825</pubid>
						<pubid idtype="doi">10.1093/nar/gkj143</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B45">
				<title>
					<p>ARTS: accurate recognition of transcription starts in human</p>
				</title>
				<aug>
					<au>
						<snm>Sonnenburg</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Zien</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Ratsch</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2006</pubdate>
				<volume>22</volume>
				<fpage>e472</fpage>
				<lpage>80</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/btl250</pubid>
						<pubid idtype="pmpid" link="fulltext">16873509</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B46">
				<title>
					<p>Boosting with stumps for predicting transcription start sites</p>
				</title>
				<aug>
					<au>
						<snm>Zhao</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Xuan</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2007</pubdate>
				<volume>8</volume>
				<issue>2</issue>
				<fpage>R17</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1186/gb-2007-8-2-r17</pubid>
						<pubid idtype="pmpid" link="fulltext">17274821</pubid>
						<pubid idtype="pmcid">1852414</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B47">
				<title>
					<p>A chromatin-mediated mechanism for specification of conditional transcription factor targets</p>
				</title>
				<aug>
					<au>
						<snm>Buck</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Lieb</snm>
						<fnm>JD</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2006</pubdate>
				<volume>38</volume>
				<fpage>1446</fpage>
				<lpage>51</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/ng1917</pubid>
						<pubid idtype="pmpid" link="fulltext">17099712</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B48">
				<title>
					<p>Genomic views of chromatin</p>
				</title>
				<aug>
					<au>
						<snm>Huebert</snm>
						<fnm>DJ</fnm>
					</au>
					<au>
						<snm>Bernstein</snm>
						<fnm>BE</fnm>
					</au>
				</aug>
				<source>Curr Opin Genet Dev</source>
				<pubdate>2005</pubdate>
				<volume>15</volume>
				<fpage>476</fpage>
				<lpage>81</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.gde.2005.08.001</pubid>
						<pubid idtype="pmpid" link="fulltext">16099159</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B49">
				<title>
					<p>Genome-scale identification of nucleosome positions in S. cerevisiae</p>
				</title>
				<aug>
					<au>
						<snm>Yuan</snm>
						<fnm>GC</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>YJ</fnm>
					</au>
					<au>
						<snm>Dion</snm>
						<fnm>MF</fnm>
					</au>
					<au>
						<snm>Slack</snm>
						<fnm>MD</fnm>
					</au>
					<au>
						<snm>Wu</snm>
						<fnm>LF</fnm>
					</au>
					<au>
						<snm>Altschuler</snm>
						<fnm>SJ</fnm>
					</au>
					<au>
						<snm>Rando</snm>
						<fnm>OJ</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2005</pubdate>
				<volume>309</volume>
				<fpage>626</fpage>
				<lpage>30</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1112178</pubid>
						<pubid idtype="pmpid" link="fulltext">15961632</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B50">
				<title>
					<p>Large-scale structure of genomic methylation patterns</p>
				</title>
				<aug>
					<au>
						<snm>Rollins</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Haghighi</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Edwards</snm>
						<fnm>JR</fnm>
					</au>
					<au>
						<snm>Das</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
					<au>
						<snm>Ju</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Bestor</snm>
						<fnm>TH</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2006</pubdate>
				<volume>16</volume>
				<fpage>157</fpage>
				<lpage>63</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1361710</pubid>
						<pubid idtype="pmpid" link="fulltext">16365381</pubid>
						<pubid idtype="doi">10.1101/gr.4362006</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B51">
				<title>
					<p>Gene Regulation by Chromatin Structure: Paradigms Established in Drosophila melanogaster</p>
				</title>
				<aug>
					<au>
						<snm>Schulze</snm>
						<fnm>SR</fnm>
					</au>
					<au>
						<snm>Wallrath</snm>
						<fnm>LL</fnm>
					</au>
				</aug>
				<source>Annu Rev Entomol</source>
				<pubdate>2007</pubdate>
				<volume>52</volume>
				<fpage>171</fpage>
				<lpage>92</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1146/annurev.ento.51.110104.151007</pubid>
						<pubid idtype="pmpid" link="fulltext">16881818</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B52">
				<title>
					<p>Chromatin and epigenetics in development: blending cellular memory with cell fate plasticity</p>
				</title>
				<aug>
					<au>
						<snm>Cavalli</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Development</source>
				<pubdate>2006</pubdate>
				<volume>133</volume>
				<fpage>2089</fpage>
				<lpage>94</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1242/dev.02402</pubid>
						<pubid idtype="pmpid" link="fulltext">16672331</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B53">
				<title>
					<p>Genome-scale mapping of DNase I sensitivity in vivo using tiling DNA microarrays</p>
				</title>
				<aug>
					<au>
						<snm>Sabo</snm>
						<fnm>PJ</fnm>
					</au>
					<au>
						<snm>Kuehn</snm>
						<fnm>MS</fnm>
					</au>
					<au>
						<snm>Thurman</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Johnson</snm>
						<fnm>BE</fnm>
					</au>
					<au>
						<snm>Johnson</snm>
						<fnm>EM</fnm>
					</au>
					<au>
						<snm>Cao</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Yu</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Rosenzweig</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Goldy</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Haydock</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Weaver</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Shafer</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Lee</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Neri</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Humbert</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Singer</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Richmond</snm>
						<fnm>TA</fnm>
					</au>
					<au>
						<snm>Dorschner</snm>
						<fnm>MO</fnm>
					</au>
					<au>
						<snm>McArthur</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Hawrylycz</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Green</snm>
						<fnm>RD</fnm>
					</au>
					<au>
						<snm>Navas</snm>
						<fnm>PA</fnm>
					</au>
					<au>
						<snm>Noble</snm>
						<fnm>WS</fnm>
					</au>
					<au>
						<snm>Stamatoyannopoulos</snm>
						<fnm>JA</fnm>
					</au>
				</aug>
				<source>Nat Methods</source>
				<pubdate>2006</pubdate>
				<volume>3</volume>
				<fpage>511</fpage>
				<lpage>8</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nmeth890</pubid>
						<pubid idtype="pmpid" link="fulltext">16791208</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B54">
				<title>
					<p>Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome</p>
				</title>
				<aug>
					<au>
						<snm>Heintzman</snm>
						<fnm>ND</fnm>
					</au>
					<au>
						<snm>Stuart</snm>
						<fnm>RK</fnm>
					</au>
					<au>
						<snm>Hon</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Fu</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Ching</snm>
						<fnm>CW</fnm>
					</au>
					<au>
						<snm>Hawkins</snm>
						<fnm>RD</fnm>
					</au>
					<au>
						<snm>Barrera</snm>
						<fnm>LO</fnm>
					</au>
					<au>
						<snm>Van Calcar</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Qu</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Ching</snm>
						<fnm>KA</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Weng</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Green</snm>
						<fnm>RD</fnm>
					</au>
					<au>
						<snm>Crawford</snm>
						<fnm>GE</fnm>
					</au>
					<au>
						<snm>Ren</snm>
						<fnm>B</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2007</pubdate>
				<volume>39</volume>
				<fpage>311</fpage>
				<lpage>318</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/ng1966</pubid>
						<pubid idtype="pmpid" link="fulltext">17277777</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B55">
				<title>
					<p>CpG island methylation in human lymphocytes is highly correlated with DNA sequence, repeats, and predicted DNA structure</p>
				</title>
				<aug>
					<au>
						<snm>Bock</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Paulsen</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Tierling</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Mikeska</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Lengauer</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Walter</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>PLoS Genet</source>
				<pubdate>2006</pubdate>
				<volume>2</volume>
				<fpage>e26</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1386721</pubid>
						<pubid idtype="pmpid" link="fulltext">16520826</pubid>
						<pubid idtype="doi">10.1371/journal.pgen.0020026</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B56">
				<title>
					<p>Computational prediction of methylation status in human genomic sequences</p>
				</title>
				<aug>
					<au>
						<snm>Das</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Dimitrova</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Xuan</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Rollins</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Haghighi</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Edwards</snm>
						<fnm>JR</fnm>
					</au>
					<au>
						<snm>Ju</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Bestor</snm>
						<fnm>TH</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>MQ</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2006</pubdate>
				<volume>103</volume>
				<fpage>10713</fpage>
				<lpage>6</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1502297</pubid>
						<pubid idtype="pmpid" link="fulltext">16818882</pubid>
						<pubid idtype="doi">10.1073/pnas.0602949103</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B57">
				<title>
					<p>A genomic code for nucleosome positioning</p>
				</title>
				<aug>
					<au>
						<snm>Segal</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Fondufe-Mittendorf</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Chen</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Thastrom</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Field</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Moore</snm>
						<fnm>IK</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>JP</fnm>
					</au>
					<au>
						<snm>Widom</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2006</pubdate>
				<volume>442</volume>
				<issue>7104</issue>
				<fpage>772</fpage>
				<lpage>8</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature04979</pubid>
						<pubid idtype="pmpid" link="fulltext">16862119</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
		</refgrp>
	</bm>
</art>
