<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>gb-2004-5-5-r37</ui>
	<ji>GBJ</ji>
	<fm>
		<dochead>Software</dochead>
		<bibl>
			<title>
				<p>Genome2D: a visualization tool for the rapid analysis of bacterial transcriptome data</p>
			</title>
			<aug>
				<au id="A1">
					<snm>Baerends</snm>
					<mi>JS</mi>
					<fnm>Richard</fnm>
					<insr iid="I1"/>
				</au>
				<au id="A2">
					<snm>Smits</snm>
					<mnm>Klaas</mnm>
					<fnm>Wiep</fnm>
					<insr iid="I1"/>
				</au>
				<au id="A3" ca="yes">
					<snm>de Jong</snm>
					<fnm>Anne</fnm>
					<insr iid="I1"/>
					<email>a.de.jong@biol.rug.nl</email>
				</au>
				<au id="A4">
					<snm>Hamoen</snm>
					<mi>W</mi>
					<fnm>Leendert</fnm>
					<insr iid="I1"/>
					<insr iid="I2"/>
				</au>
				<au id="A5">
					<snm>Kok</snm>
					<fnm>Jan</fnm>
					<insr iid="I1"/>
				</au>
				<au id="A6">
					<snm>Kuipers</snm>
					<mi>P</mi>
					<fnm>Oscar</fnm>
					<insr iid="I1"/>
					<email>o.p.kuipers@biol.rug.nl</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Molecular Genetics, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Kerklaan 30, 9751 NN Haren, The Netherlands</p>
				</ins>
				<ins id="I2">
					<p>Current address: Sir William Dunn School of Pathology, University of Oxford, South Parks Road, Oxford OX1 3RE, UK</p>
				</ins>
			</insg>
			<source>Genome Biology</source>
			<issn>1465-6906</issn>
			<pubdate>2004</pubdate>
			<volume>5</volume>
			<issue>5</issue>
			<fpage>R37</fpage>
			<url>http://genomebiology.com/2004/5/5/R37</url>
			<xrefbib>
				<pubid idtype="pmpid">15128451</pubid>
			</xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>15</day>
					<month>1</month>
					<year>2004</year>
				</date>
			</rec>
			<revrec>
				<date>
					<day>26</day>
					<month>2</month>
					<year>2004</year>
				</date>
			</revrec>
			<acc>
				<date>
					<day>11</day>
					<month>3</month>
					<year>2004</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>5</day>
					<month>4</month>
					<year>2004</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2004</year>
			<collab>Baerends et al.; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.</collab>
		</cpyrt>
		<shorttitle>
			<p>Genome2D: a visualization tool for the rapid analysis of bacterial transcriptome data</p>
		</shorttitle>
		<shortabs>
			<p>Genome2D is a software tool that enables bacterial transcriptome data to be visualized on chromosome maps.</p>
		</shortabs>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<p>Genome2D is a Windows-based software tool for visualization of bacterial transcriptome and customized datasets on linear chromosome maps constructed from annotated genome sequences. Genome2D facilitates the analysis of transcriptome data by using different color ranges to depict differences in gene-expression levels on a genome map. Such output format enables visual inspection of the transcriptome data, and will quickly reveal transcriptional units, without prior knowledge of expression level cutoff values. The compiled version of Genome2D is freely available for academic or non-profit use from <url>http://molgen.biol.rug.nl/molgen/research/molgensoftware.php</url>.</p>
			</sec>
		</abs>
	</fm>
	<meta>
		<classifications>
			<classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
		</classifications>
	</meta>
	<bdy>
		<sec>
			<st>
				<p>Rationale</p>
			</st>
			<p>Current efforts in whole-genome sequencing have led to a rapidly increasing number of publicly available bacterial genome sequences <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Novel technologies, such as genome-wide transcriptional profiling using DNA microarrays, enables the study of the transcriptional regulation of various processes in these sequenced microorganisms, which can, subsequently, lead to the identification of the regulatory networks involved <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. Bioinformatics tools that enable one to predict and/or identify transcription regulatory elements and terminator sites are publicly available <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>.</p>
			<p>Graphical representations have proved very useful for the efficient interpretation of large amounts of biological data (for example, metabolic pathway and gene regulatory network visualization <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>, transcriptome data analysis and/or clustering <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>). Our group investigates metabolic pathways and gene regulatory networks of different Gram-positive bacteria. For easy and rapid interpretation of transcriptome data, we required software that enables us to project this onto a linear bacterial genome map, together with additional data (that is, terminator and regulator binding sites). Zimmer and co-workers have previously visualized transcriptome data (displayed as spots) in gene order <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. However, their program does not allow the inclusion of data on transcription regulatory and terminator sites or other customized data. Visualization of such information would facilitate the interpretation of transcriptomes by displaying which genes are coexpressed in a transcriptional unit (an operon <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>), or are transcribed via readthrough from the neighboring gene (or genes), or lead to the formation of antisense RNA. The possibility of adding putative binding sites for transcriptional regulators onto the genome map would be a quick and convenient way to assess the biological relevance of such operator sites. Furthermore, visual analysis can be preferable over a statistical (mathematical) approach, as relevant data can easily be ignored if too high cutoff settings are applied. We screened several powerful commercial and public-domain software packages for transcriptome data visualization (GenVision (DNAStar, Madison, WI), GeneSpring (Silicon Genetics, Redwood City, CA), Kyoto Encyclopedia of Genes and Genomes (KEGG) <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, EcoCyc <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> and TM4 <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>), but none of these fulfilled our needs. We therefore developed the Microsoft Windows-based program Genome2D.</p>
		</sec>
		<sec>
			<st>
				<p>Genome2D</p>
			</st>
			<p>Genome2D was programmed in Borland Delphi 6 and compiled to a Microsoft Windows 9x/NT/2000/XP application. With its graphical user interface the program is easy to use for non-experts and is easily accessible because of its low system requirements; it can be installed on a standard local Windows personal computer, making it fast and safe (when confidentiality is required). The object-oriented programming environment of Delphi makes it easy to extend Genome2D. The CADSys 4 library version 4.2 was used for two-dimensional visualization of genomes. This library extends the Delphi vectorial graphics support to include 2D/3D CAD-like functions in applications.</p>
			<p>The most prominent feature of Genome2D is a drawing module that generates comprehensive bacterial genome maps, in a single window screen, that can include specific genetic elements such as transcription terminators or regulator binding sites (Figure <figr fid="F1">1</figr>). The user can easily prepare figures for use in printed or digital format.</p>
			<fig id="F1">
				<title>
					<p>Figure 1</p>
				</title>
				<caption>
					<p>Genome2D visualization of the genomic organization of <it>L. lactis </it>IL1403 (GenBank annotation: AE0051576)</p>
				</caption>
				<text>
					<p>Genome2D visualization of the genomic organization of <it>L. lactis </it>IL1403 (GenBank annotation: AE0051576). The figure displays a partial, detailed view in which putative terminators, determined using the TIGR software package TransTerm, are shown as stem-loop structures <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B46">46</abbr></abbrgrp>. Predicted promoter elements (-35 boxes in green; -10 boxes in blue) and <it>cre</it>-boxes (in red) are shown. See text for more details.</p>
				</text>
				<graphic file="gb-2004-5-5-r37-1"/>
			</fig>
			<p>Display of DNA microarray data in Genome2D is done by coloring the selected genes using a simple input file - that is, a tab-delimited text file with one column containing the names of the genes to be colored (corresponding to the gene names from the annotation file), and a second column with the color codes (black, white, red, yellow, fuchsia, green, lime, blue or aqua), or values, such as gene-expression ratios, on the basis of which color shades are assigned. A defined number of datasets from a complex transcriptome analysis experiment (for example, time-course measurements) can be loaded as separate input files, after which the data can be shown in animation, a feature that, to our knowledge, is not present in existing software. Clearly, the input files are not restricted to transcriptome data, and different kinds of datasets can be projected, such as from proteome analysis.</p>
			<sec>
				<st>
					<p>An umbrella for analysis tools</p>
				</st>
				<p>In addition to its visualization capabilities, Genome2D serves as a platform for different bioinformatics tools, such as data-extraction and conversion algorithms, which are summarized in Table <tblr tid="T1">1</tblr>. The combination of visualization and information extraction allows subsequent rounds of analyses, and thus an increase in data complexity, making Genome2D a powerful tool in the investigation of bacterial genomics data, especially from transcriptome and proteome analyses. Newly developed algorithms or tools can be easily implemented within the framework of the program.</p>
				<tbl id="T1" hint_layout="double">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Features of Genome2D*</p>
					</caption>
					<tblbdy cols="2">
						<r>
							<c ca="left">
								<p>Menu</p>
							</c>
							<c ca="left">
								<p>Description</p>
							</c>
						</r>
						<r>
							<c cspan="2">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>File</p>
							</c>
							<c ca="left">
								<p>Various input files (for example, FastA, GenBank, Glimmer, Paradox) can be loaded into Genome2D; contains commands to handle the program</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Blast</p>
							</c>
							<c ca="left">
								<p>Window to perform blast searches on a local system or at NCBI and handle blast results (data extraction)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Search</p>
							</c>
							<c ca="left">
								<p>Algorithms to make a weight matrix (consensus sequence/motif); use weight matrix or input motif to screen loaded genome (see Example analysis: <it>CcpA </it>regulon in <it>L. lactis</it>)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Drawing</p>
							</c>
							<c ca="left">
								<p>Drawing of whole genome on linear map including additional information (promoter sites, terminators, regulator binding sites). Individual genes can be colored (manual selection). Changes in gene expression (multiple datasets in animation) are indicated by variation in color or number (see Application example: <it>ComK </it>regulon in <it>B. subtilis</it>)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Tools</p>
							</c>
							<c ca="left">
								<p>Algorithms for analysis of genomic DNA, randomization (statistical analysis) and extraction of coding or noncoding regions</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Boxes</p>
							</c>
							<c ca="left">
								<p>Algorithms to analyze operons, upstream regions, box sequences and promoters. Custom adaptation of these algorithms is easily implemented (see example of K-box analyses <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Reformatting</p>
							</c>
							<c ca="left">
								<p>Algorithms to convert files to another format</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Proteomics</p>
							</c>
							<c ca="left">
								<p>Trypsin digestion on a database of proteins</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>*Online help can be obtained from <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>.</p>
					</tblfn>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>Applications</p>
				</st>
				<p>Genome2D can be used for all annotated bacterial genome sequences. In our group, Genome2D is commonly used for the analysis of genomics data from <it>Bacillus cereus, Bacillus subtilis, Lactococcus lactis, Lactobacillus plantarum </it>and <it>Streptococcus pneumoniae</it>. We will illustrate the strength of Genome2D in visualization of transcriptome data hereafter, using the genomes of <it>B. subtilis </it>168 <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> and <it>L. lactis </it>IL1403 <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> as examples.</p>
			</sec>
			<sec>
				<st>
					<p>The power of visualization</p>
				</st>
				<p>There are a number of benefits of visual inspection of transcriptome data compared with statistical analyses, which we will show here using published transcriptome data <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Most important, visualization can help in discerning true low-level gene activation. For instance, <it>groES </it>was classified as a ComK-regulated gene, as it met the stringent cutoff set in the analysis of Hamoen and co-workers <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. However, <it>groEL </it>failed to meet these criteria. It has been shown that <it>groES </it>and <it>groEL </it>are part of a single operon in <it>B. subtilis </it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. When the transcriptome data are visualized in Genome2D, one can see that <it>groEL </it>actually shows some level of activation, suggesting that <it>groEL </it>and <it>groES </it>are indeed activated as an operon (Figure <figr fid="F2">2a</figr>). Choosing cutoff values to define the set of regulated genes is a rather arbitrary process. Moreover, the statistical value of expression data in transcriptome studies is based on a limited number of data points, and it is therefore not surprising that several possibly relevant genes, such as <it>groEL</it>, will be missed. Another example is given in Figure <figr fid="F2">2b</figr>. <it>yvrP, yvrN and yvrM </it>were found to be ComK-activated, whereas <it>yvrO </it>did not meet the criteria <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Visualization in Genome2D reveals that <it>yvrO </it>is also slightly activated, and allows the conclusion that all four genes are likely to form a ComK-dependent operon (Figure <figr fid="F2">2b</figr>).</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>Demonstration of the power of visualization in transcriptome analyses</p>
					</caption>
					<text>
						<p>Demonstration of the power of visualization in transcriptome analyses. The dataset used is from Hamoen and colleagues <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The strength of up- or downregulation is depicted by the intensity of the color. Stem-loop structures indicate annotated terminators. <b>(a,b) </b>Probable cases of low-level activation. Genes are colored on the basis of expression ratios from DNA macroarray experiments <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, without applying a stringent cutoff. Red shades indicate ComK-dependent activation, whereas green shows downregulation. Gray shades indicate ratios of around 1. Stem-loop structures are used to depict annotated terminators. K-boxes are shown by vertical red lines. <b>(c,d) </b>Putative cases of transcriptional readthrough. Red shades indicate significant ComK-dependent expression. K-boxes are depicted by vertical green lines. Gray genes are not significantly ComK-dependent. <b>(e,f) </b>Probable cases in which antisense RNA has a role (colors and symbols identical to (c) and (d)).</p>
					</text>
					<graphic file="gb-2004-5-5-r37-2"/>
				</fig>
				<p>Second, visualized transcriptome data can reveal putative transcriptional readthrough. For example, in the study of Hamoen and colleagues <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> mentioned above, thresholds of significance were partly based on the prior knowledge that limited readthrough from the <it>comF </it>operon occurs into the <it>yvyF</it>, <it>flgM </it>and <it>yvyG </it>genes <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. This becomes apparent also in the Genome2D visualization of the data from Hamoen and colleagues <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>: the <it>comF </it>operon and downstream-located genes show differential levels of ComK-induction (Figure <figr fid="F2">2c</figr>). Extending this notion, one can predict that the reported ComK-dependent activation of <it>spoIIB</it>/<it>maf</it>/<it>ysxA </it>(<it>radC</it>) and <it>yqzE </it>is due to readthrough from <it>comC </it>and the <it>comG </it>operon, respectively (Figure <figr fid="F2">2d</figr>) <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>.</p>
				<p>Third, it has been reported that the use of double-stranded amplicons in DNA array studies might lead to the detection of antisense RNA, the biological significance of which is unclear <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Genome2D helps in the identification of putative antisense RNA detection by showing whether activated genes are located in reverse orientation downstream of activated genes. In the case of <it>comE</it>, it is known that the <it>comER </it>gene is not transcribed during competence <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. However, in several array studies this gene appeared to be strongly activated by ComK <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>. From Figure <figr fid="F2">2e</figr>, it is apparent that this activation is due to the hybridization of antisense mRNA. Similarly, the observed expression of <it>yhxD </it>and several genes from the <it>yck</it>/<it>nucA</it>-<it>nin</it>/<it>tlpC </it>area may be instances of antisense RNA detection (Figure <figr fid="F2">2f</figr>). These observations cannot be made by normal statistical analyses without visual inspection.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Data extraction and analysis</p>
			</st>
			<p>To our knowledge no software is available in the public domain that allows information extraction and analysis in the way Genome2D does. To correlate expression with the activity of specific transcription factors more quantitatively, we incorporated several algorithms into Genome2D. The type of analyses that can be performed with these algorithms are exemplified below, using the analysis of the ComK-regulon in <it>B. subtilis </it>and the <it>in silico </it>prediction of the CcpA-regulon of <it>L. lactis</it>.</p>
			<p>Hamoen and colleagues <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> used Genome2D to correlate the occurrence of ComK-binding sites (K-boxes) to ComK-dependent expression of genes, with the aim of testing whether the presence of a K-box upstream of a gene can be used to predict ComK-activation. They assigned genes to putative operons using a widely used algorithm <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> incorporated into Genome2D ('Add First Gene of Operon to Gene List'). Furthermore, they identified all K-boxes in the <it>B. subtilis </it>genome (Box searches are available through the Search menu) and located the closest upstream box for all genes and operons ('Add Nearest Box to Gene List'). Finally, the program is able to link predicted binding sites to the genes located closest to the box ('Add Nearest Gene to Boxlist'). Using these and additional algorithms, the authors showed that the predictive value of a K-box can be significantly improved by taking into account genome organization, additional ComK-binding motifs, and binding sites for RNA polymerase <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>.</p>
			<sec>
				<st>
					<p>Prediction of the CcpA regulon in <it>L. lactis</it></p>
				</st>
				<p>As is the case in other bacteria, many <it>L. lactis </it>ssp. <it>lactis </it>IL1403 genes are of unknown function <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B31">31</abbr></abbrgrp>. Prediction of gene regulation can implicate unknown proteins in certain cellular processes and, by directing genetics approaches, can help to assign functions. This is illustrated by the prediction of the CcpA-regulon (sugar catabolism control) in <it>L. lactis </it>IL1403 using Genome2D. We searched for and visualized putative CcpA-binding sites and promoter elements in the genome of <it>L. lactis</it>. Using the search-module in Genome2D (&lt;<it>Search</it>&gt;, '<it>Make Trained Set'</it>) and a list of 36 catabolite-responsive element (<it>cre</it>-box) sequences from several Gram-positive bacteria (see Additional data file 1), a weight matrix <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> was made that generated the consensus sequence ATGWAARCGTTTWCA (where W represents A or T, and R represents A or G) (see Additional data file 2). Subsequent screening (&lt;<it>Search</it>&gt;, '<it>Search Trained Set</it>') for this consensus sequence, with an arbitrary cutoff of 8 (a perfect match would give a score of 10.8 with our weight matrix), identified 1,807 putative <it>cre</it>-boxes in the genome of <it>L. lactis </it>IL1403. Around 43% of these boxes are located in intergenic regions. As CcpA can act as a repressor or activator depending on the position of a <it>cre</it>-box relative to the RNA polymerase binding site <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, consensus -35 and -10 promoter element positions <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> of genes were predicted in the genome of <it>L. lactis </it>IL1403 using the MEME motif search routine <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> and Genome2D (A.L. Zomer, G. Buist, J.K. and O.P.K, unpublished data). The prediction was performed on intergenic regions from the <it>L. lactis </it>IL1403 genome, the primary location for promoter elements, which were extracted using Genome2D. Finally, the datasets from the <it>cre</it>-box and promoter element predictions were visualized onto the linear genome map of <it>L. lactis </it>IL1403 (see Figure <figr fid="F1">1</figr> and Additional data file 3). Visual inspection confirmed the presence of operons previously described as regulated by CcpA <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>. Thirteen <it>L. lactis </it>genes (out of 116 putative CcpA-regulated genes) have counterparts in <it>B. subtilis</it>, on the basis of protein sequence comparisons using BLASTP (<it>e-</it>values were lower than 10<sup>-49</sup>) (<abbrgrp><abbr bid="B39">39</abbr></abbrgrp> and see Additional data file 4). The 13 <it>B. subtilis </it>genes were among those that were recently shown to be CcpA-regulated in <it>B. subtilis </it>using DNA macroarray analyses <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>, indicating that Genome2D can be used to generate relevant predictions on gene regulation. However, we would like to emphasize that the <it>in silico </it>prediction of gene regulation has to be corroborated by 'real' biological experiments, such as genome-wide transcriptome analysis <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Conclusions</p>
			</st>
			<p>Our analyses of transcriptome data in relation to the activity of specific transcription factors and their operator sites required a more flexible genome visualization program than is currently publicly available. We therefore developed Genome2D, a software tool that enables visualization of transcriptome data onto a linear map of an annotated bacterial genome and at the same time highlights additional features, such as putative regulatory sequences and terminators. The combination of information extraction and visualization facilitates rapid, easy and intuitive analysis of genomics data, and in our research group Genome2D proved to be of great assistance in the study of transcriptome data. New algorithms can be rapidly implemented in the Genome2D program menu structure. Regular updates of Genome2D will be available via the Internet <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. Because of the exponential increase of publicly available bacterial genome sequences and large-scale experiments, tools like Genome2D will become indispensable for the interpretation of complex datasets, such as those from transcriptome and proteome studies.</p>
		</sec>
		<sec>
			<st>
				<p>Additional data files</p>
			</st>
			<p>The following additional data are available with the online version of this article: a list of <it>cre</it>-box sequences found in Gram-positive bacteria (Additional data file <supplr sid="s1">1</supplr>); a screen dump from Genome2D showing the <it>cre</it>-box weight matrix (Additional data file <supplr sid="s2">2</supplr>); a Genome2D input (tab-delimited text) file with the coordinates of the identified <it>cre</it>-boxes and promoter elements in the genome of <it>L. lactis </it>IL1403 (color file) (Additional data file <supplr sid="s3">3</supplr>); a table of <it>cre</it>-boxes identified in promoters of genes in the <it>L. lactis </it>IL1403 genome (Additional data file <supplr sid="s4">4</supplr>). All additional data files can also be obtained from <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>.</p>
			<suppl id="s1">
				<title>
					<p>Additional data file 1</p>
				</title>
				<caption>
					<p>A list of <it>cre</it>-box sequences found in Gram-positive bacteria</p>
				</caption>
				<text>
					<p>A list of <it>cre</it>-box sequences found in Gram-positive bacteria</p>
				</text>
				<file name="gb-2004-5-5-r37-s1.doc">
					<p>Click here for additional data file</p>
				</file>
			</suppl>
			<suppl id="s2">
				<title>
					<p>Additional data file 2</p>
				</title>
				<caption>
					<p>A screen dump from Genome2D showing the <it>cre</it>-box weight matrix</p>
				</caption>
				<text>
					<p>A screen dump from Genome2D showing the <it>cre</it>-box weight matrix</p>
				</text>
				<file name="gb-2004-5-5-r37-s2.doc">
					<p>Click here for additional data file</p>
				</file>
			</suppl>
			<suppl id="s3">
				<title>
					<p>Additional data file 3</p>
				</title>
				<caption>
					<p>A Genome2D input (tab-delimited text) file with the coordinates of the identified <it>cre</it>-boxes and promoter elements in the genome of <it>L. lactis </it>IL1403</p>
				</caption>
				<text>
					<p>A Genome2D input (tab-delimited text) file with the coordinates of the identified <it>cre</it>-boxes and promoter elements in the genome of <it>L. lactis </it>IL1403</p>
				</text>
				<file name="gb-2004-5-5-r37-s3.txt">
					<p>Click here for additional data file</p>
				</file>
			</suppl>
			<suppl id="s4">
				<title>
					<p>Additional data file 4</p>
				</title>
				<caption>
					<p>A table of <it>cre</it>-boxes identified in promoters of genes in the <it>L. lactis </it>IL1403 genome</p>
				</caption>
				<text>
					<p>A table of <it>cre</it>-boxes identified in promoters of genes in the <it>L. lactis </it>IL1403 genome</p>
				</text>
				<file name="gb-2004-5-5-r37-s4.doc">
					<p>Click here for additional data file</p>
				</file>
			</suppl>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>The authors acknowledge Piero Valagussa for development and distribution of CADSys 4 library version 4.2 and thank Aldert Zomer for performing the MEME-search. They are also much obliged to the members of the Molecular Genetics group for their valuable comments and suggestions to improve the program.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Microbial genomes multiply.</p>
				</title>
				<aug>
					<au>
						<snm>Doolittle</snm>
						<fnm>RF</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2002</pubdate>
				<volume>416</volume>
				<fpage>697</fpage>
				<lpage>700</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/416697a</pubid>
						<pubid idtype="pmpid" link="fulltext">11961543</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Gold Genomes Online Database: Prokaryotic Ongoing Genome Projects</p>
				</title>
				<url>http://wit.integratedgenomics.com/GOLD/index.cgi?want=Prokaryotic+Ongoing+Genomes</url>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Exploring the metabolic and genetic control of gene expression on a genomic scale.</p>
				</title>
				<aug>
					<au>
						<snm>DeRisi</snm>
						<fnm>JL</fnm>
					</au>
					<au>
						<snm>Iyer</snm>
						<fnm>VR</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>PO</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>1997</pubdate>
				<volume>278</volume>
				<fpage>680</fpage>
				<lpage>686</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.278.5338.680</pubid>
						<pubid idtype="pmpid" link="fulltext">9381177</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Microarrays for microbiologists.</p>
				</title>
				<aug>
					<au>
						<snm>Lucchini</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Thompson</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Hinton</snm>
						<fnm>JCD</fnm>
					</au>
				</aug>
				<source>Microbiology</source>
				<pubdate>2001</pubdate>
				<volume>147</volume>
				<fpage>1403</fpage>
				<lpage>1414</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11390672</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Deciphering gene expression regulatory networks.</p>
				</title>
				<aug>
					<au>
						<snm>Wyrick</snm>
						<fnm>JJ</fnm>
					</au>
					<au>
						<snm>Young</snm>
						<fnm>RA</fnm>
					</au>
				</aug>
				<source>Curr Opin Genet Dev</source>
				<pubdate>2002</pubdate>
				<volume>12</volume>
				<fpage>130</fpage>
				<lpage>136</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0959-437X(02)00277-0</pubid>
						<pubid idtype="pmpid" link="fulltext">11893484</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Microarray expression profiling: capturing a genome-wide portrait of the transcriptome.</p>
				</title>
				<aug>
					<au>
						<snm>Conway</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Schoolnik</snm>
						<fnm>GK</fnm>
					</au>
				</aug>
				<source>Mol Microbiol</source>
				<pubdate>2003</pubdate>
				<volume>47</volume>
				<fpage>879</fpage>
				<lpage>889</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1046/j.1365-2958.2003.03338.x</pubid>
						<pubid idtype="pmpid" link="fulltext">12581346</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Combining frequency and positional information to predict transcription factor binding sites.</p>
				</title>
				<aug>
					<au>
						<snm>Kielbasa</snm>
						<fnm>SM</fnm>
					</au>
					<au>
						<snm>Korbel</snm>
						<fnm>JO</fnm>
					</au>
					<au>
						<snm>Beule</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Schuchhardt</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Herzel</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2001</pubdate>
				<volume>17</volume>
				<fpage>1019</fpage>
				<lpage>1026</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/17.11.1019</pubid>
						<pubid idtype="pmpid" link="fulltext">11724730</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>A probabilistic method for identifying start codons in bacterial genomes.</p>
				</title>
				<aug>
					<au>
						<snm>Suzek</snm>
						<fnm>BE</fnm>
					</au>
					<au>
						<snm>Ermolaeva</snm>
						<fnm>MD</fnm>
					</au>
					<au>
						<snm>Schreiber</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Salzberg</snm>
						<fnm>SL</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2001</pubdate>
				<volume>17</volume>
				<fpage>1123</fpage>
				<lpage>1130</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/17.12.1123</pubid>
						<pubid idtype="pmpid" link="fulltext">11751220</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Co-expression pattern from DNA microarray experiments as a tool for operon prediction.</p>
				</title>
				<aug>
					<au>
						<snm>Sabatti</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Rohlin</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Oh</snm>
						<fnm>M-K</fnm>
					</au>
					<au>
						<snm>Liao</snm>
						<fnm>JC</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2002</pubdate>
				<volume>30</volume>
				<fpage>2886</fpage>
				<lpage>2893</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/nar/gkf388</pubid>
						<pubid idtype="pmpid" link="fulltext">12087173</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Regulatory element detection using correlation with expression.</p>
				</title>
				<aug>
					<au>
						<snm>Bussemaker</snm>
						<fnm>HJ</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Siggia</snm>
						<fnm>ED</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2001</pubdate>
				<volume>27</volume>
				<fpage>167</fpage>
				<lpage>171</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/84792</pubid>
						<pubid idtype="pmpid" link="fulltext">11175784</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Prediction of transcription terminators in bacterial genomes.</p>
				</title>
				<aug>
					<au>
						<snm>Ermolaeva</snm>
						<fnm>MD</fnm>
					</au>
					<au>
						<snm>Khalek</snm>
						<fnm>HG</fnm>
					</au>
					<au>
						<snm>White</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>HO</fnm>
					</au>
					<au>
						<snm>Salzberg</snm>
						<fnm>SL</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>2000</pubdate>
				<volume>301</volume>
				<fpage>27</fpage>
				<lpage>33</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jmbi.2000.3836</pubid>
						<pubid idtype="pmpid" link="fulltext">10926490</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene.</p>
				</title>
				<aug>
					<au>
						<snm>Snel</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Lehmann</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Bork</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Huynen</snm>
						<fnm>MA</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2000</pubdate>
				<volume>28</volume>
				<fpage>3442</fpage>
				<lpage>3444</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/nar/28.18.3442</pubid>
						<pubid idtype="pmpid" link="fulltext">10982861</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>Computational identification of operons in microbial genomes.</p>
				</title>
				<aug>
					<au>
						<snm>Zheng</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Szustakowski</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Fortnow</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Roberts</snm>
						<fnm>RJ</fnm>
					</au>
					<au>
						<snm>Kasif</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2002</pubdate>
				<volume>12</volume>
				<fpage>1221</fpage>
				<lpage>1230</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1101/gr.200601</pubid>
						<pubid idtype="pmpid" link="fulltext">12176930</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Genome-wide analysis of bacterial promoter regions.</p>
				</title>
				<aug>
					<au>
						<snm>Eskin</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Keich</snm>
						<fnm>U</fnm>
					</au>
					<au>
						<snm>Gelfand</snm>
						<fnm>MS</fnm>
					</au>
					<au>
						<snm>Pevzner</snm>
						<fnm>PA</fnm>
					</au>
				</aug>
				<source>Pac Symp Biocomput</source>
				<pubdate>2003</pubdate>
				<fpage>29</fpage>
				<lpage>40</lpage>
				<xrefbib>
					<pubid idtype="pmpid">12603015</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>The KEGG databases at GenomeNet.</p>
				</title>
				<aug>
					<au>
						<snm>Kanehisa</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Goto</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Kawashima</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Nakaya</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2002</pubdate>
				<volume>30</volume>
				<fpage>42</fpage>
				<lpage>46</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/nar/30.1.42</pubid>
						<pubid idtype="pmpid" link="fulltext">11752249</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>The MetaCyc Database.</p>
				</title>
				<aug>
					<au>
						<snm>Karp</snm>
						<fnm>PD</fnm>
					</au>
					<au>
						<snm>Riley</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Paley</snm>
						<fnm>SM</fnm>
					</au>
					<au>
						<snm>Pellegrini-Toole</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2002</pubdate>
				<volume>30</volume>
				<fpage>59</fpage>
				<lpage>61</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/nar/30.1.59</pubid>
						<pubid idtype="pmpid" link="fulltext">11752254</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>RegulonDB (Version 3.2): transcriptome regulation and operon organization in <it>Escherichia coli </it>K-12.</p>
				</title>
				<aug>
					<au>
						<snm>Salgado</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Santos-Zavaleta</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Gama-Castro</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Millan-Zarate</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Diaz-Peredo</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Sanchez-Solano</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Perez-Rueda</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Bonavides-Martinez</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Collado-Vides</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2001</pubdate>
				<volume>29</volume>
				<fpage>72</fpage>
				<lpage>74</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/nar/29.1.72</pubid>
						<pubid idtype="pmpid" link="fulltext">11125053</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Clustering analysis and display of genome-wide expression patterns.</p>
				</title>
				<aug>
					<au>
						<snm>Eisen</snm>
						<fnm>MB</fnm>
					</au>
					<au>
						<snm>Spellman</snm>
						<fnm>PT</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>PO</fnm>
					</au>
					<au>
						<snm>Botstein</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>1998</pubdate>
				<volume>95</volume>
				<fpage>14863</fpage>
				<lpage>14868</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1073/pnas.95.25.14863</pubid>
						<pubid idtype="pmpid" link="fulltext">9843981</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>TM4: a free, open-source system for microarray data management and analysis.</p>
				</title>
				<aug>
					<au>
						<snm>Saeed</snm>
						<fnm>AI</fnm>
					</au>
					<au>
						<snm>Sharov</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>White</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Liang</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Bhagabati</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Braisted</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Klapa</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Currier</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Thiagarajan</snm>
						<fnm>M</fnm>
					</au>
					<etal/>
				</aug>
				<source>Biotechniques</source>
				<pubdate>2003</pubdate>
				<volume>34</volume>
				<fpage>374</fpage>
				<lpage>378</lpage>
				<xrefbib>
					<pubid idtype="pmpid">12613259</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>Nitrogen regulatory protein C-controlled genes of <it>Escherichia coli</it>: scavenging as a defense against nitrogen limitation.</p>
				</title>
				<aug>
					<au>
						<snm>Zimmer</snm>
						<fnm>DP</fnm>
					</au>
					<au>
						<snm>Soupene</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Lee</snm>
						<fnm>HL</fnm>
					</au>
					<au>
						<snm>Wendish</snm>
						<fnm>VF</fnm>
					</au>
					<au>
						<snm>Khodursky</snm>
						<fnm>AB</fnm>
					</au>
					<au>
						<snm>Peter</snm>
						<fnm>BJ</fnm>
					</au>
					<au>
						<snm>Bender</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Kustu</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2000</pubdate>
				<volume>97</volume>
				<fpage>14674</fpage>
				<lpage>14679</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1073/pnas.97.26.14674</pubid>
						<pubid idtype="pmpid" link="fulltext">11121068</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Genetic regulatory mechanisms in the synthesis of proteins.</p>
				</title>
				<aug>
					<au>
						<snm>Jacob</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Monod</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>1961</pubdate>
				<volume>3</volume>
				<fpage>318</fpage>
				<lpage>356</lpage>
				<xrefbib>
					<pubid idtype="pmpid">13718526</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>The complete genome sequence of the Gram-positive bacterium <it>Bacillus subtilis</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Kunst</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Ogasawara</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Moszer</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Albertini</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Alloni</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Azevedo</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Bertero</snm>
						<fnm>MG</fnm>
					</au>
					<au>
						<snm>Bessieres</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Bolotin</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Borchert</snm>
						<fnm>S</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nature</source>
				<pubdate>1997</pubdate>
				<volume>390</volume>
				<fpage>249</fpage>
				<lpage>256</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/36786</pubid>
						<pubid idtype="pmpid" link="fulltext">9384377</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>The complete genome sequence of the lactic acid bacterium <it>Lactococcus lactis </it>ssp. <it>lactis </it>IL1403.</p>
				</title>
				<aug>
					<au>
						<snm>Bolotin</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Wincker</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Mauger</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Jaillon</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Malarme</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Weissenbach</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Ehrlich</snm>
						<fnm>SD</fnm>
					</au>
					<au>
						<snm>Sorokin</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2001</pubdate>
				<volume>11</volume>
				<fpage>731</fpage>
				<lpage>753</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1101/gr.GR-1697R</pubid>
						<pubid idtype="pmpid" link="fulltext">11337471</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>Improving the predictive value of the competence transcription factor (ComK) binding site in <it>Bacillus subtilis </it>using a genomic approach.</p>
				</title>
				<aug>
					<au>
						<snm>Hamoen</snm>
						<fnm>LW</fnm>
					</au>
					<au>
						<snm>Smits</snm>
						<fnm>WK</fnm>
					</au>
					<au>
						<snm>de Jong</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Holsappel</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Kuipers</snm>
						<fnm>OP</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2002</pubdate>
				<volume>30</volume>
				<fpage>5517</fpage>
				<lpage>5528</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/nar/gkf698</pubid>
						<pubid idtype="pmpid" link="fulltext">12490720</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>Cloning, sequencing, mapping, and transcriptional analysis of the <it>groESL </it>operon from <it>Bacillus subtilis</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Schmidt</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Schiesswohl</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Volker</snm>
						<fnm>U</fnm>
					</au>
					<au>
						<snm>Hecker</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Schumann</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>J Bacteriol</source>
				<pubdate>1992</pubdate>
				<volume>174</volume>
				<fpage>3993</fpage>
				<lpage>3999</lpage>
				<xrefbib>
					<pubid idtype="pmpid">1350777</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B26">
				<title>
					<p>A molecular switch controlling competence and motility: competence regulatory factors ComS, MecA, and ComK control sigma D-dependent gene expression in <it>Bacillus subtilis</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Liu</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Zuber</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>J Bacteriol</source>
				<pubdate>1998</pubdate>
				<volume>180</volume>
				<fpage>4243</fpage>
				<lpage>4251</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">9696775</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>Characterization of <it>comE</it>, a late competence operon of <it>Bacillus subtilis </it>required for the binding and uptake of transforming DNA.</p>
				</title>
				<aug>
					<au>
						<snm>Hahn</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Inamine</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Kozlov</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Dubnau</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Mol Microbiol</source>
				<pubdate>1993</pubdate>
				<volume>10</volume>
				<fpage>99</fpage>
				<lpage>111</lpage>
				<xrefbib>
					<pubid idtype="pmpid">7968523</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Whole-genome analysis of genes regulated by the <it>Bacillus subtilis </it>competence transcription factor ComK.</p>
				</title>
				<aug>
					<au>
						<snm>Ogura</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Yamaguchi</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Kobayashi</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Ogasawara</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Fujita</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Tanaka</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>J Bacteriol</source>
				<pubdate>2002</pubdate>
				<volume>184</volume>
				<fpage>2344</fpage>
				<lpage>2351</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1128/JB.184.9.2344-2351.2002</pubid>
						<pubid idtype="pmpid" link="fulltext">11948146</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Microarray analysis of the <it>Bacillus subtilis </it>K-state: genome-wide expression changes dependent on ComK.</p>
				</title>
				<aug>
					<au>
						<snm>Berka</snm>
						<fnm>RM</fnm>
					</au>
					<au>
						<snm>Hahn</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Albano</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Draskovic</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Persuh</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Cui</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Sloma</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Widner</snm>
						<fnm>WD</fnm>
					</au>
					<au>
						<snm>Dubnau</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Mol Microbiol</source>
				<pubdate>2002</pubdate>
				<volume>43</volume>
				<fpage>1331</fpage>
				<lpage>1345</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1046/j.1365-2958.2002.02833.x</pubid>
						<pubid idtype="pmpid" link="fulltext">11918817</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Essentiality, not expressiveness, drives gene-strand bias in bacteria.</p>
				</title>
				<aug>
					<au>
						<snm>Rocha</snm>
						<fnm>EPC</fnm>
					</au>
					<au>
						<snm>Danchin</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2003</pubdate>
				<volume>34</volume>
				<fpage>377</fpage>
				<lpage>378</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/ng1209</pubid>
						<pubid idtype="pmpid" link="fulltext">12847524</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>Gene regulation in <it>Lactococcus lactis</it>: gap between predicted and characterized regulators.</p>
				</title>
				<aug>
					<au>
						<snm>Gu&#233;don</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Jamet</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Renault</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Antonie van Leeuwenhoek</source>
				<pubdate>2002</pubdate>
				<volume>82</volume>
				<fpage>93</fpage>
				<lpage>112</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1023/A:1020680926267</pubid>
						<pubid idtype="pmpid" link="fulltext">12369207</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Computer methods to locate signals in nucleic acid sequences.</p>
				</title>
				<aug>
					<au>
						<snm>Staden</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1984</pubdate>
				<volume>12</volume>
				<fpage>505</fpage>
				<lpage>519</lpage>
				<xrefbib>
					<pubid idtype="pmpid">6364039</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>The role of the CcpA transcriptional regulator in carbon metabolism in <it>Bacillus subtilis</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Henkin</snm>
						<fnm>TM</fnm>
					</au>
				</aug>
				<source>FEMS Microbiol Lett</source>
				<pubdate>1996</pubdate>
				<volume>135</volume>
				<fpage>9</fpage>
				<lpage>15</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0378-1097(95)00370-3</pubid>
						<pubid idtype="pmpid" link="fulltext">8598282</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>Gene expression in <it>Lactococcus lactis</it>.</p>
				</title>
				<aug>
					<au>
						<snm>van der Guchte</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Kok</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Venema</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>FEMS Microbiol Rev</source>
				<pubdate>1992</pubdate>
				<volume>8</volume>
				<fpage>73</fpage>
				<lpage>92</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid">1558766</pubid>
						<pubid idtype="doi">10.1016/0378-1097(92)90685-H</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B35">
				<title>
					<p>The value of prior knowledge in discovering motifs with MEME.</p>
				</title>
				<aug>
					<au>
						<snm>Bailey</snm>
						<fnm>TL</fnm>
					</au>
					<au>
						<snm>Elkan</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Proc Int Conf Intell Syst Mol Biol</source>
				<pubdate>1995</pubdate>
				<volume>3</volume>
				<fpage>21</fpage>
				<lpage>29</lpage>
				<xrefbib>
					<pubid idtype="pmpid">7584439</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B36">
				<title>
					<p>Transcriptional activation of the glycolytic <it>las </it>operon and catabolite repression of the <it>gal </it>operon in <it>Lactococcus lactis </it>are mediated by the catabolite control protein CcpA.</p>
				</title>
				<aug>
					<au>
						<snm>Luesink</snm>
						<fnm>EJ</fnm>
					</au>
					<au>
						<snm>van Herpen</snm>
						<fnm>REMA</fnm>
					</au>
					<au>
						<snm>Grossiord</snm>
						<fnm>BP</fnm>
					</au>
					<au>
						<snm>Kuipers</snm>
						<fnm>OP</fnm>
					</au>
					<au>
						<snm>de Vos</snm>
						<fnm>WM</fnm>
					</au>
				</aug>
				<source>Mol Microbiol</source>
				<pubdate>1998</pubdate>
				<volume>30</volume>
				<fpage>789</fpage>
				<lpage>798</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1046/j.1365-2958.1998.01111.x</pubid>
						<pubid idtype="pmpid" link="fulltext">10094627</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B37">
				<title>
					<p>Molecular physiology of sugar catabolism in <it>Lactococcus lactis </it>IL1403.</p>
				</title>
				<aug>
					<au>
						<snm>Even</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Lindley</snm>
						<fnm>ND</fnm>
					</au>
					<au>
						<snm>Cocaign-Bousquet</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>J Bacteriol</source>
				<pubdate>2001</pubdate>
				<volume>183</volume>
				<fpage>3817</fpage>
				<lpage>3824</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1128/JB.183.13.3817-3824.2001</pubid>
						<pubid idtype="pmpid" link="fulltext">11395443</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B38">
				<title>
					<p>CcpA regulation of aerobic and respiration growth in <it>Lactococcus lactis</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Gaudu</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Lamberet</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Poncet</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Gruss</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Mol Microbiol</source>
				<pubdate>2003</pubdate>
				<volume>50</volume>
				<fpage>183</fpage>
				<lpage>192</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1046/j.1365-2958.2003.03700.x</pubid>
						<pubid idtype="pmpid" link="fulltext">14507373</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B39">
				<title>
					<p>Basic local alignment search tool.</p>
				</title>
				<aug>
					<au>
						<snm>Altschul</snm>
						<fnm>SF</fnm>
					</au>
					<au>
						<snm>Gish</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Miller</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Myers</snm>
						<fnm>EW</fnm>
					</au>
					<au>
						<snm>Lipman</snm>
						<fnm>DJ</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>1990</pubdate>
				<volume>215</volume>
				<fpage>403</fpage>
				<lpage>410</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jmbi.1990.9999</pubid>
						<pubid idtype="pmpid" link="fulltext">2231712</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B40">
				<title>
					<p>Catabolite repression by the CcpA protein in <it>Bacillus subtilis</it>: novel modes of regulation by whole-genome analyses.</p>
				</title>
				<aug>
					<au>
						<snm>Moreno</snm>
						<fnm>MS</fnm>
					</au>
					<au>
						<snm>Schneider</snm>
						<fnm>BL</fnm>
					</au>
					<au>
						<snm>Maile</snm>
						<fnm>RR</fnm>
					</au>
					<au>
						<snm>Weyler</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Saier</snm>
						<fnm>MH</fnm>
						<suf>Jr</suf>
					</au>
				</aug>
				<source>Mol Microbiol</source>
				<pubdate>2001</pubdate>
				<volume>39</volume>
				<fpage>1366</fpage>
				<lpage>1381</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1046/j.1365-2958.2001.02328.x</pubid>
						<pubid idtype="pmpid" link="fulltext">11251851</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B41">
				<title>
					<p>A motif co-occurrence approach for genome-wide prediction of transcription-factor-binding sites in <it>Escherichia coli</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Bulyk</snm>
						<fnm>ML</fnm>
					</au>
					<au>
						<snm>McGuire</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Masuda</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Church</snm>
						<fnm>GM</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2004</pubdate>
				<volume>14</volume>
				<fpage>201</fpage>
				<lpage>208</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1101/gr.1448004</pubid>
						<pubid idtype="pmpid" link="fulltext">14762058</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B42">
				<title>
					<p>Computational prediction of transcription-factor binding site locations.</p>
				</title>
				<aug>
					<au>
						<snm>Bulyk</snm>
						<fnm>ML</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2003</pubdate>
				<volume>5</volume>
				<fpage>201</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1186/gb-2003-5-1-201</pubid>
						<pubid idtype="pmpid" link="fulltext">14709165</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B43">
				<title>
					<p>Molecular Genetics software</p>
				</title>
				<url>http://molgen.biol.rug.nl/molgen/research/molgensoftware.php</url>
			</bibl>
			<bibl id="B44">
				<title>
					<p>Genome2D: supplementary data</p>
				</title>
				<url>http://molgen.biol.rug.nl/publication/genome2d_data</url>
			</bibl>
			<bibl id="B45">
				<title>
					<p>Genome2D: online help - home</p>
				</title>
				<url>http://molgen.biol.rug.nl/genome2d</url>
			</bibl>
			<bibl id="B46">
				<title>
					<p>TIGR Software TransTerm, <it>Lactococcus lactis </it>subsp. <it>lactis </it>Rho-Independent Terminators</p>
				</title>
				<url>http://www.tigr.org/software/TransTermResults/ntll01.html</url>
			</bibl>
		</refgrp>
	</bm>
</art>
