<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2180-5-35</ui>
   <ji>1471-2180</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>A census of membrane-bound and intracellular signal transduction proteins in bacteria: Bacterial IQ, extroverts and introverts</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Galperin</snm>
               <mi>Y</mi>
               <fnm>Michael</fnm>
               <insr iid="I1"/>
               <email>galperin@ncbi.nlm.nih.gov</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA</p>
            </ins>
         </insg>
         <source>BMC Microbiology</source>
         <issn>1471-2180</issn>
         <pubdate>2005</pubdate>
         <volume>5</volume>
         <issue>1</issue>
         <fpage>35</fpage>
         <url>http://www.biomedcentral.com/1471-2180/5/35</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">15955239</pubid>
               <pubid idtype="doi">10.1186/1471-2180-5-35</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>18</day>
               <month>4</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>14</day>
               <month>6</month>
               <year>2005</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>14</day>
               <month>6</month>
               <year>2005</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2005</year>
         <collab>Galperin; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Analysis of complete microbial genomes showed that intracellular parasites and other microorganisms that inhabit stable ecological niches encode relatively primitive signaling systems, whereas environmental microorganisms typically have sophisticated systems of environmental sensing and signal transduction.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>This paper presents results of a comprehensive census of signal transduction proteins &#8211; histidine kinases, methyl-accepting chemotaxis receptors, Ser/Thr/Tyr protein kinases, adenylate and diguanylate cyclases and c-di-GMP phosphodiesterases &#8211; encoded in 167 bacterial and archaeal genomes, sequenced by the end of 2004. The data have been manually checked to avoid false-negative and false-positive hits that commonly arise during large-scale automated analyses and compared against other available resources. The census data show uneven distribution of most signaling proteins among bacterial and archaeal phyla. The total number of signal transduction proteins grows approximately as a square of genome size. While histidine kinases are found in representatives of all phyla and are distributed according to the power law, other signal transducers are abundant in certain phylogenetic groups but virtually absent in others.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The complexity of signaling systems differs even among closely related organisms. Still, it usually can be correlated with the phylogenetic position of the organism, its lifestyle, and typical environmental challenges it encounters. The number of encoded signal transducers (or their fraction in the total protein set) can be used as a measure of the organism's ability to adapt to diverse conditions, the 'bacterial IQ', while the ratio of transmembrane receptors to intracellular sensors can be used to define whether the organism is an 'extrovert', actively sensing the environmental parameters, or an 'introvert', more concerned about its internal homeostasis. Some of the microorganisms with the highest IQ, including the current leader <it>Wolinella succinogenes</it>, are found among the poorly studied beta-, delta- and epsilon-proteobacteria. Among all bacterial phyla, only cyanobacteria appear to be true introverts, probably due to their capacity to conduct oxygenic photosynthesis, using a complex system of intracellular membranes. The census data, available at <url>http://www.ncbi.nlm.nih.gov/Complete_Genomes/SignalCensus.html</url>, can be used to get an insight into metabolic and behavioral propensities of each given organism and improve prediction of the organism's properties based solely on its genome sequence.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>All living organisms adjust their metabolism and behavior in response to the changes in their environment. For unicellular microorganisms, knowing themselves, i.e. constantly monitoring a variety of environmental and intracellular parameters, is a necessary condition of survival. Mechanisms of some adjustments can be as simple as those in the <it>lac </it>operon &#8211; the presence of a substrate induces expression of the genes that are necessary for assimilation of that substrate (although even <it>lac </it>operon has a complex high-level regulation through catabolite repression and inducer exclusion, see <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and references therein). More complex regulatory mechanisms include transmission of an external signal across the cytoplasmic membrane, followed by intracellular signal transduction to the appropriate genes (operons), metabolic enzymes, or to such organelles as bacterial flagella. Given that all these mechanisms have to be encoded in the organism's genome, the complexity of the signaling systems correlates with the genome size and the range of environmental challenges it normally encounters. Bacterial parasites that inhabit relatively stable host environments typically encode few, if any, signaling proteins (see <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>).</p>
         <p>Analysis of the first three sequenced microbial genomes revealed very few signaling systems: four histidine kinases (HKs), five response regulators (RRs) and no methyl-accepting chemotaxis proteins (MCPs) in <it>Haemophilus influenzae</it>, none of these in <it>Mycoplasma genitalium </it>or <it>Methanococcus </it>(recently renamed <it>Methanocaldococcus</it>) <it>jannaschii</it>. Analysis of the fourth sequenced organism, the freshwater cyanobacterium <it>Synechocystis </it>sp. PCC 6803, revealed 42 HKs and 38 RRs <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, whereas the fifth, <it>Mycoplasma pneumoniae</it>, again had none. The list of signaling proteins encoded in microbial genomes grew by leaps and bounds ever since, generally following the exponential increase in the number of completely sequenced genomes and the total number of proteins that they encode (Figure <figr fid="F1">1</figr>). Given the importance of two-component signal transduction in bacteria <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>, the numbers of HKs and RRs were routinely reported in many genome descriptions. However, due to the limitations of employed algorithms and arbitrarily high cut-off values in most sequence comparison protocols, certain HK variants were often missed, for example, the HKs of the LytS family (family HPK8 in the classification of Grebe and Stock <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>). Some HKs of the recently described HWE family <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> have not been recognized as HKs even when compared against SMART <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp> and Pfam <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp> domain databases <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Because of that, HKs were systematically undercounted: the number of HKs in <it>E. coli</it>, first reported to be 28 <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, was then revised upwards to 29 <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B17">17</abbr></abbrgrp> and now stands at 30 <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>; [see <supplr sid="S1">Additional file 1</supplr>]). Likewise, the number of HKs encoded by <it>Synechocystis </it>sp. PCC 6803, originally estimated to be 42 <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, has been revised to 46 <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. As a result, most estimates of the HK numbers published in previous years are unreliable. Besides, listings of signal transduction proteins typically did not take into account Ser/Thr/Tyr-specific protein kinases (STYKs) and protein phosphatases, which, as we now know, were encoded in the <it>H. influenzae</it>, <it>M. genitalium</it>, and <it>M. jannaschii </it>genomes [see <supplr sid="S1">Additional file 1</supplr>], see <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. Further, cross-genome comparisons revealed entirely new classes of signaling molecules with GGDEF and EAL domains, involved in the turnover of the c-di-GMP, a novel secondary messenger <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. Although genetic data and sequence considerations have long pointed to the diguanylate cyclase (c-di-GMP synthetase) activity of the GGDEF domain and the phosphodiesterase (c-di-GMP hydrolase) activity of the EAL domain, direct biochemical proof that this is indeed the case has became available only in the past year <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>, reviewed in <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. Predicted phosphodiesterase activity of the HD-GYP domain <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> has never been experimentally verified. Finally, although participation of cellular adenylate cyclases (ACs) in signal transduction was never in question, class 3 enzymes (AC3s) were recognized as legitimate environmental sensors only last year, when they were shown to function as light receptors modulating motility in cyanobacteria <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. Adenylate cyclases of class 1 and class 2, represented by experimentally characterized proteins from <it>E. coli </it>(AC1, <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>) and <it>Aeromonas hydrophila </it>(AC2, <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>), are cytoplasmic enzymes of relatively narrow phylogenetic distribution [see <supplr sid="S1">Additional file 1</supplr>] and are not known to function as environmental sensors.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Growth in the number of signal transduction proteins encoded in complete microbial genomes</p>
            </caption>
            <text>
               <p><b>Growth in the number of signal transduction proteins encoded in complete microbial genomes. </b>A. Histidine kinases (circles), methyl-accepting chemotaxis proteins (MCPs, squares) and Ser/Thr/Tyr protein kinases (STYKs, triangles). B. Diguanylate cyclases (GGDEF domains, diamonds), c-di-GMP-specific phosphodiesterases (EAL domains, triangles), and adenylate cyclases (circles). Open symbols indicate the total number of proteins with the corresponding domains, closed symbols &#8211; the number of membrane-bound proteins of each kind.</p>
            </text>
            <graphic file="1471-2180-5-35-1"/>
         </fig>
         <suppl id="S1">
            <title>
               <p>Additional File 1</p>
            </title>
            <text>
               <p>Results of the census of membrane-bound and intracellular signal transduction proteins in bacteria in HTML format</p>
            </text>
            <file name="1471-2180-5-35-S1.html">
               <p>Click here for file</p>
            </file>
         </suppl>
         <p>The diversity of the signal transduction systems made careful accounting for all of them a daunting task, further complicated by the paucity of the data on the cellular targets for STYKs <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> and virtual absence of any data on the mechanisms of c-di-GMP-mediated regulation <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. Hence, most signaling protein surveys focused exclusively on certain classes of membrane receptors (HKs and/or MCPs) and RRs <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>, or on certain organisms, mostly cyanobacteria and actinobacteria <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>. Shi, Kennelly and Potts performed a comprehensive survey of STYKs and protein phosphatases <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B39">39</abbr></abbrgrp>, but have not looked at other signaling proteins. Galperin and colleagues <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B26">26</abbr></abbrgrp> performed a census of HKs, GGDEF, and EAL domains but never considered STYKs or ACs. Surveys of the MCP and AC3 distribution in complete microbial genomes by Zhulin <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> and Shenoy and Visweswariah <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, respectively, were limited to these protein domains. The information on signaling systems is poorly represented in public databases. While HKs and RRs are covered in the KEGG database <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>, other signaling systems are not. The SENTRA <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>), SMART <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp> and COG <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp> databases have a good coverage of the first sequenced genomes but have not been updated in a while, whereas data in other databases, such as Pfam <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp> or PEDANT <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp> are generated mostly by automatic means and therefore prone to the biases described above (and also in the Results section).</p>
         <p>While preparing recent reviews on signal transduction in bacteria <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B22">22</abbr></abbrgrp>, the need for comprehensive and reliable data on the distribution of specific signaling systems among different phylogenetic lineages became all too obvious. Since signal transduction systems grow in number and complexity with the genome size and play increasingly important roles in environmental bacteria <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>, it has become clear that comparative analysis of such systems could provide a useful insight into bacterial behavior <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. Here I present a comprehensive census of HKs, MCPs, STYKs and ACs, as well as GGDEF, EAL, and HD-GYP domains encoded in complete genomes of 167 bacterial and archaeal species, sequenced by the end of 2004. I hope that availability of these data on a public web site <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, which will be updated as needed, will stimulate further analysis of microbial signal transduction and will lead to a better understanding of microbial behavior in various ecological niches.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Scope of the study</p>
            </st>
            <p>Bacterial signaling mechanisms are extremely diverse, ranging from simplest two-domain transcription regulators, such as AraC or LacI, to multi-component signaling cascades that regulate sporulation, flagellar biosynthesis or biofilm formation. Until recently, the term 'signal transduction' has been typically reserved for the two-component systems consisting of a sensor histidine kinase (HK) and a response regulator (RR). In keeping with this tradition, I did not include in this survey single-component transcriptional regulators, whether of AraC type <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> or much more complex NorR type <abbrgrp><abbr bid="B53">53</abbr></abbrgrp> and considered only dedicated signaling systems that consist of more than two individual components. In addition to HKs, these included Ser/Thr protein kinases, adenylate and diguanylate cyclases and two types of predicted c-di-GMP phosphodiesterases, containing, respectively, EAL or HD-GYP domains. Other enzymatic output domains as well as DNA- or RNA-binding response regulators have not been considered here but could be added to the list in the future. Because of the previously noted parallelism between the domain architectures of intracellular signaling proteins (e.g. PAS-GGDEF-EAL) and respective response regulators (e.g. CheY-PAS-GGDEF-EAL) <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, no attempt has been made to distinguish such proteins; they were counted both in the GGDEF and EAL columns. Naturally, such proteins were counted only once to obtain the total number of signaling proteins encoded in any given genome.</p>
            <p>The data set included complete bacterial and archaeal genomes sequenced by the end of 2004. While Archaea and Bacteria are generally considered separate domains of life in the prokaryotic world, there are indications that many signal transduction systems in archaea have been acquired from bacteria through lateral gene transfer <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B32">32</abbr></abbrgrp>. Hence, for the purposes of this study, domain Archaea was treated as just another bacterial phylum. Owing to the redundancy of the current genome list, only one representative genome per species was used in the analysis, typically the first one to be publicly released. Exceptions included two strains of <it>Escherichia coli</it>, K12 and O157:H7 <abbrgrp><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr></abbrgrp>, and three serovars of <it>Salmonella enterica</it>, Typhi, Typhimurium, and Paratyphi <abbrgrp><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Data validation</p>
            </st>
            <p>The total numbers of copies of each signaling domain encoded in each given genome were estimated in iterative PSI-BLAST <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> searches, using the strict inclusion threshold expect values of 10<sup>-7</sup>&#8211;10<sup>-4</sup>, adjusting as necessary. Potential false-positive hits were checked at every step of PSI-BLAST using the CDD Domain viewer <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> and manually removed (unselected) from the hit list for the next iteration of PSI-BLAST. The most typical sources of the false-positive hits were as follows.</p>
            <p><b><it>Histidine kinases </it></b>consist of two separate domains, (i) a well-conserved ATPase domain of the GHKL family <abbrgrp><abbr bid="B61">61</abbr><abbr bid="B62">62</abbr></abbrgrp>, referred to as HATPase_c domain [Pfam:PF02518 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF02518</url>] in the Pfam database <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, and (ii) a less-conserved phosphoacceptor (dimerization) domain, carrying the phosphorylatable His residue <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B63">63</abbr></abbrgrp>. The dimerization domains are quite diverse in their sequence and comprise the His Kinase A (phosphoacceptor) domain clan in Pfam, which unifies four individual domain families: HisKA [Pfam:PF00512 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF00512</url>], HisKA_2 [Pfam:PF07568 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF07568</url>], HisKA_3 [Pfam:PF07730 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF07730</url>, and HWE_HK [Pfam:PF07536 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF02518</url>]. Due to the great variability of the HisKA domains, the results of PSI-BLAST search are largely determined by the presence of the HATPase_c domain and often include other members of the GHKL family, such as DNA gyrase B and DNA repair protein MutL, as well as anti-sigma F factors (SpoIIAB-like Ser/Thr kinases). Due to the presence of long &#945;-helices in the phosphoacceptor domains, they sometimes show spurious low-complexity hits.</p>
            <p><b><it>Methyl-accepting protein (MCP) domain </it></b>(PF00015) [Pfam:PF00015 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF00015</url>] contains long &#945;-helices, which also attract low-complexity hits. However, the extremely high conservation of the (LI)LALNAAIEAARAGExGRGFAVVAxEVR sequence pattern allows a relatively easy recognition of false-positive hits.</p>
            <p><b><it>Ser/Thr/Tyr kinase (STYK) domain </it></b>(PF00069) [Pfam:PF00069 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF00069</url>] belongs to the Protein kinase superfamily clan in Pfam <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Other members of this clan, such as kinases of kanamycin, streptomycin, methylthioribose, homoserine, choline, and 3-deoxy-D-manno-octulosonic acid (KDO), are often retrieved in PSI-BLAST searches. In fact, the latter enzyme, KDO kinase (product of the <it>waaP </it>gene, PF06293 [Pfam:PF06293 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF06293</url>]) often gives much better BLAST scores than certain divergent Ser/Thr kinases. Most of the discrepancies between the data presented here and those in the KinG database <abbrgrp><abbr bid="B64">64</abbr><abbr bid="B65">65</abbr></abbrgrp> could be attributed to those false-positive hits. The most common false-negative hits were the putative protein kinases of ABC1/AarF family (PF03109 [Pfam:PF03109 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF03109</url>] or COG0661 [<url>http://www.ncbi.nlm.nih.gov/COG/new/release/cow.cgi?view=1&amp;cog=COG0661</url>]), which are somehow involved in ubiquinone biosynthesis, most likely by regulating this pathway <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. It should be noted that although members of the ABC1 (<ul>a</ul>ctivity of <ul>bc1</ul>) family are sometimes misannotated as ABC transporters or even ABC transporter substrate binding proteins, this appears to be due to a simple misunderstanding, which I have ignored and counted these proteins as protein kinases.</p>
            <p><b><it>GGDEF domains </it></b>(PF00990 [<url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF00990</url>] from diverse bacteria have diguanylate cyclase activity <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp> and are structurally related to the eukaryotic adenylate cyclase (AC3) domains <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>. While PSI-BLAST searches of GGDEF domains rarely produced any false positive hits, many GGDEF-related domains appeared to be inactivated, some were clearly truncated. The latter ones were excluded from the total count. The most interesting example included a conserved family of proteins (COG3887 [<url>http://www.ncbi.nlm.nih.gov/COG/new/release/cow.cgi?view=1&amp;cog=COG3887</url>]) comprising a fusion of a modified (likely inactivated) GGDEF domain and the DHH-family (PF01368 [Pfam:PF01368 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF01368</url>], <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>) phosphoesterase domain. Members of this family are encoded in genomes of most Firmicutes, including tiny genomes of some <it>Mycoplasma </it>spp., but their function remains unknown.</p>
            <p><b><it>EAL, AC1, AC2, or AC3 domains </it></b>(corresponding to the Pfam entries PF00563 [Pfam:PF00583 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF02518</url>], PF01295 [Pfam:PF01295 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF01295</url>], PF01928 [Pfam:PF01928 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF01928</url>], and PF00211 [Pfam:PF00211 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF00211</url>], respectively) did not return any false-positive hits in PSI-BLAST searches.</p>
            <p><b><it>HD-GYP domain </it></b>is a variant of the widespread HD-type phosphohydrolase (PF01966 [Pfam:PF01966 <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF01966</url>], <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>) domain that contains a C-terminal subdomain with extra conserved residues <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Classical HD domains without the second subdomain often showed up as false-positive hits; these were filtered based on the total length of the BLAST alignment.</p>
            <p>Whenever possible, the domain and protein counts were compared to the published data and all discrepancies were manually verified. Thus, this census has identified 92 HKs in <it>Bradyrhizobium japonicum</it>, 62 HKs in <it>Mesorhizobium loti</it>, and 48 HKs in <it>Sinorhizobium meliloti </it>[see <supplr sid="S1">Additional file 1</supplr>], which was much more than 80, 47 and 40 HKs, respectively, recognized in these bacteria in a recent survey <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. A comparison of the two sets revealed that most of the proteins missing from the HK list by Hagiwara <it>et al</it>.<abbrgrp><abbr bid="B34">34</abbr></abbrgrp> comprise a conserved family (COG3920 [<url>http://www.ncbi.nlm.nih.gov/COG/new/release/cow.cgi?view=1&amp;cog=COG3920</url>]) with an unusual HisKA_2 (PF07568 [<url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF07568</url>]) dimerization domain, which, however, still contains a conserved His residue, confirming that these proteins are true HKs. This and other comparisons showed that, in most cases, different authors correctly identified the core sets of signaling proteins and most discrepancies could be attributed to the different ways of treating divergent, inactivated and truncated sequences. The approach adopted here was to take a middle ground, not counting clearly truncated and highly diverged sequences but keeping in the list full-length domains that might have had inactivating point mutations. For example, although Gly?Ala and Glu?Ala changes in the GGEE motif of the GGDEF domain have been shown to abrogate its diguanylate cyclase activity, sequences with such changes were still counted as diguanylate cyclases, while the truncated sequences in <it>Methanococcus kandleri </it>protein MK0296 [UniProt:Q8TYK1 <url>http://www.expasy.org/uniprot/Q8TYK1</url>], <it>Aeropyrum pernix </it>protein APE1864 [UniProt:Q9YAS9 <url>http://www.expasy.org/uniprot/Q9YAS9</url>, or in COG3887 [<url>http://www.ncbi.nlm.nih.gov/COG/new/release/cow.cgi?view=1&amp;cog=COG3887</url>] proteins (see above) were not. Likewise, <it>Archaeoglobus fulgidus </it>encodes a family of proteins that have a typical HK domain architecture but lack the HATPase domain. Such truncated sequences were not included in the total count [see <supplr sid="S1">Additional file 1</supplr>] but still listed (marked with asterisks) in the supporting files. Since the signaling protein count was based on the domain count, monster multidomain proteins, combining various output domains, such as the hybrid HK-STYK [UniProt:O32393 <url>http://www.expasy.org/uniprot/O32393</url>] described in <it>Spirulina platensis </it><abbrgrp><abbr bid="B70">70</abbr></abbrgrp> or the HK-GGDEF combination, found in <it>Geobacter sulfurreducens </it>protein GSU3350 [UniProt:Q747B7 <url>http://www.expasy.org/uniprot/Q747B7</url>], have been counted more than once.</p>
         </sec>
         <sec>
            <st>
               <p>General trends</p>
            </st>
            <p>The census of signal transduction proteins encoded in complete microbial genomes [see <supplr sid="S1">Additional file 1</supplr>] revealed several interesting trends. It has largely confirmed previous observations <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B4">4</abbr><abbr bid="B71">71</abbr></abbrgrp> that the total number of regulatory proteins encoded by each given organism genome positively correlates with the genome size (Figure <figr fid="F2">2a</figr>) and the total number of encoded proteins (Figure <figr fid="F2">2b</figr>): microbes with complex life styles generally have larger genomes and encode more sophisticated and diverse regulatory systems than parasites with their largely degraded genomes.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>The total number of encoded signal transducers proteins grows with genome size</p>
               </caption>
               <text>
                  <p><b>The total number of encoded signal transducers proteins grows with genome size. </b>A. Distribution of signal transduction proteins among free-living bacteria and archaea (squares) and obligate pathogens (closed squares). Organisms with other status (symbionts and other commensals) are indicated by triangles. The best-fit line represents data from all species. B. Distribution of signal transduction proteins among organisms of different phylogenetic lineages. The symbols indicate members of the following phyla: Actinobacteria, black circles; Cyanobacteria, open circles (light blue); Alpha-proteobacteria, closed diamonds (dark brown); Beta-, Delta-, and Epsilon-proteobacteria, open diamonds (yellow); Gamma-proteobacteria, closed squares (dark blue); Firmicutes, open squares (magenta); members of other bacterial phyla (Aquificales, Bacteroidetes, Chlamydiae, Deinococcus-Thermus, Planctomycetes, Spirochetes, Thermotoga), closed triangles (red); Archaea, open triangles (yellow).</p>
               </text>
               <graphic file="1471-2180-5-35-2"/>
            </fig>
            <p>While small genome size (and the correspondingly low number of signaling systems) is often associated with pathogenicity, there are numerous pathogens with relatively large genomes (e.g. <it>Bordetella parapertussis</it>, <it>Mycobacterium tuberculosis</it>), as well as free-living organisms with very small genome sizes (e.g. <it>Thermoplasma acidophilum</it>, <it>Aquifex aeolicus</it>, see Figure <figr fid="F2">2a</figr>). Many free-living archaea encode very few, if any, proteins involved in signal transduction. For example, among 2977 proteins, encoded by the extreme thermoacidophile <it>Sulfolobus solfataricus</it>, only 9 are signaling (8 STYKs and an AC2). A similar picture has been reported in marine cyanobacteria <abbrgrp><abbr bid="B72">72</abbr></abbrgrp> and is seen in the recently sequenced genome of the ruminal bacterium <it>Mannheimia succiniproducens</it>, which encodes just 5 HKs, an AC, and a STYK [see <supplr sid="S1">Additional file 1</supplr>]. Apparently, the constant and nutrient-rich ruminal environment does not require much signal transduction. These data indicate that organisms inhabiting stable environments can get away with relatively simple signal transduction systems. In contrast, organisms that survive in diverse ecological niches, including facultative pathogens, such as the spirochetes <it>Leptospira interrogans </it>and <it>Treponema denticola</it>, typically encode complex sensory systems. Of course, sophisticated bacteria can also be found in simple and stable environments: <it>Wolinella succinogenes</it>, another ruminal inhabitant <abbrgrp><abbr bid="B73">73</abbr></abbrgrp>, encodes many more signal transduction proteins than other bacteria with similar genome sizes (Table <tblr tid="T1">1</tblr>, see below).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Bacteria with the highest adaptability index ("highest IQ")</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Organism</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Phylum</b>
                           <sup>a</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Signal transducers</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Genome size, kb</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>IQ</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Wolinella succinogenes</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Epsilon</p>
                     </c>
                     <c ca="center">
                        <p>99</p>
                     </c>
                     <c ca="center">
                        <p>2,110</p>
                     </c>
                     <c ca="center">
                        <p>230</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Geobacter sulfurreducens</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Delta</p>
                     </c>
                     <c ca="center">
                        <p>165</p>
                     </c>
                     <c ca="center">
                        <p>3,814</p>
                     </c>
                     <c ca="center">
                        <p>166</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Idiomarina loihiensis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Gamma</p>
                     </c>
                     <c ca="center">
                        <p>80</p>
                     </c>
                     <c ca="center">
                        <p>2,839</p>
                     </c>
                     <c ca="center">
                        <p>153</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Desulfovibrio vulgaris</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Delta</p>
                     </c>
                     <c ca="center">
                        <p>135</p>
                     </c>
                     <c ca="center">
                        <p>3,773</p>
                     </c>
                     <c ca="center">
                        <p>151</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Vibrio cholerae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Gamma</p>
                     </c>
                     <c ca="center">
                        <p>152</p>
                     </c>
                     <c ca="center">
                        <p>4,033</p>
                     </c>
                     <c ca="center">
                        <p>150</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Thermotoga maritima</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Other</p>
                     </c>
                     <c ca="center">
                        <p>34</p>
                     </c>
                     <c ca="center">
                        <p>1,861</p>
                     </c>
                     <c ca="center">
                        <p>145</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Borrelia garinii</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Spiro</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>987</p>
                     </c>
                     <c ca="center">
                        <p>143</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Vibrio vulnificus</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Gamma</p>
                     </c>
                     <c ca="center">
                        <p>200</p>
                     </c>
                     <c ca="center">
                        <p>5,127</p>
                     </c>
                     <c ca="center">
                        <p>136</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Chromobacterium violaceum</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Beta</p>
                     </c>
                     <c ca="center">
                        <p>160</p>
                     </c>
                     <c ca="center">
                        <p>4,751</p>
                     </c>
                     <c ca="center">
                        <p>131</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Thermosynechococcus elongatus</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Cyano</p>
                     </c>
                     <c ca="center">
                        <p>51</p>
                     </c>
                     <c ca="center">
                        <p>2,594</p>
                     </c>
                     <c ca="center">
                        <p>131</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a </sup>&#8211; Beta, Gamma, Delta and Epsilon indicate the corresponding subdivisions of Proteobacteria; Cyano indicates cyanobacteria; Spiro indicates Spirochetes.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Bacterial IQ</p>
            </st>
            <p>The total number of signaling proteins encoded in a given genome (or, rather, the fraction of such proteins among all encoded in the genome) can be used as a measure of the adaptive potential of an organism, some kind of 'bacterial IQ'. The slope of the best-fit line on Figure <figr fid="F2">2a</figr> is 2.03, meaning that the total number of signal transduction proteins grows approximately as a square of the genome size. The organisms whose genomes deviate most from this trend can be considered particularly 'smart' or 'dumb' compared to their relatives. There could be different ways to evaluate the relative abundance of signal transduction proteins at the given genome size; the data in Table <tblr tid="T1">1</tblr> were calculated using the following formula:</p>
            <p>IQ = 5 &#215; 10<sup>4 </sup>(n-5)<sup>1/2 </sup>L<sup>-1</sup>,</p>
            <p>where n is the total number of signal transduction proteins, L is the complete genome size in kb (even counting plasmids, it is a more consistent measure than the number of predicted proteins), 5 &#215; 10<sup>4 </sup>and 5 are arbitrarily chosen empirical coefficients, so that IQ = 100 corresponds to 9 signal transducers in a 1000 kb genome and to 105 transducers in a 5000 kb genome. Accordingly, the IQ value is not defined for organisms with less than 6 signal transduction proteins.</p>
            <p>With one exception, all organisms listed in Table <tblr tid="T1">1</tblr> are environmental gram-negative bacteria (most gram-positive bacteria and archaea scored much lower) that are highly motile and are known to use a wide variety of electron donors and electron acceptors <abbrgrp><abbr bid="B73">73</abbr><abbr bid="B74">74</abbr><abbr bid="B75">75</abbr><abbr bid="B76">76</abbr></abbrgrp>. Such versatile organisms as <it>Chromobacterium violaceum, Desulfovibrio vulgaris</it>, <it>Geobacter sulfurreducens</it>, <it>Vibrio vulnificus</it>, and <it>Wolinella succinogenes </it>are also repeatedly found among the leaders in individual categories (Table <tblr tid="T2">2</tblr>), both in terms of absolute number of signal transduction proteins and of their fraction among all encoded proteins. Remarkably, most of the winners come from the relatively poorly characterized beta-, delta- and epsilon- subdivisions of Proteobacteria. This illustrates the limitations of relying just on <it>Escherichia coli </it>and <it>Bacillus subtilis </it>as model organisms for studying signaling transduction in environmental organisms. The recent efforts on the post-genomic analysis of the versatile gamma-proteobacterium <it>Shewanella oneidensis </it><abbrgrp><abbr bid="B77">77</abbr></abbrgrp>, which encodes a decent set of 46 HKs, 26 MCPs, 7 STYKs, 3 ACs and 52 GGDEF, 28 EAL, and 9 HD-GYP domains [see <supplr sid="S1">Additional file 1</supplr>] might be a step in the right direction. In contrast, <it>E. coli </it>appears to have a relatively low IQ. Although its 30 HKs, 19 GGDEF and 17 EAL domains at first seemed like a high number <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B26">26</abbr></abbrgrp>, <it>E. coli</it>, as well as <it>Salmonella </it>spp. and <it>Yersinia </it>spp., other members of <it>Enterobacteriaceae</it>, looks pretty 'dumb' compared to the representatives of <it>Pseudomonadaceae</it>, <it>Vibrionaceae</it>, or <it>Xanthomonadaceae</it>, particularly with respect to chemotaxis: any sequenced member of the three latter families encodes many more MCPs than the meager 5 MCPs in <it>E. coli</it>. The deep-sea bacterium <it>Idiomarina loihiensis</it>, which belongs to yet another gamma-proteobacterial lineage and whose protein set is just 62% of that of <it>E. coli </it><abbrgrp><abbr bid="B78">78</abbr></abbrgrp>, encodes more diguanylate cyclases and 3 times more MCPs than <it>E. coli</it>. The delta-proteobacterium <it>Bdellovibrio bacteriovorus</it>, a predator that infects an <it>E. coli </it>cell and grows in its periplasmic space, also turned out to have a higher IQ: it has a smaller genome than <it>E. coli </it>but encodes almost twice as many HKs and four times more MCPs.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Bacteria and archaea with the highest proportion of encoded signaling proteins of each type</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Organism</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Phylum</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>No. proteins (%total)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Histidine kinases</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Nostoc </it>sp. PCC 7120</p>
                     </c>
                     <c ca="left">
                        <p>Cyano</p>
                     </c>
                     <c ca="center">
                        <p>134 (2.2%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Geobacter sulfurreducens</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Delta</p>
                     </c>
                     <c ca="center">
                        <p>92 (2.7%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Bacteroides thetaiotaomicron</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Other</p>
                     </c>
                     <c ca="center">
                        <p>85 (1.8%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Rhodopseudomonas palustris</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Alpha</p>
                     </c>
                     <c ca="center">
                        <p>66 (1.4%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Desulfovibrio vulgaris</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Delta</p>
                     </c>
                     <c ca="center">
                        <p>64 (1.8%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Wolinella succinogenes</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Epsilon</p>
                     </c>
                     <c ca="center">
                        <p>39 (1.9%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Haloarcula marismortui</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Archaea</p>
                     </c>
                     <c ca="center">
                        <p>59 (1.4%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>MCPs</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Vibrio vulnificus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Gamma</p>
                     </c>
                     <c ca="center">
                        <p>52 (1.2%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Pseudomonas syringae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Gamma</p>
                     </c>
                     <c ca="center">
                        <p>48 (0.9%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Vibrio cholerae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Gamma</p>
                     </c>
                     <c ca="center">
                        <p>45 (1.2%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Chromobacterium violaceum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Beta</p>
                     </c>
                     <c ca="center">
                        <p>42 (1.0%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Clostridium acetobutylicum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Firmicutes</p>
                     </c>
                     <c ca="center">
                        <p>38 (1.0%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Wolinella succinogenes</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Epsilon</p>
                     </c>
                     <c ca="center">
                        <p>31 (1.5%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Halobacterium salinarium</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Archaea</p>
                     </c>
                     <c ca="center">
                        <p>17 (0.6%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Ser/Thr protein kinases</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Rhodopirellula baltica</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Other</p>
                     </c>
                     <c ca="center">
                        <p>60 (0.8%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Nostoc </it>sp. PCC 7120</p>
                     </c>
                     <c ca="left">
                        <p>Cyano</p>
                     </c>
                     <c ca="center">
                        <p>52 (0.9%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Streptomyces coelicolor</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Actino</p>
                     </c>
                     <c ca="center">
                        <p>37 (0.4%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Streptomyces avermitilis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Actino</p>
                     </c>
                     <c ca="center">
                        <p>35 (0.4%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Gloeobacter violaceus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Cyano</p>
                     </c>
                     <c ca="center">
                        <p>20 (0.4%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>.Thermosynechococcus elongatus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Cyano</p>
                     </c>
                     <c ca="center">
                        <p>17 (0.7%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Sulfolobus tokodaii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Archaea</p>
                     </c>
                     <c ca="center">
                        <p>12 (0.4%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Diguanylate cyclases</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Vibrio vulnificus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Gamma</p>
                     </c>
                     <c ca="center">
                        <p>66 (1.5%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Shewanella oneidensis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Gamma</p>
                     </c>
                     <c ca="center">
                        <p>52 (1.2%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Vibrio parahaemolyticus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Gamma</p>
                     </c>
                     <c ca="center">
                        <p>44 (0.9%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Chromobacterium violaceum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Beta</p>
                     </c>
                     <c ca="center">
                        <p>43 (1.0%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Vibrio cholerae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Gamma</p>
                     </c>
                     <c ca="center">
                        <p>41 (1.1%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Idiomarina loihiensis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Gamma</p>
                     </c>
                     <c ca="center">
                        <p>33 (1.3%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Adenylate cyclases</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Bradyrhizobium japonicum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Alpha</p>
                     </c>
                     <c ca="center">
                        <p>37 (0.4%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Sinorhizobium meliloti</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Alpha</p>
                     </c>
                     <c ca="center">
                        <p>28 (0.5%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Leptospira interrogans</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Spiro</p>
                     </c>
                     <c ca="center">
                        <p>18 (0.4%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Mycobacterium bovis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Actino</p>
                     </c>
                     <c ca="center">
                        <p>16 (0.4%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Mycobacterium tuberculosis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Actino</p>
                     </c>
                     <c ca="center">
                        <p>16 (0.4%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Treponema denticola</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Spiro</p>
                     </c>
                     <c ca="center">
                        <p>9 (0.3%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>HD-GYP domains</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Desulfovibrio vulgaris</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Delta</p>
                     </c>
                     <c ca="center">
                        <p>14 (0.4%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Vibrio vulnificus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Gamma</p>
                     </c>
                     <c ca="center">
                        <p>13 (0.3%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Chromobacterium violaceum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Beta</p>
                     </c>
                     <c ca="center">
                        <p>11 (0.2%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Thermotoga maritima</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Other</p>
                     </c>
                     <c ca="center">
                        <p>10 (0.5%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Desulfotalea psychrophila</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Delta</p>
                     </c>
                     <c ca="center">
                        <p>10 (0.3%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Geobacter sulfurreducens</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Delta</p>
                     </c>
                     <c ca="center">
                        <p>10 (0.3%)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>a &#8211; Beta, Gamma, Delta and Epsilon indicate the corresponding subdivisions of Proteobacteria; Cyano indicates cyanobacteria; Spiro indicates Spirochetes.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic distribution of signaling systems</p>
            </st>
            <p>Histidine kinases are by far the predominant type of sensory proteins (Figure <figr fid="F1">1</figr>), whose distribution in all sequenced organisms generally follows the power law (Figures <figr fid="F3">3a</figr> and <figr fid="F4">4a</figr>). The relative abundance of other types of receptors, however, varies widely among organisms of different phylogenetic lineages (Figure <figr fid="F3">3b&#8211;f</figr>). Still, the distribution of their total number also follows power law (Figure <figr fid="F4">4b</figr>). These observations will be analyzed in detail elsewhere. The following is just a brief listing of several unexpected trends:</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Phylogenetic distribution of certain types of signal transducers</p>
               </caption>
               <text>
                  <p><b>Phylogenetic distribution of certain types of signal transducers. </b>A. Histidine kinases. B. Methyl-accepting chemotaxis proteins. C. Ser/Thr/Tyr-protein kinases. D. GGDEF domains (active and inactive diguanylate cyclases). E. EAL domains (active and inactive c-di-GMP phosphodiesterases). F. Adenylate cyclases. The symbols for various phyla are shown at the bottom and are the same as in Fig. 2b.</p>
               </text>
               <graphic file="1471-2180-5-35-3"/>
            </fig>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Distribution of the signal transduction proteins follows the power law</p>
               </caption>
               <text>
                  <p><b>Distribution of the signal transduction proteins follows the power law. </b>A. Distribution of histidine kinases. B. Distribution of the total number of all signal transduction proteins except for histidine kinases encoded in a given genome. The symbols for various phyla are as in Fig. 2b.</p>
               </text>
               <graphic file="1471-2180-5-35-4"/>
            </fig>
            <p>1. Archaea do not encode AC1- or AC3-type adenylate cyclases, diguanylate cyclases or c-di-GMP-specific phosphodiesterases (with the exception of several highly diverged and probably inactive ORFs), but encode a fair amount of STYKs. In 11 of 20 archaeal genomes, STYKs and class 2 ACs are the only recognizable proteins involved in signal transduction. More than a half of all sequenced archaeal genomes do not encode any MCPs, others encode from 2 to 5 and only the two halophilic species have a large number of MCPs (17 each, Figure <figr fid="F3">3b</figr>).</p>
            <p>2. Actinobacteria do not encode MCPs or, for that matter, any other chemotaxis or flagellar proteins (the only one that does, <it>Symbiobacterium thermophilum</it>, probably does not belong to the actinobacterial lineage <abbrgrp><abbr bid="B79">79</abbr></abbrgrp>). Instead, actinobacteria encode relatively large numbers of HKs and STYKs (Figure <figr fid="F3">3a,c</figr>). As noted previously, <it>Mycobacterium tuberculosis </it>encodes a relatively high number of AC3s <abbrgrp><abbr bid="B80">80</abbr></abbrgrp>, as do two other mycobacteria, <it>M</it>. <it>bovis </it>and <it>M</it>. <it>avium</it>, but not <it>M</it>. <it>leprae </it>(16, 16, 12, and 4, respectively). The regulators of these AC3s remain unknown, although some ACs have been implicated in sensing of the bicarbonate level <abbrgrp><abbr bid="B81">81</abbr></abbrgrp>. The dramatically lower number of signaling proteins in <it>M. leprae</it>, compared to other mycobacteria, is in line with the general picture of genome decay in this organism <abbrgrp><abbr bid="B82">82</abbr></abbrgrp>.</p>
            <p>3. Cyanobacteria encode large numbers of HKs and STYKs, but very few MCPs (e.g. 134, 52 and 3, respectively, in <it>Nostoc </it>PCC 7120 [see <supplr sid="S1">Additional file 1</supplr>]). These data are consistent with previous observations that cyanobacteria encode just several highly conserved MCPs <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> and regulate their motility using HKs (phytochromes) <abbrgrp><abbr bid="B83">83</abbr><abbr bid="B84">84</abbr></abbrgrp> and ACs <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>.</p>
            <p>4. There is great variation between different subdivisions of Proteobacteria with very few common trends. Proteobacteria generally encode few, if any, STYKs, but a large number of MCPs and diguanylate cyclases. The number of ACs is relatively low, except for representatives of the alpha-subdivision. While gamma-proteobacteria typically encode a single AC1 and no more than one AC3, in <it>Pseudomonas aeruginosa </it>this sole AC3 is important for virulence <abbrgrp><abbr bid="B85">85</abbr></abbrgrp>.</p>
            <p>5. Several bacterial phyla that currently have only a handful of sequenced representatives show highly biased patterns of signal transducer distribution. For example, four sequenced members of the Bacteroidetes (formerly the CFB group) encode a relatively large number of HKs (85 in <it>Bacteroides thetaiotaomicron</it>), but few or no STYKs and no MCPs, ACs or diguanylate cyclases. It would be interesting to see if this trend holds when more genomes of this lineage become available.</p>
         </sec>
         <sec>
            <st>
               <p>Variation in IQ between close relatives</p>
            </st>
            <p>The recent genomic data revealed substantial differences in gene content among different strains that, judging by the level of 16S rRNA identity, belong to the same bacterial species <abbrgrp><abbr bid="B86">86</abbr><abbr bid="B87">87</abbr></abbrgrp>. It is therefore not surprising to see dramatic differences in signaling protein content among different species of the same genus. Still, different members of the <it>Bacillus </it>genus show very similar distributions of signaling proteins [see <supplr sid="S1">Additional file 1</supplr>]. In contrast, three sequenced genomes of <it>Clostridium </it>spp. encode dramatically different numbers of MCPs (38 in <it>C. acetobutylicum</it>, 20 in <it>C. tetani </it>and 0 in <it>C. perfringens</it>) and HD-GYP domains (9, 1, and 1, respectively), whereas the content of other signaling proteins is more or less in line with the genome sizes. Accordingly, <it>C. acetobutylicum </it>makes it into the winners list in both MCP and HD-GYP categories (Table <tblr tid="T2">2</tblr>).</p>
            <p>Although not seen in the current data set, domains that are missing in one strain were sometimes found in a different strain of the same species. Thus, although this domain census shows the absence of HD-GYP domains in <it>Yersinia pestis </it>strain CO92 and in <it>Bacillus cereus </it>strain ATCC 14579 [see <supplr sid="S1">Additional file 1</supplr>], this domain is encoded in <it>Y. pestis </it>strain KIM and <it>B. cereus </it>strain ZK. These differences indicate that signaling proteins can be easily acquired and lost, so all observations on the presence or absence of certain signaling system in a certain organism are only as good as the current genome set.</p>
         </sec>
         <sec>
            <st>
               <p>Transmembrane and intracellular sensors: Extroverts and introverts</p>
            </st>
            <p>Analysis of complete microbial genomes revealed complex systems of intracellular monitoring that included PAS- and GAF-containing proteins with a variety of output domains <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. The fraction of membrane-bound proteins among all signal transduction proteins encoded in each given genome was evaluated here using three different methods for predicting transmembrane (TM) segments, followed by manual analysis of the outputs. The census showed that while the great majority of HKs and MCPs were membrane-bound, as much as one-third of all HKs and one-sixth of all MCPs did not contain a single TM segment (Figure <figr fid="F1">1</figr>, Additional file <supplr sid="S1">1</supplr>). In contrast, only about a half of all adenylate and diguanylate cyclases and c-di-GMP phosphodiesterases were membrane-bound; a majority of STYKs and HD-GYP domains were soluble (Figure <figr fid="F1">1</figr>).</p>
            <p>It must be noted that not every membrane-bound signal transduction protein is necessarily a sensor of the environmental parameters. An obvious example among HKs is the turgor sensor KdpD, where TM segments serve solely as anchors <abbrgrp><abbr bid="B88">88</abbr></abbrgrp>. Aer, the energy-sensing MCP, presents a similar case <abbrgrp><abbr bid="B89">89</abbr></abbrgrp>. Conversely, some cytoplasmic sensors might actually sense extracellular signals, e.g. when the sensing domains are present on separate transmembrane polypeptides, as is the case with CheA, the chemotaxis HK. Furthermore, many cytoplasmic sensors respond to signals that are membrane-permeable, such as light, oxygen, H<sub>2</sub>O<sub>2</sub>; NH<sub>3</sub>, and should not be considered purely external or internal. Keeping in mind all these caveats, the predominance of extracellular or intracellular transducers can be used to distinguish organisms that are concerned primarily with sensing environmental parameters ("extroverts") from those more closely monitoring the intracellular milieu ("introverts").</p>
            <p>In obligately parasitic bacteria that encode only a handful of signal transduction proteins, most of these proteins are membrane-bound [see <supplr sid="S1">Additional file 1</supplr>]. However, Figure <figr fid="F5">5a</figr> shows that once the total number of signal transduction proteins goes beyond a dozen, the fraction of them that are membrane-bound stabilizes at about 60%, approximately the same in representatives of all phyla, except for Cyanobacteria and Archaea. In the latter two groups, the fraction of membrane-bound signal transduction proteins is close to 30% and also shows very little variance (Figure <figr fid="F5">5a</figr>). Although, as mentioned above, cyanobacteria encode very few MCPs, this fact alone cannot account for the difference between them and all other bacteria. A comparison of other types of signaling proteins encoded in cyanobacteria and proteobacteria (Figure <figr fid="F5">5b</figr>) shows the prevalence of soluble proteins among cyanobacterial HKs, STYKs and GGDEF domains, compared to the prevalence of TM proteins among the same groups of proteins in proteobacteria. The difference between cyanobacterial and archaeal proteins on one hand and proteins from other lineages is most clearly seen in the comparison of HKs (Figure <figr fid="F5">5c</figr>). Remarkably, actinobacteria and firmicutes turn out to be firm extroverts with relatively few intracellular HKs; some of the latter, however, are known to participate in regulation of sporulation <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. This schism between cyanobacteria and all other bacteria with completely sequenced genomes is likely to be due to the much more complex organization of the cyanobacterial cell, which contains intracellular membranes harboring the photosynthetic reaction centers. Among other autotrophic prokaryotes, prevalence of intracellular proteins is seen in methanogenic archaea and in the green sulfur bacterium <it>Chlorobium tepidum</it>, although not in the phototrophic alpha-proteobacterium <it>Rhodopseudomonas palustris </it>[see <supplr sid="S1">Additional file 1</supplr>]. Since archaeal heterotrophs also show low amounts of TM signaling proteins, there does not seem to be a direct connection between 'introvertness' and autotrophic metabolism.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Phylogenetic distribution of membrane-bound signal transduction proteins</p>
               </caption>
               <text>
                  <p><b>Phylogenetic distribution of membrane-bound signal transduction proteins. </b>A. Phylogenetic distribution of the total number of transmembrane signal transducers. The best-fit lines are shown for proteins from gamma-bacteria (dark blue) and cyanobacteria (cyan). The symbols for various phyla are as in Fig. 2b. B. Transmembrane histidine kinases (squares), Ser/Thr kinases (circles) and diguanylate cyclases (triangles) in all proteobacteria (open symbols) and cyanobacteria (closed symbols). The best-fit lines are shown for proteobacterial (dark blue) and cyanobacterial (cyan) histidine kinases. C. Phylogenetic distribution of transmembrane histidine kinases. The best-fit lines are shown for actinobacterial (black) and cyanobacterial (cyan) histidine kinases.</p>
               </text>
               <graphic file="1471-2180-5-35-5"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>This paper has grown out of a survey of signal transduction systems in several alpha- and gamma-proteobacteria prepared for a recent review (Table <tblr tid="T1">1</tblr> in ref <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>). It turned out that mere 'counting the senses' could help understand bacterial behavior. For example, as discussed earlier, genomes of two alpha-proteobacteria, <it>Caulobacter crescentus </it>and <it>Mesorhizobium loti</it>, encode the same number of HKs but the former one encodes 19 MCPs compared to just one in <it>M. loti </it>[see <supplr sid="S1">Additional file 1</supplr>]. In contrast, <it>M. loti </it>encodes 13 copies of AC3, compared to just two of them in <it>C. crescentus </it>(<abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, [see <supplr sid="S1">Additional file 1</supplr>]). Such observations could provide a useful insight into the physiology of many obscure bacteria whose genomes have been sequenced in the last several years or will be sequenced in the near future. I have therefore updated our previous listing of signal transduction proteins encoded in microbial genomes <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> to cover the genomes sequenced in the past five years.</p>
         <sec>
            <st>
               <p>Defining the set of signaling proteins</p>
            </st>
            <p>For the purposes of this study, the set of surveyed signal transduction proteins has been limited to just 7 classes of proteins: histidine kinases, methyl-accepting chemotaxis receptors, Ser/Thr protein kinases, adenylate and diguanylate cyclases, c-di-GMP phosphodiesterases with the EAL domain and predicted phosphodiesterases with the HD-GYP domain [see <supplr sid="S1">Additional file 1</supplr>]. Certainly, this list is far from being complete. In a general sense, any cellular protein that participates in cellular adaptation to the changing environment can be considered part of the signaling machinery. Thus, AraC-type transcription regulator, whose DNA-binding properties are modulated by arabinose binding to its N-terminal domain <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, could also be treated as an intracellular signal transducer. According to a recent study by Ulrich, Koonin, and Zhulin, such 'one-component' signalers comprise a majority of signal transduction systems and were the first to arise in evolution <abbrgrp><abbr bid="B90">90</abbr></abbrgrp>. More sophisticated mechanisms of signal transduction include two-component (HK and RR) signal transduction systems and a variety of other signaling systems that have been described only in the past several years (see <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B39">39</abbr></abbrgrp> for reviews).</p>
            <p>This census considered only dedicated signaling systems that consist of more than two individual components. Therefore, transcriptional regulators, even those of complex domain architecture, were left out (for a comprehensive survey of helix-turn-helix-type (HTH) transcriptional regulators, see <abbrgrp><abbr bid="B91">91</abbr></abbrgrp>). I have also left out response regulators, which are typically considered together with HKs. One of the reasons for that was the frequent confusion between three classes of response regulators: (i) the single-domain chemotaxis response regulator CheY that transmits the signal through protein-protein interactions; (ii) the DNA-binding response regulators of the CheY-HTH domain architecture, and (iii) the response regulators with CheY-AC, CheY-GGDEF or CheY-GGDEF-EAL domain architectures, which produce secondary messengers, cAMP and c-di-GMP. Here, various proteins containing AC, GGDEF, EAL or HD-GYP domains have been lumped together, just as the chemotaxis signal transduction kinase CheA is typically treated as sensor kinase, despite being just a transmitter in the signaling cascade going from MCPs to the flagellar motor. This approach differed from that of Ulrich <it>et al</it>. <abbrgrp><abbr bid="B90">90</abbr></abbrgrp>, who included diguanylate cyclases and c-di-GMP phosphodiesterases (GGDEF and EAL domains, respectively) into the 'one-component' set.</p>
            <p>Another important omission in this survey are Ser/Thr protein phosphatases, which can dephosphorylate STYKs, modulating their activity, and should also be able to dephosphorylate the cellular targets of STYKs. However, several surveys of these enzymes have been published recently <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B39">39</abbr><abbr bid="B92">92</abbr></abbrgrp>, and more are apparently on the way. Due to the difficulties in separating true protein phosphatases from phosphatases of other specificities that often produce false-positive hits I have chosen to exclude them from this survey. Several other systems of the bacterial signal transduction machinery have also been left out. These include (i) Ser/Thr kinases of the bacterial (GHKL) type that regulate the activity of the RNA polymerase sigma subunit; (ii) HPr<sup>Ser </sup>kinase/phosphorylase and other components of the bacterial PEP-dependent phosphotransferase systems, which regulate chemotaxis, membrane transport (inducer exclusion), and catabolite repression; (iii) the systems that regulate RNA and protein degradation; and many others. A census of each of these systems could be an interesting project in its own right.</p>
            <p>The limited scope of this survey, which did not include the sophisticated sporulation machinery of the firmicutes and certain unique (potentially signaling) archaeal domains, could be a reason why representatives of these two groups have generally scored low in the IQ category. Including those proteins into a future version of this census might partly correct that bias, although that would increase the degree of 'introvertness' among archaea even further.</p>
         </sec>
         <sec>
            <st>
               <p>Caveats of automated domain counting</p>
            </st>
            <p>Even within the limited scope of this survey, there is a lot of space for controversy. There are no clear criteria to decide which proteins should be considered HKs or STYKs and which should be not. Thus, the discrepancies of the results presented here and in the papers by the Mizuno group <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B16">16</abbr><abbr bid="B34">34</abbr></abbrgrp> can all be attributed to their more conservative approach to defining HKs. The survey by Kim and Forst <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> shows a similar undercount of non-canonical HKs. In contrast, counting STYKs in the KinG database <abbrgrp><abbr bid="B64">64</abbr></abbrgrp> used more permissive criteria than those employed here, which resulted in KDO kinases and other related kinases being counted as STYKs. For other signaling domains, there was much less room for disagreement. The counts of MCPs and ACs, presented here, are very similar to those reported, respectively, by Zhulin <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> and Shenoy and Visweswariah <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. All our data with supporting information are available on a public web site <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, which should provide an easy way to analyze any discrepancies and, if necessary, correct the final count.</p>
         </sec>
         <sec>
            <st>
               <p>Do numbers really matter?</p>
            </st>
            <p>It is well known that growth in bacterial genome size is accompanied by accumulation of paralogous protein families, which can be easily seen in lineage-specific expansions of transcriptional regulators, metabolic enzymes, and/or surface proteins <abbrgrp><abbr bid="B93">93</abbr><abbr bid="B94">94</abbr><abbr bid="B95">95</abbr></abbrgrp>. It can be argued therefore that the sheer number of signal transduction proteins encoded in a bacterial genome is hardly a good measure of its IQ, as many of these proteins are closely related paralogs. It would seem, however, that lineage-specific expansions that have been fixed in evolution must be of some value to the host organism. Among metabolic enzymes, there are indications of functional diversification even among close paralogs <abbrgrp><abbr bid="B96">96</abbr></abbrgrp>. As for signaling proteins, Valley Stewart and colleagues have shown that NarQ and NarX, two paralogous HKs in <it>E. coli</it>, have similar but non-identical functions in modulating cellular response to nitrate and nitrite <abbrgrp><abbr bid="B97">97</abbr><abbr bid="B98">98</abbr></abbrgrp>. Likewise, out of 12 GGDEF domain-containing proteins &#8211; potential diguanylate cyclases &#8211; encoded in <it>Salmonella </it>Typhimurium genome, one, AdrA, was found to be primarily responsible for regulating biofilm formation in a complex medium, whereas another, STM1987, was critical for biofilm formation in the nutrient-poor medium <abbrgrp><abbr bid="B99">99</abbr><abbr bid="B100">100</abbr></abbrgrp>. These data show that we should be very careful in assigning the same function even to closely related paralogs. Differential regulation of expression and activity of paralogous signal transduction proteins could be yet another sophisticated mechanism allowing the bacterial cell to fine-tune its response to environmental changes. Therefore, until there is clear evidence that functions of paralogous signal transduction proteins are truly identical, the total number of such proteins remains the best measure of the bacterial IQ.</p>
         </sec>
         <sec>
            <st>
               <p>Intracellular signaling</p>
            </st>
            <p>One of the most significant insights to emerge from comparative genome analysis was the recognition of the vast system of intracellular signaling in bacteria. It became clear that many bacteria encode complex systems of intracellular monitoring whose domain organization is very similar to that used in transmembrane signaling: a sensor domain (typically, PAS and/or GAF), followed by HK, AC, GGDEF or EAL output domains <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. In certain cases, soluble HKs, MCPs, and ACs have been experimentally characterized and shown to be involved in monitoring levels of intracellular ATP, oxygen, CO, bicarbonate, nitrate, reactive nitrogen species, and other metabolites and modulating the cellular response to the changes in these parameters <abbrgrp><abbr bid="B101">101</abbr><abbr bid="B102">102</abbr><abbr bid="B103">103</abbr><abbr bid="B104">104</abbr><abbr bid="B105">105</abbr></abbrgrp>. Some intracellular sensors appeared to be specifically geared towards unusual substrates used by the particular bacterium, such as methanol and formaldehyde in <it>Paracoccus denitrificans </it>and <it>Methylobacterium organophilum </it><abbrgrp><abbr bid="B106">106</abbr><abbr bid="B107">107</abbr></abbrgrp>. In the recently sequenced genome of <it>Dehalococcoides ethenogenes</it>, a major detoxifier of chlorinated organic pollutants, many soluble HKs were found encoded in close proximity to the genes for reductive halogenases, the enzymes that catalyze the dechlorination reactions <abbrgrp><abbr bid="B108">108</abbr></abbrgrp>. It was proposed that these HKs respond to intracellular rather than extracellular stimuli, stimulating the expression of reductive halogenases in response to the presence of their chlorinated substrates <abbrgrp><abbr bid="B108">108</abbr></abbrgrp>.</p>
            <p>This census shows that intracellular signal transduction proteins comprise a significant fraction of all signal transducers encoded in almost any bacterial genome. However, most of them are still uncharacterized and have yet to be recognized as legitimate members of the bacterial signaling network. The finding that these proteins are abundant in many pathogenic as well as free-living bacteria should help focus the attention of the research community on these novel components of the signal transduction network.</p>
            <p>The predominance of intracellular signal transduction proteins in cyanobacteria is in stark contrast with the far smaller proportion of such proteins in other bacterial lineages. There could be several possible reasons for this 'introvertness', all linked to the ability of cyanobacteria to conduct oxygenic photosynthesis. Firstly, cyanobacteria harbor a complex system of intracellular membranes carrying the photosynthetic reaction centers. Intracellular signaling proteins could be needed to control formation and functioning of the photosynthetic system, as well as the transition from phototrophic to heterotrophic metabolism and back. The compartmentalization of the cellular interior probably requires a sophisticated system of monitoring conditions within the individual compartments. Last but not the least, cyanobacteria are unique among (known) prokaryotes in that their cells generate oxygen, which other bacteria try to keep outside the cell. The presence of oxygen affects the redox balance in the cytoplasm and leads to oxidative damage of numerous cellular compounds, including ATP, methionine, cysteine, and many others. It is very likely that numerous intracellular HKs that contain PAS domains are involved in maintaining the constant level of the redox potential in the cyanobacterial cell. Surprisingly, <it>Rhodopseudomonas palustris</it>, an alpha-proteobacterium that is also capable of transition between autotrophic and heterotrophic metabolism, does not appear to be an 'introvert' [see <supplr sid="S1">Additional file 1</supplr>]. Hence, it seems that the trend of autotrophic bacteria and archaea being more of 'introverts' and heterotrophic bacteria being more of 'extroverts' might be biased by the current selection of the completely sequenced genomes. It would be interesting to see whether this trend holds when more genomes of bacterial photo- and chemolitotrophs become available.</p>
         </sec>
         <sec>
            <st>
               <p>Phylum-specific bias and evolution of signal transduction</p>
            </st>
            <p>The knowledge of the phylogenetic distribution of signal transduction systems allows a better understanding of their evolution. Previous analysis of HKs and RRs by Koretke and colleagues led to the conclusion that two-component systems originated in bacteria and radiated into two other domains of life through multiple events of horizontal gene transfer <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. HKs and STYKs appear to be the principal signal transduction proteins in archaea, suggesting that these two classes of proteins could be already present in the last common ancestor of all living organisms (LUCA, <abbrgrp><abbr bid="B92">92</abbr><abbr bid="B109">109</abbr><abbr bid="B110">110</abbr></abbrgrp>). The absence of AC3-type adenylate cyclases, diguanylate cyclases and c-di-GMP phosphodiesterases in any of the sequenced archaeal genomes is quite remarkable. In fact, the only full-size archaeal AC3 domain known to date has been found in an uncultivated psychrophilic crenarchaeote that exhibited numerous cases of horizontal gene transfer <abbrgrp><abbr bid="B111">111</abbr></abbrgrp>. Most archaea, however, encode ACs of class 2 (COG1437 [<url>http://www.ncbi.nlm.nih.gov/COG/new/release/cow.cgi?view=1&amp;cog=COG1437</url>]), which are found in only a handful of organisms outside Archaea <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. These data show that although cAMP is a truly universal second messenger, different domains of life utilize different enzymes for its production and probably employ entirely different mechanisms of cAMP-dependent signaling.</p>
            <p>Another remarkable example is the diversity of outputs of the chemotaxis machinery. Although all MCPs counted in this work are very similar, it has been noted <abbrgrp><abbr bid="B112">112</abbr></abbrgrp> that chemotactic signals in diverse bacteria and archaea are being transduced to at least three different motility apparata: the bacterial flagellum, the archaeal flagellum that is unrelated to the bacterial one <abbrgrp><abbr bid="B113">113</abbr></abbrgrp>, and to the type IV pili, which are responsible for gliding motility of cyanobacteria and certain other bacteria <abbrgrp><abbr bid="B84">84</abbr><abbr bid="B114">114</abbr></abbrgrp>.</p>
            <p>In general, variability of signal transduction protein content in closely related bacteria, uneven distribution of these proteins among well-established phylogenetic lineages, and the presence in many genomes of tight clusters of closely related paralogs indicate that signaling proteins can be easily acquired and lost. Lineage-specific gene duplication and gene loss and lateral gene transfer probably play a key role in shaping the signaling protein repertoire of each given organism. Why, then, would the total number of signal transduction proteins grow as a square of the genome size (Figure <figr fid="F2">2a</figr>) across a wide variety of microorganisms with diverse lifestyles, phylogenetic affinities, and metabolic capabilities? It is tempting to suggest that there must be an underlying mechanism supporting this correlation. For example, the power-law distribution of HKs (Figure <figr fid="F4">4a</figr>) might stem from the simple fact that the number of binary interactions grows as a square of the number of interacting components <abbrgrp><abbr bid="B115">115</abbr></abbrgrp>, so that the number of sensory proteins that manage the linearly growing number of metabolic enzymes has to grow as a square of that number. This explanation is somewhat similar to the one offered by van Nimwegen to explain his observation that the number of transcriptional regulators in bacteria also grows as a square of the genome size <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>, although his analysis did not include two-component systems. This was also the rationale behind the decision to measure bacterial IQ as a square root, rather than a linear function, of the total number of encoded signal transduction proteins (see the Results section). However, HKs comprise only but a half of all signal transduction proteins counted in this work [see <supplr sid="S1">Additional file 1</supplr>]. The distribution of other types of signal transducers is even more fascinating: while distribution of each individual class of proteins seems almost random (Figures <figr fid="F3">3b&#8211;f</figr>), their <b><it>total </it></b>number still grows approximately as a square of genome size (Figure <figr fid="F4">4b</figr>). One could speculate that this quadratic dependence determines a near-optimal number of signal transducers at a given genome size. This would mean that during their adaptation to different ecological niches, bacteria evolve to rely primarily on certain types of signal transduction, while other types of transducers can be lost (or not fixed in the genome when acquired by lateral gene transfer). For example, during the reductive evolution of chlamydia, HKs and STYKs were retained, while all other transducers and were lost [see <supplr sid="S1">Additional file 1</supplr>]. In contrast, spirochetes held on to their chemotaxis transducers but mostly lost their STYKs. The recent evidence for non-canonical roles of signal transduction proteins, e.g. regulation of gene expression by the chemotaxis system <abbrgrp><abbr bid="B116">116</abbr></abbrgrp> and regulation of chemotaxis by adenylate cyclases <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, suggests that there is certain flexibility in functions of different transducers that could be used by bacterial evolution to generate even greater diversity of signal transduction mechanisms.</p>
         </sec>
         <sec>
            <st>
               <p>Future developments</p>
            </st>
            <p>The goal of genome analysis is to predict the organism's physiology and behavior based solely on the genomic sequence. There has been great progress in predicting metabolic pathways <abbrgrp><abbr bid="B110">110</abbr><abbr bid="B117">117</abbr><abbr bid="B118">118</abbr></abbrgrp>; deciphering signaling pathways so far has lagged behind. Accumulation of complete genome sequences has led to the delineation of many new signaling and signal transduction domains and caused a revolution on our understanding of bacterial regulatory networks <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B20">20</abbr><abbr bid="B119">119</abbr></abbrgrp>.</p>
            <p>I believe that, despite all its limitations, this census would be useful for microbiologists, at least by highlighting still unresolved problems in prokaryotic signal transduction. This work should be complemented by surveys of other components of the signal transduction machinery, including various response regulators, Ser/Thr protein phosphatases, PTS proteins, and many others. Genomes of several environmental microorganisms, including 9-Mb genomes of <it>Myxococcus xanthus, Rhodococcus </it>sp., and <it>Gemmata obscuriglobus</it>, have been completed and are expected to be publicly released in the near future. Owing to their sheer size, these genomes are likely to bring new signaling domains and illuminate even more regulatory relations. <it>Myxococcus xanthus</it>, which reportedly encodes close to 200 HKs and many STYKs, would probably become a leader in both these categories.</p>
            <p>The example of <it>M. xanthus </it>exposes certain flaws in the IQ calculation method used in this work. This bacterium has extremely complex behavioral patterns <abbrgrp><abbr bid="B114">114</abbr></abbrgrp>, but, at 9.1 Mb, it would need to encode more than 550 signal transduction proteins just to make it into the winners' list (Table <tblr tid="T1">1</tblr>). Certainly, better ways to evaluate bacterial IQ are needed, but that should be subject of a future work. Still, I believe that in the era of 'systems biology' when cellular metabolic pathways are being routinely modeled on a whole-genome level <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B120">120</abbr></abbrgrp> and the cell itself is treated more as a machine with a number of interacting parts <abbrgrp><abbr bid="B121">121</abbr><abbr bid="B122">122</abbr></abbrgrp>, it is important to keep in mind the real complexity of the signal network encoded in each given prokaryotic genome and have an easy measure of this complexity.</p>
            <p>I also hope that this census will help us get a better understanding of the microbial diversity and the unique ways that bacteria use to adapt to changing environment. Such understanding is becoming increasingly important as our earlier methods of controlling bacterial growth with one-size-fits-all wide-spectrum antibiotics show progressively diminishing results.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Careful accounting of diverse proteins participating in prokaryotic signal transduction shows that the complexity of signaling mechanisms correlates well with the organism's genome size and the size of its proteome. The total number of proteins involved in signal transduction, the number of histidine kinases, and the total number of signal transduction proteins other than histidine kinases all grow as square of the genome size. At the same time, the fractions of the latter proteins &#8211; MCPs, STYKs, adenylate and diguanylate cyclases and phosphodiesterases &#8211; in the total set vary widely depending on the organism's ecology, metabolic properties, and phylogenetic position. The results of this census are freely available to the public and will be updated and corrected as necessary. The availability of this resource, as well as introduction of the concepts of bacterial IQ, introverts and extroverts among the prokaryotes, should help in achieving a better understanding of the microbial behavior and forces that shape microbial genome evolution.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Data sources</p>
            </st>
            <p>Complete genome sequences of 167 bacterial and archaeal species, sequenced by the end of 2004, were downloaded from the NCBI's Genomes database <abbrgrp><abbr bid="B123">123</abbr></abbrgrp> or searched directly through the NCBI web site. Only one representative genome per species was used, usually the first one to be publicly released, according to the NCBI Genomes database listing. Exceptions were made for <it>Escherichia coli</it>, represented by two strains, K12 [GenBank:<ext-link ext-link-type="gen" ext-link-id="U00096">U00096</ext-link>] and O157:H7 [GenBank:<ext-link ext-link-type="gen" ext-link-id="BA000007">BA000007</ext-link>], and <it>Salmonella enterica</it>, represented by three serovars, Paratyphi [GenBank:<ext-link ext-link-type="gen" ext-link-id="CP000026">CP000026</ext-link>], Typhi [GenBank:<ext-link ext-link-type="gen" ext-link-id="AL513382">AL513382</ext-link>], and Typhimurium [GenBank:<ext-link ext-link-type="gen" ext-link-id="AE006468">AE006468</ext-link>]. For <it>Prochlorococcus marinus</it>, strain CCMP1375 [GenBank:<ext-link ext-link-type="gen" ext-link-id="AE017126">AE017126</ext-link>] genome was used, the middle-sized one of the three. Among other simultaneously released genomes, <it>Staphylococcus aureus </it>N315 [GenBank:<ext-link ext-link-type="gen" ext-link-id="BA000018">BA000018</ext-link>], <it>Streptococcus thermophilus </it>CNRZ1066 [GenBank:<ext-link ext-link-type="gen" ext-link-id="CP000024">CP000024</ext-link>], and <it>Thermus thermophilus </it>HB27 [GenBank:<ext-link ext-link-type="gen" ext-link-id="AE017221">AE017221</ext-link>] genomes were used.</p>
         </sec>
         <sec>
            <st>
               <p>A census of histidine kinases</p>
            </st>
            <p>The complete list of histidine kinases was compiled separately for each particular phylum of bacteria from the results of BLAST searches against selected genomes using the NCBI's Genomic BLAST tool <abbrgrp><abbr bid="B124">124</abbr></abbrgrp>, followed by iterative PSI-BLAST searches <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. Typically, the searches used as the query sequence the C-terminal fragment (residues 301&#8211;579) of the well-characterized histidine kinase PhoR [UniProt:P23545 <url>http://www.expasy.org/cgi-bin/niceprot.pl?P23545</url>] from <it>Bacillus subtilis</it>, which contains both HisKA and HATPase domains <abbrgrp><abbr bid="B125">125</abbr></abbrgrp>, and a position-specific scoring matrix (PSSM) derived from an alignment of well-characterized histidine kinases (both available as Supplementary Material). Additional searches against the NCBI's Reference Sequence (RefSeq) database <abbrgrp><abbr bid="B126">126</abbr><abbr bid="B127">127</abbr></abbrgrp> were performed through the NCBI BLAST web interface <url>http://www.ncbi.nlm.nih.gov/BLAST/</url> by limiting the search space to the given phylum (e.g. Actinobacteria [orgn]) and excluding reference sequences of incomplete genomes (srcdb_refseq [prop] NOT srcdb_refseq_model [prop]). The PSI-BLAST searches used strict inclusion threshold expect values of 10<sup>-5</sup>&#8211;10<sup>-7 </sup>(adjusting as necessary) and were iterated until no newly retrieved sequences belonged to HKs. The total numbers of copies of each signaling domain encoded in each given genome were estimated using the "Taxonomy Report" option in the BLAST output. Potential false-positive hits were checked at every step of PSI-BLAST using the CDD Domain viewer and manually removed (unselected) from the hit list for the next iteration of PSI-BLAST. In each case where the HATPase domain was easily recognized but HisKA domain was not, a BLAST2sequences <abbrgrp><abbr bid="B128">128</abbr></abbrgrp> search was performed to check whether the HATPase domain was preceded by a conserved region carrying a conserved His residue. The presence of such His-containing regions would indicate that those questionable proteins (e.g., mlr1749 [UniProt:Q98JW4 <url>http://www.expasy.org/cgi-bin/niceprot.pl?Q98JW4_RHILO</url>] and other members of COG3920 [<url>http://www.ncbi.nlm.nih.gov/COG/new/release/cow.cgi?view=1&amp;cog=COG3920</url>]) comprise legitimate HKs, contrary to the view of Hagiwara <it>et al</it>. <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>.</p>
            <p>Alternatively, PSI-BLAST searches were run against a local copy of the RefSeq database, using the same query sequence and search parameters with additional filtering against sequences translated from unfinished genomes (ZP_xxxxxxxx entries). The resulting hits were compared against the NCBI Taxonomy database to ensure that they all came from a single organism (only one genome of each bacterial species, usually the first one to be sequenced, was used in this analysis). Similar protocol was used to search for histidine kinases in other bacterial phyla.</p>
         </sec>
         <sec>
            <st>
               <p>Counting other signaling domains</p>
            </st>
            <p>Owing to the relatively high sequence conservation of the MCP, ACyc, GGDEF, and EAL domains, manual checking of the PSI-BLAST outputs revealed very few false-positive hits. In the case of the two latter domains, many low-scoring proteins had numerous amino acid changes, including ones in the likely active sites (see <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B22">22</abbr><abbr bid="B67">67</abbr></abbrgrp>). No attempt has been made to sort these domains into active and inactive ones. For the HD-GYP domain, which comprises a typical HD superfamily phosphoesterase domain with a number of additional conserved residues, high-scoring BLAST hits to the standard HD domains were filtered based on the shorter length of those hits.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of transmembrane receptors</p>
            </st>
            <p>Transmembrane (TM) segments in verified sets of signal transduction proteins from various phylogenetic lineages were predicted using PHDhtm <abbrgrp><abbr bid="B129">129</abbr></abbrgrp> and TMHMM <abbrgrp><abbr bid="B130">130</abbr></abbrgrp> programs. The results were sorted into three bins: TM proteins (&#8805; 2 TM segments), 1 TM proteins, and soluble proteins, and the discrepancies between predictions of the two programs were manually inspected. Comparison of the results revealed many false-negative assignments, so that prediction of a TM segment by either program typically turned out to be justified. Questionable cases were also checked using the HMMTop <abbrgrp><abbr bid="B131">131</abbr></abbrgrp> program, which, however, produced both false-negative and false-positive predictions of TM segments. Therefore, HMMTop assignments were considered only when supported by either PHDhtm or TMHMM results.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>List of Abbreviations</p>
         </st>
         <p>AC, adenylate cyclase;</p>
         <p>AC1, adenylate cyclase class 1;</p>
         <p>AC2, adenylate cyclase class 2;</p>
         <p>AC3, adenylate cyclase class 3;</p>
         <p>c-di-GMP, cyclic dimeric (3',5'-guanosine monophosphate);</p>
         <p>EAL, conserved protein domain with the Glu-Ala-Leu sequence motif and c-di-GMP-specific phosphodiesterase activity;</p>
         <p>GGDEF, conserved protein domain with the Gly-Gly-(Asp/Glu)-Glu-Phe sequence motif and diguanylate cyclase activity;</p>
         <p>HD-GYP, conserved protein domain of the HD phosphohydrolase superfamily with additional highly conserved residues, predicted phosphodiesterase;</p>
         <p>HK or HisK, histidine kinase;</p>
         <p>MCP, methyl-accepting chemotaxis protein;</p>
         <p>STYK, Ser/Thr/Tyr-specific protein kinase</p>
         <p>TM, transmembrane.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>MYG conceived the study, performed all the calculations and wrote the manuscript.</p>
         <suppl id="S2">
            <title>
               <p>Additional File 2</p>
            </title>
            <text>
               <p>Table <tblr tid="T1">1</tblr> in HTML format</p>
            </text>
            <file name="1471-2180-5-35-S2.html">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional File 3</p>
            </title>
            <text>
               <p>Table <tblr tid="T1">1</tblr> in HTML format</p>
            </text>
            <file name="1471-2180-5-35-S3.html">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>I thank Yuri Wolf and Darren Natale for valuable advice, Mark Gomelsky, Eugene Koonin, Armen Mulkidjanian, and Igor Zhulin for helpful comments, and many other colleagues for suggestions.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Binding of enzyme IIAGlc, a component of the phosphoenolpyruvate:sugar phosphotransferase system, to the Escherichia coli lactose permease</p>
            </title>
            <aug>
               <au>
                  <snm>Sondej</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Weinglass</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Peterkofsky</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kaback</snm>
                  <fnm>HR</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>2002</pubdate>
            <volume>41</volume>
            <issue>17</issue>
            <fpage>5556</fpage>
            <lpage>5565</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11969416</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Novel domains of the prokaryotic two-component signal transduction systems</p>
            </title>
            <aug>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>2001</pubdate>
            <volume>203</volume>
            <issue>1</issue>
            <fpage>11</fpage>
            <lpage>21</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11557134</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Bacterial signal transduction network in a genomic perspective</p>
            </title>
            <aug>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
            </aug>
            <source>Environ Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>6</volume>
            <issue>6</issue>
            <fpage>552</fpage>
            <lpage>567</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15142243</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Trends between gene content and genome size in prokaryotic species with larger genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Konstantinidis</snm>
                  <fnm>KT</fnm>
               </au>
               <au>
                  <snm>Tiedje</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <issue>9</issue>
            <fpage>3160</fpage>
            <lpage>3165</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">365760</pubid>
                  <pubid idtype="pmpid" link="fulltext">14973198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Compilation of all genes encoding bacterial two-component signal transducers in the genome of the cyanobacterium, Synechocystis sp. strain PCC 6803</p>
            </title>
            <aug>
               <au>
                  <snm>Mizuno</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kaneko</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tabata</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>DNA Res</source>
            <pubdate>1996</pubdate>
            <volume>3</volume>
            <issue>6</issue>
            <fpage>407</fpage>
            <lpage>414</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9097043</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Two-component signal transduction in Bacillus subtilis: how one organism sees its world</p>
            </title>
            <aug>
               <au>
                  <snm>Fabret</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Feher</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Hoch</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1999</pubdate>
            <volume>181</volume>
            <issue>7</issue>
            <fpage>1975</fpage>
            <lpage>1983</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">93607</pubid>
                  <pubid idtype="pmpid" link="fulltext">10094672</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Histidine protein kinases: key signal transducers outside the animal kingdom</p>
            </title>
            <aug>
               <au>
                  <snm>Wolanin</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Thomason</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Stock</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <issue>10</issue>
            <fpage>REVIEWS3013.</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">244915</pubid>
                  <pubid idtype="pmpid" link="fulltext">12372152</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The histidine protein kinase superfamily</p>
            </title>
            <aug>
               <au>
                  <snm>Grebe</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Stock</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Adv Microb Physiol</source>
            <pubdate>1999</pubdate>
            <volume>41</volume>
            <issue>21</issue>
            <fpage>139</fpage>
            <lpage>227</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10500846</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Classification of Histididine Protein Kinases and Response Regulators</p>
            </title>
            <url>http://www.uni-kl.de/FB-Biologie/AG-Hakenbeck/TGrebe/HPK/Classification.htm</url>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The HWE histidine kinases, a new family of bacterial two-component sensor kinases with potentially diverse roles in environmental signaling</p>
            </title>
            <aug>
               <au>
                  <snm>Karniol</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Vierstra</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2004</pubdate>
            <volume>186</volume>
            <issue>2</issue>
            <fpage>445</fpage>
            <lpage>453</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">305753</pubid>
                  <pubid idtype="pmpid" link="fulltext">14702314</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>SMART 4.0: towards genomic data integration</p>
            </title>
            <aug>
               <au>
                  <snm>Letunic</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Copley</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ciccarelli</snm>
                  <fnm>FD</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Schultz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>1</issue>
            <fpage>D142</fpage>
            <lpage>D144.</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308822</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681379</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Simple Modular Architecture Research Tool</p>
            </title>
            <url>http://smart.embl.de</url>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The Pfam protein families database</p>
            </title>
            <aug>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Coin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Finn</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Hollich</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Khanna</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Moxon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Studholme</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Yeats</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>1</issue>
            <fpage>D138</fpage>
            <lpage>D141.</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308855</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681378</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Pfam - Protein families database of alignments and HMMs</p>
            </title>
            <url>http://www.sanger.ac.uk/Software/Pfam/</url>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Digging with experimental pick and computational shovel: a new addition to the histidine kinase superfamily</p>
            </title>
            <aug>
               <au>
                  <snm>Zhulin</snm>
                  <fnm>IB</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2004</pubdate>
            <volume>186</volume>
            <issue>2</issue>
            <fpage>267</fpage>
            <lpage>269</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">305774</pubid>
                  <pubid idtype="pmpid" link="fulltext">14702293</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Compilation of all genes encoding two-component phosphotransfer signal transducers in the genome of Escherichia coli</p>
            </title>
            <aug>
               <au>
                  <snm>Mizuno</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>DNA Res</source>
            <pubdate>1997</pubdate>
            <volume>4</volume>
            <issue>2</issue>
            <fpage>161</fpage>
            <lpage>168</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9205844</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Genomic analysis of the histidine kinase family in bacteria and archaea</p>
            </title>
            <aug>
               <au>
                  <snm>Kim</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Forst</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2001</pubdate>
            <volume>147</volume>
            <issue>Pt 5</issue>
            <fpage>1197</fpage>
            <lpage>1212</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11320123</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>His-Asp phosphotransfer signal transduction</p>
            </title>
            <aug>
               <au>
                  <snm>Mizuno</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>J Biochem (Tokyo)</source>
            <pubdate>1998</pubdate>
            <volume>123</volume>
            <issue>4</issue>
            <fpage>555</fpage>
            <lpage>563</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9538242</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>The serine, threonine, and/or tyrosine-specific protein kinases and protein phosphatases of prokaryotic organisms: a family portrait</p>
            </title>
            <aug>
               <au>
                  <snm>Shi</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Potts</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kennelly</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Rev</source>
            <pubdate>1998</pubdate>
            <volume>22</volume>
            <issue>4</issue>
            <fpage>229</fpage>
            <lpage>253</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9862122</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Protein kinases and protein phosphatases in prokaryotes: a genomic perspective</p>
            </title>
            <aug>
               <au>
                  <snm>Kennelly</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>2002</pubdate>
            <volume>206</volume>
            <issue>1</issue>
            <fpage>1</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11786249</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Cyclic di-guanosine-monophosphate comes of age: a novel secondary messenger involved in modulating cell surface structures in bacteria?</p>
            </title>
            <aug>
               <au>
                  <snm>Jenal</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>7</volume>
            <issue>2</issue>
            <fpage>185</fpage>
            <lpage>191</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15063857</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>C-di-GMP: The dawning of a novel bacterial signalling system</p>
            </title>
            <aug>
               <au>
                  <snm>R&#246;mling</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Gomelsky</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2005</pubdate>
            <inpress/>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Cell cycle-dependent dynamic localization of a bacterial response regulator with a novel di-guanylate cyclase output domain</p>
            </title>
            <aug>
               <au>
                  <snm>Paul</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Weiser</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Amiot</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schirmer</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Giese</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Jenal</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2004</pubdate>
            <volume>18</volume>
            <issue>6</issue>
            <fpage>715</fpage>
            <lpage>727</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">387245</pubid>
                  <pubid idtype="pmpid" link="fulltext">15075296</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Cyclic diguanylate is a ubiquitous signaling molecule in Bacteria: Insights into biochemistry of the GGDEF protein domain.</p>
            </title>
            <aug>
               <au>
                  <snm>Ryjenkov</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Tarutina</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Moskvin</snm>
                  <fnm>OM</fnm>
               </au>
               <au>
                  <snm>Gomelsky</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2005</pubdate>
            <volume>187</volume>
            <issue>5</issue>
            <fpage>1792</fpage>
            <lpage>1798</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15716451</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>The phosphodiesterase activity of the HmsP EAL domain is required for negative regulation of biofilm formation in Yersinia pestis</p>
            </title>
            <aug>
               <au>
                  <snm>Bobrov</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Kirillina</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Perry</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>2005</pubdate>
            <volume>247</volume>
            <fpage>123</fpage>
            <lpage>130</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15935569</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>A specialized version of the HD hydrolase domain implicated in signal transduction</p>
            </title>
            <aug>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>J Mol Microbiol Biotechnol</source>
            <pubdate>1999</pubdate>
            <volume>1</volume>
            <fpage>303</fpage>
            <lpage>305</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10943560</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Photoresponsive cAMP signal transduction in cyanobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Ohmori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Okamoto</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Photochem Photobiol Sci</source>
            <pubdate>2004</pubdate>
            <volume>3</volume>
            <issue>6</issue>
            <fpage>503</fpage>
            <lpage>511</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15170478</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Blue light stimulates cyanobacterial motility via a cAMP signal transduction system</p>
            </title>
            <aug>
               <au>
                  <snm>Terauchi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ohmori</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>52</volume>
            <issue>1</issue>
            <fpage>303</fpage>
            <lpage>309</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15049828</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Adenylyl cyclases: a heterogeneous class of ATP-utilizing enzymes</p>
            </title>
            <aug>
               <au>
                  <snm>Barzu</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Danchin</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Prog Nucleic Acid Res Mol Biol</source>
            <pubdate>1994</pubdate>
            <volume>49</volume>
            <fpage>241</fpage>
            <lpage>283</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7863008</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Aeromonas hydrophila adenylyl cyclase 2: a new class of adenylyl cyclases with thermophilic properties and sequence similarities to proteins from hyperthermophilic archaebacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Sismeiro</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Trotot</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Biville</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Vivares</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Danchin</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1998</pubdate>
            <volume>180</volume>
            <issue>13</issue>
            <fpage>3339</fpage>
            <lpage>3344</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">107287</pubid>
                  <pubid idtype="pmpid" link="fulltext">9642185</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Characterization of a eukaryotic type serine/threonine protein kinase and protein phosphatase of Streptococcus pneumoniae and identification of kinase substrates</p>
            </title>
            <aug>
               <au>
                  <snm>Novakova</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Saskova</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Pallova</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Janecek</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Novotna</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ulrych</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Echenique</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Trombe</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Branny</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>FEBS J</source>
            <pubdate>2005</pubdate>
            <volume>272</volume>
            <issue>5</issue>
            <fpage>1243</fpage>
            <lpage>1254</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15720398</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Evolution of two-component signal transduction</p>
            </title>
            <aug>
               <au>
                  <snm>Koretke</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Lupas</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Warren</snm>
                  <fnm>PV</fnm>
               </au>
               <au>
                  <snm>Rosenberg</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2000</pubdate>
            <volume>17</volume>
            <issue>12</issue>
            <fpage>1956</fpage>
            <lpage>1970</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11110912</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Survey of the number of two-component response regulator genes in the complete and annotated genome sequences of prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Ashby</snm>
                  <fnm>MK</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>2004</pubdate>
            <volume>231</volume>
            <issue>2</issue>
            <fpage>277</fpage>
            <lpage>281</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14987775</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Genome-wide comparison of the His-to-Asp phosphorelay signaling components of three symbiotic genera of Rhizobia</p>
            </title>
            <aug>
               <au>
                  <snm>Hagiwara</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Yamashino</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Mizuno</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>DNA Res</source>
            <pubdate>2004</pubdate>
            <volume>11</volume>
            <issue>1</issue>
            <fpage>57</fpage>
            <lpage>65</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15141946</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Characterization of genes encoding multi-domain proteins in the genome of the filamentous nitrogen-fixing cyanobacterium Anabaena sp. strain PCC 7120</p>
            </title>
            <aug>
               <au>
                  <snm>Ohmori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ikeuchi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sato</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Wolk</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kaneko</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ogawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Okamoto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yoshimura</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Katoh</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fujisawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ehira</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kamei</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yoshihara</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Narikawa</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tabata</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>DNA Res</source>
            <pubdate>2001</pubdate>
            <volume>8</volume>
            <fpage>271</fpage>
            <lpage>284</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11858227</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Genomic analysis of protein kinases, protein phosphatases and two-component regulatory systems of the cyanobacterium Anabaena sp. strain PCC 7120</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>YP</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>WL</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>CC</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>2002</pubdate>
            <volume>217</volume>
            <issue>2</issue>
            <fpage>155</fpage>
            <lpage>165</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12480098</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Molecular evolution of sensory domains in cyanobacterial chemoreceptors</p>
            </title>
            <aug>
               <au>
                  <snm>Wuichet</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zhulin</snm>
                  <fnm>IB</fnm>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>11</volume>
            <issue>5</issue>
            <fpage>200</fpage>
            <lpage>203</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12781518</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Comparative analysis of eukaryotic-type protein phosphatases in two streptomycete genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Shi</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2004</pubdate>
            <volume>150</volume>
            <issue>Pt 7</issue>
            <fpage>2247</fpage>
            <lpage>2256</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15256567</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Manganese-dependent protein O-phosphatases in prokaryotes and their biological functions</p>
            </title>
            <aug>
               <au>
                  <snm>Shi</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Front Biosci</source>
            <pubdate>2004</pubdate>
            <volume>9</volume>
            <fpage>1382</fpage>
            <lpage>1397</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14977554</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>The superfamily of chemotaxis transducers: from physiology to genomics and back</p>
            </title>
            <aug>
               <au>
                  <snm>Zhulin</snm>
                  <fnm>IB</fnm>
               </au>
            </aug>
            <source>Adv Microb Physiol</source>
            <pubdate>2001</pubdate>
            <volume>45</volume>
            <issue>6</issue>
            <fpage>157</fpage>
            <lpage>198</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11450109</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Class III nucleotide cyclases in bacteria and archaebacteria: lineage-spefic expansion of adenylyl cyclases and a dearth of guanylyl cyclases</p>
            </title>
            <aug>
               <au>
                  <snm>Shenoy</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Visweswariah</snm>
                  <fnm>SS</fnm>
               </au>
            </aug>
            <source>FEBS Lett</source>
            <pubdate>2004</pubdate>
            <volume>561</volume>
            <issue>1</issue>
            <fpage>11</fpage>
            <lpage>21</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15043055</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>The KEGG resource for deciphering the genome</p>
            </title>
            <aug>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Okuno</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Database issue</issue>
            <fpage>D277</fpage>
            <lpage>D280</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308797</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681412</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>KEGG: Kyoto Encyclopedia of Genes and Genomes. Two-component system - Escherichia coli K-12 MG1655</p>
            </title>
            <url>http://www.genome.jp/kegg/pathway/eco/eco02020.html</url>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Sentra, a database of signal transduction proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Maltsev</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Marland</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>GX</fnm>
               </au>
               <au>
                  <snm>Bhatnagar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lusk</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <issue>1</issue>
            <fpage>349</fpage>
            <lpage>350</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99115</pubid>
                  <pubid idtype="pmpid" link="fulltext">11752334</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>SENTRA, A Database of Signal Transduction Proteins for Prokaryotes</p>
            </title>
            <url>http://www-wit.mcs.anl.gov/sentra/</url>
         </bibl>
         <bibl id="B46">
            <title>
               <p>The COG database: an updated version includes eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Fedorova</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Jacobs</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Kiryutin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Krylov</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Mazumder</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mekhedov</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Smirnov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sverdlov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Vasudevan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Yin</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>1</issue>
            <fpage>41</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">222959</pubid>
                  <pubid idtype="pmpid" link="fulltext">12969510</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Clusters of Orthologous Groups of proteins database</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/COG/new/release/phylox.cgi</url>
         </bibl>
         <bibl id="B48">
            <title>
               <p>The PEDANT genome database in 2005</p>
            </title>
            <aug>
               <au>
                  <snm>Riley</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mewes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Frishman</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D308</fpage>
            <lpage>D310</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">539973</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608204</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>PEDANT - Protein Extraction, Description and ANalysis Tool</p>
            </title>
            <url>http://pedant.gsf.de/</url>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Bacterial observations: a rudimentary form of intelligence?</p>
            </title>
            <aug>
               <au>
                  <snm>Hellingwerf</snm>
                  <fnm>KJ</fnm>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>13</volume>
            <issue>4</issue>
            <fpage>152</fpage>
            <lpage>158</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15817384</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Census of bacterial signal transduction proteins </p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/Complete_Genomes/SignalCensus.html</url>
         </bibl>
         <bibl id="B52">
            <title>
               <p>AraC protein: a love-hate relationship</p>
            </title>
            <aug>
               <au>
                  <snm>Schleif</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Bioessays</source>
            <pubdate>2003</pubdate>
            <volume>25</volume>
            <issue>3</issue>
            <fpage>274</fpage>
            <lpage>282</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12596232</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Prominent roles of the NorR and Fur regulators in the Escherichia coli transcriptional response to reactive nitrogen species</p>
            </title>
            <aug>
               <au>
                  <snm>Mukhopadhyay</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bedzyk</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>LaRossa</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Storz</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <issue>3</issue>
            <fpage>745</fpage>
            <lpage>750</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">321752</pubid>
                  <pubid idtype="pmpid" link="fulltext">14718666</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>The complete genome sequence of Escherichia coli K-12</p>
            </title>
            <aug>
               <au>
                  <snm>Blattner</snm>
                  <fnm>FR</fnm>
               </au>
               <au>
                  <snm>Plunkett</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bloch</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Perna</snm>
                  <fnm>NT</fnm>
               </au>
               <au>
                  <snm>Burland</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Riley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Collado-Vides</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Glasner</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Rode</snm>
                  <fnm>CK</fnm>
               </au>
               <au>
                  <snm>Mayhew</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Gregor</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>NW</fnm>
               </au>
               <au>
                  <snm>Kirkpatrick</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Goeden</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Rose</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Mau</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Shao</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1997</pubdate>
            <volume>277</volume>
            <issue>5331</issue>
            <fpage>1453</fpage>
            <lpage>1474</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9278503</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Complete genome sequence of enterohemorrhagic Escherichia coli O157:H7 and genomic comparison with a laboratory strain K-12</p>
            </title>
            <aug>
               <au>
                  <snm>Hayashi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Makino</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ohnishi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kurokawa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ishii</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yokoyama</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Ohtsubo</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Nakayama</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Murata</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tobe</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Iida</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Takami</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Honda</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sasakawa</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ogasawara</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Yasunaga</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kuhara</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shiba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shinagawa</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>DNA Res</source>
            <pubdate>2001</pubdate>
            <volume>8</volume>
            <issue>1</issue>
            <fpage>11</fpage>
            <lpage>22</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11258796</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18</p>
            </title>
            <aug>
               <au>
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dougan</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>James</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Thomson</snm>
                  <fnm>NR</fnm>
               </au>
               <au>
                  <snm>Pickard</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wain</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Churcher</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mungall</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Bentley</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Holden</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Sebaihia</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Baker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Basham</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brooks</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Chillingworth</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Connerton</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cronin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Davies</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Dowd</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Farrar</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Feltwell</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hamlin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Haque</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hien</snm>
                  <fnm>TT</fnm>
               </au>
               <au>
                  <snm>Holroyd</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jagels</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Krogh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Larsen</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Leather</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Moule</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>O'Gaora</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Parry</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Quail</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rutherford</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Simmonds</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Skelton</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stevens</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Whitehead</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>BG</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>413</volume>
            <issue>6858</issue>
            <fpage>848</fpage>
            <lpage>852</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11677608</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Complete genome sequence of Salmonella enterica serovar Typhimurium LT2</p>
            </title>
            <aug>
               <au>
                  <snm>McClelland</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sanderson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Spieth</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Clifton</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Latreille</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Courtney</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Porwollik</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ali</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dante</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Du</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Layman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Leonard</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nguyen</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Holmes</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Grewal</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Mulvaney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ryan</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Florea</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Stoneking</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nhan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Waterston</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>RK</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>413</volume>
            <issue>6858</issue>
            <fpage>852</fpage>
            <lpage>856</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11677609</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Comparison of genome degradation in Paratyphi A and Typhi, human-restricted serovars of Salmonella enterica that cause typhoid</p>
            </title>
            <aug>
               <au>
                  <snm>McClelland</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sanderson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Clifton</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Latreille</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Porwollik</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sabo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bieri</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ozersky</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>McLellan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Harkins</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Nguyen</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Berghoff</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Elliott</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kohlberg</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Strong</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Du</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Carter</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kremizki</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Layman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Leonard</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fulton</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Nash</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Miner</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Minx</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Delehaunty</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Fronick</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Magrini</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Nhan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Warren</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Florea</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Spieth</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>RK</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2004</pubdate>
            <volume>36</volume>
            <issue>12</issue>
            <fpage>1268</fpage>
            <lpage>1274</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15531882</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Gapped BLAST and PSI-BLAST - A new generation of protein database search programs</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <issue>17</issue>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>CD-Search: protein domain annotations on the fly</p>
            </title>
            <aug>
               <au>
                  <snm>Marchler-Bauer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>SH</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Web Server issue</issue>
            <fpage>W327</fpage>
            <lpage>W331</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">441592</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Crystal structure and ATPase activity of MutL: implications for DNA repair and mutagenesis</p>
            </title>
            <aug>
               <au>
                  <snm>Ban</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <issue>4</issue>
            <fpage>541</fpage>
            <lpage>552</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9827806</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>GHKL, an emergent ATPase/kinase superfamily</p>
            </title>
            <aug>
               <au>
                  <snm>Dutta</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Inouye</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <issue>1</issue>
            <fpage>24</fpage>
            <lpage>28</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10637609</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Two-component signal transduction</p>
            </title>
            <aug>
               <au>
                  <snm>Stock</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>VL</fnm>
               </au>
               <au>
                  <snm>Goudreau</snm>
                  <fnm>PN</fnm>
               </au>
            </aug>
            <source>Annu Rev Biochem</source>
            <pubdate>2000</pubdate>
            <volume>69</volume>
            <issue>4</issue>
            <fpage>183</fpage>
            <lpage>215</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10966457</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>KinG: a database of protein kinases in genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Krupa</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Abhinandan</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Srinivasan</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Database issue</issue>
            <fpage>D153</fpage>
            <lpage>D155</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308754</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681382</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>KinG database</p>
            </title>
            <url>http://hodgkin.mbu.iisc.ernet.in/~king</url>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Identification of Escherichia coli ubiB, a gene required for the first monooxygenase step in ubiquinone biosynthesis</p>
            </title>
            <aug>
               <au>
                  <snm>Poon</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Ha</snm>
                  <fnm>HT</fnm>
               </au>
               <au>
                  <snm>Jonassen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Rather</snm>
                  <fnm>PN</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>CF</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2000</pubdate>
            <volume>182</volume>
            <issue>18</issue>
            <fpage>5139</fpage>
            <lpage>5146</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">94662</pubid>
                  <pubid idtype="pmpid" link="fulltext">10960098</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Structural basis of activity and allosteric control of diguanylate cyclase</p>
            </title>
            <aug>
               <au>
                  <snm>Chan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Paul</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Samoray</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Amiot</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>Giese</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Jenal</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Schirmer</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <issue>49</issue>
            <fpage>17084</fpage>
            <lpage>17089</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">535365</pubid>
                  <pubid idtype="pmpid" link="fulltext">15569936</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>A novel family of predicted phosphoesterases includes Drosophila prune protein and bacterial RecJ exonuclease</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1998</pubdate>
            <volume>23</volume>
            <issue>1</issue>
            <fpage>17</fpage>
            <lpage>19</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9478130</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>The HD domain defines a new superfamily of metal-dependent phosphohydrolases</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1998</pubdate>
            <volume>23</volume>
            <issue>12</issue>
            <fpage>469</fpage>
            <lpage>472</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9868367</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Activation of a cyanobacterial adenylate cyclase, CyaC, by autophosphorylation and a subsequent phosphotransfer reaction</p>
            </title>
            <aug>
               <au>
                  <snm>Kasahara</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ohmori</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1999</pubdate>
            <volume>274</volume>
            <issue>21</issue>
            <fpage>15167</fpage>
            <lpage>15172</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10329724</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Scaling laws in the functional content of genomes</p>
            </title>
            <aug>
               <au>
                  <snm>van Nimwegen</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>9</issue>
            <fpage>479</fpage>
            <lpage>484</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12957540</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>Genome sequence of the cyanobacterium Prochlorococcus marinus SS120, a nearly minimal oxyphototrophic genome</p>
            </title>
            <aug>
               <au>
                  <snm>Dufresne</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Salanoubat</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Partensky</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Artiguenave</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Axmann</snm>
                  <fnm>IM</fnm>
               </au>
               <au>
                  <snm>Barbe</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Duprat</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Le Gall</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Ostrowski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Oztas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Scanlan</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Tandeau de Marsac</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Weissenbach</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wincker</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Hess</snm>
                  <fnm>WR</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <issue>17</issue>
            <fpage>10020</fpage>
            <lpage>10025</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">187748</pubid>
                  <pubid idtype="pmpid" link="fulltext">12917486</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>Complete genome sequence and analysis of Wolinella succinogenes</p>
            </title>
            <aug>
               <au>
                  <snm>Baar</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Eppinger</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Raddatz</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lanz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Klimmek</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Nandakumar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gross</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Rosinus</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keller</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Jagtap</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Linke</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Lederer</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>SC</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <issue>20</issue>
            <fpage>11690</fpage>
            <lpage>11695</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">208819</pubid>
                  <pubid idtype="pmpid" link="fulltext">14500908</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Genome of Geobacter sulfurreducens: metal reduction in subsurface environments</p>
            </title>
            <aug>
               <au>
                  <snm>Methe</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>IT</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Heidelberg</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ward</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Beanan</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Madupu</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Brinkac</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Daugherty</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>DeBoy</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Durkin</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Gwinn</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kolonay</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Haft</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Selengut</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Davidsen</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Zafar</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Tran</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Romero</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Forberger</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Weidman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Khouri</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Feldblyum</snm>
                  <fnm>TV</fnm>
               </au>
               <au>
                  <snm>Utterback</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Van Aken</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Lovley</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <issue>5652</issue>
            <fpage>1967</fpage>
            <lpage>1969</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14671304</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B75">
            <title>
               <p>The complete genome sequence of Chromobacterium violaceum reveals remarkable and exploitable bacterial adaptability</p>
            </title>
            <aug>
               <au>
                  <snm>Consortium</snm>
                  <fnm>BNGP</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <issue>20</issue>
            <fpage>11660</fpage>
            <lpage>11665</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">208814</pubid>
                  <pubid idtype="pmpid" link="fulltext">14500782</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B76">
            <title>
               <p>The genome sequence of the anaerobic, sulfate-reducing bacterium Desulfovibrio vulgaris Hildenborough</p>
            </title>
            <aug>
               <au>
                  <snm>Heidelberg</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Seshadri</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Haveman</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Hemme</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>IT</fnm>
               </au>
               <au>
                  <snm>Kolonay</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Ward</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Methe</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Brinkac</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Daugherty</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Deboy</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Durkin</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Madupu</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Fouts</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Haft</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Selengut</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Davidsen</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Zafar</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Radune</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Dimitrov</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hance</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tran</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Khouri</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gill</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Utterback</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Feldblyum</snm>
                  <fnm>TV</fnm>
               </au>
               <au>
                  <snm>Wall</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Voordouw</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2004</pubdate>
            <volume>22</volume>
            <issue>5</issue>
            <fpage>554</fpage>
            <lpage>559</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15077118</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <title>
               <p>Global profiling of Shewanella oneidensis MR-1: expression of hypothetical genes and improved functional annotations</p>
            </title>
            <aug>
               <au>
                  <snm>Kolker</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Picone</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Romine</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Higdon</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Kolker</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Qiu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Auberry</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Babnigg</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Beliaev</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Edlefsen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Elias</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Gorby</snm>
                  <fnm>YA</fnm>
               </au>
               <au>
                  <snm>Holzman</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Klappenbach</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Konstantinidis</snm>
                  <fnm>KT</fnm>
               </au>
               <au>
                  <snm>Land</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Lipton</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>McCue</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Monroe</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pasa-Tolic</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Pinchuk</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Purvine</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Serres</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Tsapin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zakrajsek</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Larimer</snm>
                  <fnm>FW</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Riley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Collart</snm>
                  <fnm>FR</fnm>
               </au>
               <au>
                  <snm>Yates</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Giometti</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Nealson</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Fredrickson</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Tiedje</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>6</issue>
            <fpage>2099</fpage>
            <lpage>2104</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">548307</pubid>
                  <pubid idtype="pmpid" link="fulltext">15684069</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B78">
            <title>
               <p>Genome sequence of the deep-sea gamma-proteobacterium Idiomarina loihiensis reveals amino acid fermentation as a source of carbon and energy</p>
            </title>
            <aug>
               <au>
                  <snm>Hou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Saw</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Freitas</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Belisle</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kawarabayasi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Donachie</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Pikina</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Omelchenko</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Sorokin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>QX</fnm>
               </au>
               <au>
                  <snm>Keum</snm>
                  <fnm>YS</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Denery</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Aizawa</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shibata</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Malahoff</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Alam</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <issue>52</issue>
            <fpage>18036</fpage>
            <lpage>18041</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">539801</pubid>
                  <pubid idtype="pmpid" link="fulltext">15596722</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B79">
            <title>
               <p>Genome sequence of Symbiobacterium thermophilum, an uncultivable bacterium that depends on microbial commensalism</p>
            </title>
            <aug>
               <au>
                  <snm>Ueda</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yamashita</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ishikawa</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shimada</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Watsuji</snm>
                  <fnm>TO</fnm>
               </au>
               <au>
                  <snm>Morimura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ikeda</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Beppu</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>16</issue>
            <fpage>4937</fpage>
            <lpage>4944</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">519118</pubid>
                  <pubid idtype="pmpid" link="fulltext">15383646</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B80">
            <title>
               <p>Functional classification of cNMP-binding proteins and nucleotide cyclases with implications for novel regulatory pathways in Mycobacterium tuberculosis</p>
            </title>
            <aug>
               <au>
                  <snm>McCue</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>McDonough</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>CE</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <issue>2</issue>
            <fpage>204</fpage>
            <lpage>219</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10673278</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B81">
            <title>
               <p>A defined subset of adenylyl cyclases is regulated by bicarbonate ion</p>
            </title>
            <aug>
               <au>
                  <snm>Cann</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Hammer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kanacher</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2003</pubdate>
            <volume>278</volume>
            <issue>37</issue>
            <fpage>35033</fpage>
            <lpage>35038</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12829712</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B82">
            <title>
               <p>Massive gene decay in the leprosy bacillus</p>
            </title>
            <aug>
               <au>
                  <snm>Cole</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Eiglmeier</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>James</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Thomson</snm>
                  <fnm>NR</fnm>
               </au>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Honore</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Garnier</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Churcher</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Mungall</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Basham</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chillingworth</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Connor</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Davies</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Devlin</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Duthoy</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Feltwell</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hamlin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Holroyd</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hornsby</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Jagels</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lacroix</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Maclean</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Moule</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Quail</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Rajandream</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Rutherford</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Rutter</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Seeger</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Simmonds</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Skelton</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Squares</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Squares</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stevens</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Whitehead</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Woodward</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>BG</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <issue>6823</issue>
            <fpage>1007</fpage>
            <lpage>1011</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11234002</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B83">
            <title>
               <p>The cyanobacterial phytochrome Cph2 inhibits phototaxis towards blue light</p>
            </title>
            <aug>
               <au>
                  <snm>Wilde</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fiedler</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Borner</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2002</pubdate>
            <volume>44</volume>
            <issue>4</issue>
            <fpage>981</fpage>
            <lpage>988</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12010493</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B84">
            <title>
               <p>Light matters: phototaxis and signal transduction in unicellular cyanobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Bhaya</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>53</volume>
            <issue>3</issue>
            <fpage>745</fpage>
            <lpage>754</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15255889</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B85">
            <title>
               <p>The multi-talented bacterial adenylate cyclases</p>
            </title>
            <aug>
               <au>
                  <snm>Lory</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wolfgang</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Int J Med Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>293</volume>
            <issue>7-8</issue>
            <fpage>479</fpage>
            <lpage>482</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15149021</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B86">
            <title>
               <p>Genome sequence of enterohaemorrhagic Escherichia coli O157:H7</p>
            </title>
            <aug>
               <au>
                  <snm>Perna</snm>
                  <fnm>NT</fnm>
               </au>
               <au>
                  <snm>Plunkett</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Burland</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Mau</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Glasner</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Rose</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Mayhew</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Gregor</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kirkpatrick</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Posfai</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hackett</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Klink</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Boutin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Grotbeck</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>NW</fnm>
               </au>
               <au>
                  <snm>Lim</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dimalanta</snm>
                  <fnm>ET</fnm>
               </au>
               <au>
                  <snm>Potamousis</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Apodaca</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Anantharaman</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yen</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Schwartz</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Welch</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Blattner</snm>
                  <fnm>FR</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <issue>6819</issue>
            <fpage>529</fpage>
            <lpage>533</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11206551</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B87">
            <title>
               <p>Genome analysis of marine photosynthetic microbes and their global role</p>
            </title>
            <aug>
               <au>
                  <snm>Hess</snm>
                  <fnm>WR</fnm>
               </au>
            </aug>
            <source>Curr Opin Biotechnol</source>
            <pubdate>2004</pubdate>
            <volume>15</volume>
            <issue>3</issue>
            <fpage>191</fpage>
            <lpage>198</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15193326</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B88">
            <title>
               <p>The transmembrane domains of the sensor kinase KdpD of Escherichia coli are not essential for sensing K+ limitation</p>
            </title>
            <aug>
               <au>
                  <snm>Heermann</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fohrmann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Altendorf</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jung</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>47</volume>
            <issue>3</issue>
            <fpage>839</fpage>
            <lpage>848</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12535080</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B89">
            <title>
               <p>Interactions between the PAS and HAMP domains of the Escherichia coli aerotaxis receptor Aer</p>
            </title>
            <aug>
               <au>
                  <snm>Watts</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>BL</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2004</pubdate>
            <volume>186</volume>
            <issue>21</issue>
            <fpage>7440</fpage>
            <lpage>7449</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">523216</pubid>
                  <pubid idtype="pmpid" link="fulltext">15489456</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B90">
            <title>
               <p>One-component systems dominate signal transduction in prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Ulrich</snm>
                  <fnm>LE</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Zhulin</snm>
                  <fnm>IB</fnm>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>13</volume>
            <issue>2</issue>
            <fpage>52</fpage>
            <lpage>56</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15680762</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B91">
            <title>
               <p>The many faces of the helix-turn-helix domain: Transcription regulation and beyond</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Anantharaman</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Balaji</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Babu</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>LM</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Rev</source>
            <pubdate>2005</pubdate>
            <volume>29</volume>
            <issue>2</issue>
            <fpage>231</fpage>
            <lpage>262</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15808743</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B92">
            <title>
               <p>Archaeal protein kinases and protein phosphatases: insights from genomics and biochemistry</p>
            </title>
            <aug>
               <au>
                  <snm>Kennelly</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Biochem J</source>
            <pubdate>2003</pubdate>
            <volume>370</volume>
            <issue>Pt 2</issue>
            <fpage>373</fpage>
            <lpage>389</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12444920</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B93">
            <title>
               <p>Lineage-specific gene expansions in bacterial and archaeal genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Jordan</snm>
                  <fnm>IK</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Spouge</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <issue>4</issue>
            <fpage>555</fpage>
            <lpage>565</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">311027</pubid>
                  <pubid idtype="pmpid" link="fulltext">11282971</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B94">
            <title>
               <p>On the nature of gene innovation: duplication patterns in microbial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Hooper</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Berg</snm>
                  <fnm>OG</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2003</pubdate>
            <volume>20</volume>
            <issue>6</issue>
            <fpage>945</fpage>
            <lpage>954</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12716994</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B95">
            <title>
               <p>Gene duplication and biased functional retention of paralogs in bacterial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Gevers</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Vandepoele</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Simillon</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Van de Peer</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>12</volume>
            <issue>4</issue>
            <fpage>148</fpage>
            <lpage>154</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15116722</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B96">
            <title>
               <p>Divergent evolution in the enolase superfamily: the interplay of mechanism and specificity</p>
            </title>
            <aug>
               <au>
                  <snm>Gerlt</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Babbitt</snm>
                  <fnm>PC</fnm>
               </au>
               <au>
                  <snm>Rayment</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Arch Biochem Biophys</source>
            <pubdate>2005</pubdate>
            <volume>433</volume>
            <issue>1</issue>
            <fpage>59</fpage>
            <lpage>70</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15581566</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B97">
            <title>
               <p>Response to culture aeration mediated by the nitrate and nitrite sensor NarQ of Escherichia coli K-12</p>
            </title>
            <aug>
               <au>
                  <snm>Stewart</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>HC</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>50</volume>
            <issue>4</issue>
            <fpage>1391</fpage>
            <lpage>1399</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14622424</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B98">
            <title>
               <p>Nitrate- and nitrite-responsive sensors NarX and NarQ of proteobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Stewart</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Biochem Soc Trans</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>Pt 1</issue>
            <fpage>1</fpage>
            <lpage>10</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12546643</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B99">
            <title>
               <p>AgfD, the checkpoint of multicellular and aggregative behaviour in Salmonella typhimurium regulates at least two independent pathways</p>
            </title>
            <aug>
               <au>
                  <snm>R&#246;mling</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Rohde</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Normark</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Reinkoster</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2000</pubdate>
            <volume>36</volume>
            <issue>1</issue>
            <fpage>10</fpage>
            <lpage>23</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10760159</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B100">
            <title>
               <p>Role of the GGDEF protein family in Salmonella cellulose biosynthesis and biofilm formation</p>
            </title>
            <aug>
               <au>
                  <snm>Garcia</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Latasa</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Solano</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Garcia-del Portillo</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Gamazo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lasa</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>54</volume>
            <issue>1</issue>
            <fpage>264</fpage>
            <lpage>277</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15458421</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B101">
            <title>
               <p>PAS-A domain of phosphorelay sensor kinase A: a catalytic ATP-binding domain involved in the initiation of development in Bacillus subtilis</p>
            </title>
            <aug>
               <au>
                  <snm>Stephenson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hoch</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <issue>26</issue>
            <fpage>15251</fpage>
            <lpage>15256</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">65015</pubid>
                  <pubid idtype="pmpid" link="fulltext">11734624</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B102">
            <title>
               <p>Primary structure and functional analysis of the soluble transducer protein HtrXI in the archaeon Halobacterium salinarium</p>
            </title>
            <aug>
               <au>
                  <snm>Brooun</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Alam</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1997</pubdate>
            <volume>179</volume>
            <issue>9</issue>
            <fpage>2963</fpage>
            <lpage>2968</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">179061</pubid>
                  <pubid idtype="pmpid" link="fulltext">9139915</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B103">
            <title>
               <p>Myoglobin-like aerotaxis transducers in Archaea and Bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Hou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Larsen</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Boudko</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Riley</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Karatan</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Zimmer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ordal</snm>
                  <fnm>GW</fnm>
               </au>
               <au>
                  <snm>Alam</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <issue>6769</issue>
            <fpage>540</fpage>
            <lpage>544</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10676961</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B104">
            <title>
               <p>Dos, a heme-binding PAS protein from Escherichia coli, is a direct oxygen sensor</p>
            </title>
            <aug>
               <au>
                  <snm>Delgado-Nixon</snm>
                  <fnm>VM</fnm>
               </au>
               <au>
                  <snm>Gonzalez</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gilles-Gonzalez</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>2000</pubdate>
            <volume>39</volume>
            <issue>10</issue>
            <fpage>2685</fpage>
            <lpage>2691</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10704219</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B105">
            <title>
               <p>Soluble adenylyl cyclase as an evolutionarily conserved bicarbonate sensor</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Cann</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Litvin</snm>
                  <fnm>TN</fnm>
               </au>
               <au>
                  <snm>Iourgenko</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Sinclair</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Levin</snm>
                  <fnm>LR</fnm>
               </au>
               <au>
                  <snm>Buck</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>289</volume>
            <issue>5479</issue>
            <fpage>625</fpage>
            <lpage>628</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10915626</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B106">
            <title>
               <p>Identification of a two-component regulatory system controlling methanol dehydrogenase synthesis in Paracoccus denitrificans</p>
            </title>
            <aug>
               <au>
                  <snm>Harms</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Reijnders</snm>
                  <fnm>WN</fnm>
               </au>
               <au>
                  <snm>Anazawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>van der Palen</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>van Spanning</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Oltmann</snm>
                  <fnm>LF</fnm>
               </au>
               <au>
                  <snm>Stouthamer</snm>
                  <fnm>AH</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1993</pubdate>
            <volume>8</volume>
            <issue>3</issue>
            <fpage>457</fpage>
            <lpage>470</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8392137</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B107">
            <title>
               <p>Nucleotide sequence of the mxcQ and mxcE genes, required for methanol dehydrogenase synthesis in Methylobacterium organophilum XX: a two-component regulatory system</p>
            </title>
            <aug>
               <au>
                  <snm>Xu</snm>
                  <fnm>HH</fnm>
               </au>
               <au>
                  <snm>Janka</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Viebahn</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hanson</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>1995</pubdate>
            <volume>141</volume>
            <issue>Pt 10</issue>
            <fpage>2543</fpage>
            <lpage>2551</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7582014</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B108">
            <title>
               <p>Genome sequence of the PCE-dechlorinating bacterium Dehalococcoides ethenogenes</p>
            </title>
            <aug>
               <au>
                  <snm>Seshadri</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Adrian</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fouts</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Phillippy</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Methe</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Ward</snm>
                  <fnm>NL</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Deboy</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Khouri</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Kolonay</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Daugherty</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Brinkac</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Madupu</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Kang</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Impraim</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tran</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Forberger</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Zinder</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Heidelberg</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>307</volume>
            <issue>5706</issue>
            <fpage>105</fpage>
            <lpage>108</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15637277</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B109">
            <title>
               <p>The last universal common ancestor (LUCA), simple or complex?</p>
            </title>
            <aug>
               <au>
                  <snm>Forterre</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Biol Bull</source>
            <pubdate>1999</pubdate>
            <volume>196</volume>
            <issue>3</issue>
            <fpage>373</fpage>
            <lpage>377</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11536914</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B110">
            <title>
               <p>Sequence - Evolution - Function. Computational Approaches in Comparative Genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
            </aug>
            <publisher>Boston , Kluwer Academic Publishers</publisher>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B111">
            <title>
               <p>Comparative analysis of a genome fragment of an uncultivated mesopelagic crenarchaeote reveals multiple horizontal gene transfers</p>
            </title>
            <aug>
               <au>
                  <snm>Lopez-Garcia</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Brochier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Moreira</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rodriguez-Valera</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Environ Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>6</volume>
            <issue>1</issue>
            <fpage>19</fpage>
            <lpage>34</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14686938</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B112">
            <title>
               <p>Diversity in chemotaxis mechanisms among the bacteria and archaea</p>
            </title>
            <aug>
               <au>
                  <snm>Szurmant</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Ordal</snm>
                  <fnm>GW</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>2004</pubdate>
            <volume>68</volume>
            <issue>2</issue>
            <fpage>301</fpage>
            <lpage>319</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419924</pubid>
                  <pubid idtype="pmpid" link="fulltext">15187186</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B113">
            <title>
               <p>Recent advances in the structure and assembly of the archaeal flagellum</p>
            </title>
            <aug>
               <au>
                  <snm>Bardy</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Ng</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Jarrell</snm>
                  <fnm>KF</fnm>
               </au>
            </aug>
            <source>J Mol Microbiol Biotechnol</source>
            <pubdate>2004</pubdate>
            <volume>7</volume>
            <issue>1-2</issue>
            <fpage>41</fpage>
            <lpage>51</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15170402</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B114">
            <title>
               <p>Cell polarity, intercellular signalling and morphogenetic cell movements in Myxococcus xanthus</p>
            </title>
            <aug>
               <au>
                  <snm>Sogaard-Andersen</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>7</volume>
            <issue>6</issue>
            <fpage>587</fpage>
            <lpage>593</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15556030</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B115">
            <title>
               <p>Birth and death of protein domains: A simple model of evolution explains power law behavior</p>
            </title>
            <aug>
               <au>
                  <snm>Karev</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Rzhetsky</snm>
                  <fnm>AY</fnm>
               </au>
               <au>
                  <snm>Berezovskaya</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2002</pubdate>
            <volume>2</volume>
            <issue>1</issue>
            <fpage>18</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">137606</pubid>
                  <pubid idtype="pmpid" link="fulltext">12379152</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B116">
            <title>
               <p>Chemosensory regulation of developmental gene expression in Myxococcus xanthus</p>
            </title>
            <aug>
               <au>
                  <snm>Kirby</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Zusman</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <issue>4</issue>
            <fpage>2008</fpage>
            <lpage>2013</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">149949</pubid>
                  <pubid idtype="pmpid" link="fulltext">12566562</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B117">
            <title>
               <p>Missing genes in metabolic pathways: a comparative genomics approach</p>
            </title>
            <aug>
               <au>
                  <snm>Osterman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Overbeek</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Curr Opin Chem Biol</source>
            <pubdate>2003</pubdate>
            <volume>7</volume>
            <issue>2</issue>
            <fpage>238</fpage>
            <lpage>251</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12714058</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B118">
            <title>
               <p>MetaCyc: a multiorganism database of metabolic pathways and enzymes</p>
            </title>
            <aug>
               <au>
                  <snm>Krieger</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mueller</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Paley</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Arnaud</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pick</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rhee</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Karp</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Database issue</issue>
            <fpage>D438</fpage>
            <lpage>42</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308834</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681452</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B119">
            <title>
               <p>Evolutionary connections between bacterial and eukaryotic signaling systems: a genomic perspective</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Anantharaman</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>LM</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>6</volume>
            <issue>5</issue>
            <fpage>490</fpage>
            <lpage>497</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14572542</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B120">
            <title>
               <p>In silico metabolic model and protein expression of Haemophilus influenzae strain Rd KW20 in rich medium</p>
            </title>
            <aug>
               <au>
                  <snm>Raghunathan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Price</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Purvine</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Picone</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Cherny</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Reilly</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Munson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Tyler</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Akerley</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Palsson</snm>
                  <fnm>BO</fnm>
               </au>
               <au>
                  <snm>Kolker</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>OMICS</source>
            <pubdate>2004</pubdate>
            <volume>8</volume>
            <issue>1</issue>
            <fpage>25</fpage>
            <lpage>41</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15107235</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B121">
            <title>
               <p>Network biology: understanding the cell's functional organization</p>
            </title>
            <aug>
               <au>
                  <snm>Barabasi</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>ZN</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>2</issue>
            <fpage>101</fpage>
            <lpage>113</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14735121</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B122">
            <title>
               <p>Topological units of environmental signal processing in the transcriptional regulatory network of Escherichia coli</p>
            </title>
            <aug>
               <au>
                  <snm>Balazsi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Barabasi</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>ZN</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B123">
            <title>
               <p>Entrez Genome Project</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/genomes/lproks.cgi</url>
         </bibl>
         <bibl id="B124">
            <title>
               <p>Genomic BLAST: custom-defined virtual databases for complete and unfinished genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Cummings</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Riley</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Black</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Souvorov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Resenchuk</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dondoshansky</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>2002</pubdate>
            <volume>216</volume>
            <issue>2</issue>
            <fpage>133</fpage>
            <lpage>138</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12435493</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B125">
            <title>
               <p>The cytoplasmic kinase domain of PhoR is sufficient for the low phosphate-inducible expression of pho regulon genes in Bacillus subtilis</p>
            </title>
            <aug>
               <au>
                  <snm>Shi</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hulett</snm>
                  <fnm>FM</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>31</volume>
            <issue>1</issue>
            <fpage>211</fpage>
            <lpage>222</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9987123</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B126">
            <title>
               <p>NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Pruitt</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Maglott</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D501</fpage>
            <lpage>D504</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">539979</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608248</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B127">
            <title>
               <p>The NCBI Reference Sequence collection </p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/RefSeq/</url>
         </bibl>
         <bibl id="B128">
            <title>
               <p>BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>1999</pubdate>
            <volume>174</volume>
            <issue>2</issue>
            <fpage>247</fpage>
            <lpage>250</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10339815</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B129">
            <title>
               <p>Topology prediction for helical transmembrane proteins at 86% accuracy</p>
            </title>
            <aug>
               <au>
                  <snm>Rost</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fariselli</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Casadio</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>1996</pubdate>
            <volume>5</volume>
            <issue>8</issue>
            <fpage>1704</fpage>
            <lpage>1718</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8844859</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B130">
            <title>
               <p>Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Krogh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Larsson</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>von Heijne</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>305</volume>
            <issue>3</issue>
            <fpage>567</fpage>
            <lpage>580</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11152613</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B131">
            <title>
               <p>The HMMTOP transmembrane topology prediction server</p>
            </title>
            <aug>
               <au>
                  <snm>Tusnady</snm>
                  <fnm>GE</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <issue>9</issue>
            <fpage>849</fpage>
            <lpage>850</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11590105</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
