<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-6-8</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Structural comparison of metabolic networks in selected single cell organisms</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Zhu</snm>
               <fnm>Dongxiao</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>zhud@umich.edu</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Qin</snm>
               <mi>S</mi>
               <fnm>Zhaohui</fnm>
               <insr iid="I2"/>
               <email>qin@umich.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Bioinformatics Program, University of Michigan, Ann Arbor, MI 48109, USA</p>
            </ins>
            <ins id="I2">
               <p>Center for Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Statistics, University of Michigan, Ann Arbor, MI 48109, USA</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2005</pubdate>
         <volume>6</volume>
         <issue>1</issue>
         <fpage>8</fpage>
         <url>http://www.biomedcentral.com/1471-2105/6/8</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">15649332</pubid>
               <pubid idtype="doi">10.1186/1471-2105-6-8</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>20</day>
               <month>7</month>
               <year>2004</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>14</day>
               <month>1</month>
               <year>2005</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>14</day>
               <month>1</month>
               <year>2005</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2005</year>
         <collab>Zhu and Qin; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>There has been tremendous interest in the study of biological network structure. An array of measurements has been conceived to assess the topological properties of these networks. In this study, we compared the metabolic network structures of eleven single cell organisms representing the three domains of life using these measurements, hoping to find out whether the intrinsic network design principle(s), reflected by these measurements, are different among species in the three domains of life.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Three groups of topological properties were used in this study: network indices, degree distribution measures and a motif profile measure. All of these are higher-level topological properties except for the marginal degree distribution. Metabolic networks in Archaeal species are found to be different from those in <it>S. cerevisiae </it>and the six Bacterial species in almost all measured higher-level topological properties. Our findings also indicate that the metabolic networks in Archaeal species are similar to the exponential random network.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>If these metabolic network properties of the organisms studied can be extended to other species in their respective domains (which is likely), then the design principle(s) of Archaea are fundamentally different from those of Bacteria and Eukaryote. Furthermore, the functional mechanisms of Archaeal metabolic networks revealed in this study differ significantly from those of Bacterial and Eukaryotic organisms, which warrants further investigation.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Classification of biological organisms is of fundamental importance to evolutionary studies. It is commonly believed that there are three domains of life: Archaea, Bacteria and Eukaryote. Currently, the most popular classification method is the so-called "molecular approach", in which polymorphism information in DNA or protein sequence is exploited to assess the phylogenetic relationships among species <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. To a large extent, this is a "local" approach, since the choice of sequence for comparison greatly affects the final result; "lateral gene transfer" (LGT) and the resulting "genome chimerism" further complicate the situation <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. A new "system" approach that takes "global" properties of each organism into consideration is a potential alternative that could overcome this shortcoming. Indeed, recent advances in systems biology and increasingly available genomic databases have made it possible to rebuild biological networks from genomic data and have offered an opportunity for such a "system" approach <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>Podani and co-workers <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> proposed classifying organisms based on two kinds of network indices: the Jaccard index, which measures the proportion of nodes shared by two networks, and the Goodman-Kruskal <it>&#947; </it>function, which measures the similarity between rankings of nodes in two networks. They studied metabolic and information network structures of 43 organisms using these two measures under the hypothesis that network structure and the network design principle(s) behind it contain phylogenetic information. Ma and Zeng <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> conducted a more extensive phylogenetic classification study on 82 fully sequenced organisms based on different cellular function systems (enzyme, reaction, and genes) at the genomic level. They constructed phylogenetic trees based on the Jaccard index and Korbel's definition, and concluded that, in general, the classification based on network indices is in good agreement with the one obtained by analyzing 16S rRNA using the molecular approach. These studies seem to support the notion that significant differences in the network design principle(s) exist among the three domains of life <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. These differences may reflect the different approaches that organisms have taken, over their evolutionary history, to organize their entire systems to serve their special needs in the environments in which they live. Motivated by these encouraging results, in this manuscript we conduct a thorough comparison of network structural properties, which provides further and more compelling evidence that significant differences exist among the network design principle(s) in organisms from the three domains of life.</p>
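         <p>As a minimal illustration of the Jaccard index mentioned above, the following sketch assumes the standard set-based definition (size of the intersection divided by size of the union of the two node sets). The function name and the metabolite labels are hypothetical and are not taken from the cited studies, whose exact formulation may differ.</p>
         <p>
```python
def jaccard_index(nodes_a, nodes_b):
    """Return the size of the intersection over the size of the union."""
    a, b = set(nodes_a), set(nodes_b)
    union = a.union(b)
    if not union:
        # Convention assumed here: two empty node sets count as identical.
        return 1.0
    return len(a.intersection(b)) / len(union)


# Hypothetical metabolite sets, for illustration only.
net_x = {"glucose", "pyruvate", "acetyl-CoA", "citrate"}
net_y = {"glucose", "pyruvate", "lactate"}

# Two shared nodes out of five distinct nodes overall.
similarity = jaccard_index(net_x, net_y)
```
         </p>
         <p>A value of 1 means the two networks contain exactly the same nodes, and a value of 0 means they share none.</p>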
         <p>Because theoretical studies of network structure are still limited, few deterministic and informative topological measurements are available <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. The established measurements can be roughly divided into two categories: higher-level (global) properties and low-level (local) properties. The difference between the two is that one needs to know the whole network in order to calculate the higher-level property measures (e.g. average path length), while the low-level properties can be worked out locally (e.g. marginal degree of an individual node) <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. We use three groups of topological measurements (both low and higher-level) that address different aspects of the network structure. The first group contains network indices such as the average clustering coefficient and the average path length <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. The second group is composed of degree distributions (both marginal and bivariate joint degree distributions) <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B13">13</abbr></abbrgrp>. The third group is composed of network motif profiles, which were recently shown to represent the network design principle(s) and global statistical properties of the network when aggregated <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. These measurements have been well studied in the network literature, and are able to capture most aspects of network degree information.</p>
         <p>Single cell model organisms such as <it>E. coli </it>and <it>S. cerevisiae </it>have been studied intensively in biochemistry, cell biology and genetics; hence the rebuilt networks in those organisms present the best chance to approximate the true underlying network. Moreover, single cell organisms are less likely to have experienced a Whole Genome Duplication (WGD), which might drastically change the network structure <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. As a result, we selected eleven single cell organisms to study their network structural properties: one Eukaryote: <it>S. cerevisiae</it>; six Bacteria: <it>E. coli</it>, <it>V. cholerae</it>, <it>R. solanacearum</it>, <it>B. subtilis</it>, <it>L. lactis</it>, <it>S. coelicolor</it>; and four Archaea: <it>S. solfataricus</it>, <it>S. tokodaii</it>, <it>M. acetivorans</it>, <it>T. acidophilum</it>.</p>
         <p>There are three main types of intracellular networks: the protein-protein interaction network, the transcriptional regulation network and the metabolic network. The first two are rebuilt by using high throughput techniques such as the yeast two-hybrid system, <it>in vivo </it>pull down assays or DNA microarrays, which are subject to high uncertainties, and the resulting networks may not be a good approximation to biological complexity <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. On the other hand, the metabolic network is derived from metabolic pathways, many of which are inferred from biochemical experiment-defined stoichiometries of many reactions <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. It is well known that central pathways contain "hub nodes" of the whole metabolic network <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp> and are also the main building blocks of the so-called Giant Strongly Connected Component (GSCC) and Giant Weakly Connected Component (GWCC) <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. The former is defined as the largest cluster of nodes within which any pair of nodes is mutually reachable, and the latter is defined as the largest cluster of nodes within which each pair of nodes is connected in the underlying undirected graph <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Therefore, our high confidence in the structure of the GSCC and GWCC, based on experimentally verified pathways, guarantees high confidence in the whole network structure. The long history of biochemical studies of enzymes ensures relatively low false positive and low false negative rates of connections. For these reasons, we decided to use metabolic networks in single cell organisms to compare network topological properties in the three domains of life.</p>
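         <p>The two definitions above can be made concrete with a short sketch. It assumes the network is given as a list of directed arcs (u, v) and uses plain reachability searches rather than any particular graph library; the function names and the toy arcs are hypothetical, and this is not the software used in the study.</p>
         <p>
```python
from collections import defaultdict


def reachable(start, adj):
    """Return the set of nodes reachable from start by following adj."""
    seen = {start}
    stack = [start]
    while stack:
        node = stack.pop()
        for nxt in adj.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen


def largest_component(nodes, component_of):
    """Split nodes into components and return the largest one."""
    best, remaining = set(), set(nodes)
    while remaining:
        comp = component_of(next(iter(remaining)))
        remaining -= comp
        if len(comp) > len(best):
            best = comp
    return best


def giant_components(arcs):
    """Return (GSCC, GWCC) as node sets for a list of directed arcs."""
    fwd, bwd, und = defaultdict(set), defaultdict(set), defaultdict(set)
    nodes = set()
    for u, v in arcs:
        fwd[u].add(v)   # arc direction as given
        bwd[v].add(u)   # reversed arc, used to find nodes that can reach v
        und[u].add(v)   # underlying undirected graph
        und[v].add(u)
        nodes.update((u, v))
    # Strong component of s: nodes reachable from s that can also reach s.
    gscc = largest_component(
        nodes, lambda s: reachable(s, fwd).intersection(reachable(s, bwd)))
    # Weak component of s: nodes connected to s when direction is ignored.
    gwcc = largest_component(nodes, lambda s: reachable(s, und))
    return gscc, gwcc


# Toy example: a directed 3-cycle with two attached nodes and a separate arc.
toy_arcs = [("a", "b"), ("b", "c"), ("c", "a"), ("c", "d"), ("e", "d"), ("x", "y")]
gscc, gwcc = giant_components(toy_arcs)
```
         </p>
         <p>In the toy example, the cycle a, b, c forms the GSCC, while the GWCC additionally contains d and e because they are connected to the cycle once arc direction is ignored.</p>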
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>In constructing metabolic networks, Ma and Zeng <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> argued that connections through "current metabolites", which are referred to as cofactors in biochemistry (for example ATP, ADP and H<sub>2</sub>O), should be removed from metabolic networks. We followed their suggestion and removed such "current metabolites" before conducting the following analysis.</p>
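         <p>A simplified sketch of this preprocessing step is given below. It assumes that the network is a list of directed arcs between metabolite names and that every arc touching a listed current metabolite is dropped; the actual procedure of the cited work may be more selective. The set of names contains only the three examples named in the text, and the arcs are hypothetical.</p>
         <p>
```python
# Only the examples named in the text; a real list would be longer (assumption).
CURRENT_METABOLITES = frozenset({"ATP", "ADP", "H2O"})


def remove_current_metabolites(arcs, current=CURRENT_METABOLITES):
    """Keep only arcs whose two endpoints are both ordinary metabolites."""
    return [(u, v) for u, v in arcs if u not in current and v not in current]


# Hypothetical arcs, for illustration only.
raw_arcs = [
    ("glucose", "G6P"),
    ("ATP", "G6P"),
    ("glucose", "ADP"),
    ("G6P", "F6P"),
]
filtered_arcs = remove_current_metabolites(raw_arcs)
```
         </p>
         <p>Only the two arcs between ordinary metabolites remain, so paths can no longer take shortcuts through ATP or ADP.</p>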
         <sec>
            <st>
               <p>Group I measures: network indices</p>
            </st>
            <p>Before examining the different types of network topological measurements, we visually compared the metabolic networks (Fig. <figr fid="F1">1</figr>). Metabolic networks in <it>S. cerevisiae </it>and the six Bacterial species appear much more heterogeneous than Archaeal metabolic networks. It is well known that the so-called exponential random network (whose marginal degree distribution follows a Poisson distribution, see Methods for details) appears homogeneous, while a scale-free network (whose marginal degree distribution follows a power-law distribution, see Methods for details) appears more heterogeneous and modular <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Visualizations of metabolic networks in the eleven species</p>
               </caption>
               <text>
                  <p><b>Visualizations of metabolic networks in the eleven organisms. </b>In each graph, green lines represent arcs and red lines represent edges. The numbers of distinct metabolites that are involved in at least one reaction are noted. All graphs are drawn with Pajek [33] using the Kamada-Kawai layout optimization algorithm.</p>
               </text>
               <graphic file="1471-2105-6-8-1"/>
            </fig>
            <p>Calculations of the two classic network indices, average clustering coefficient and average betweenness (see Methods for definitions), also indicate that the metabolic networks in <it>S. cerevisiae </it>and the six Bacterial species are more clustered and modular than those in the four Archaeal species (Table <tblr tid="T1">1</tblr>, Fig. <figr fid="F2">2</figr>). From Table <tblr tid="T1">1</tblr> and Fig. <figr fid="F2">2</figr>, it is evident that the Clustering Coefficient (C) and Betweenness (B) separate Archaeal species from non-Archaeal species better than Average Path Length (L) and Diameter (D) do. Note that since we removed connections through "current metabolites" when constructing metabolic networks, our average path lengths are much longer than those reported in Jeong et al. <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> but similar to those reported in Ma and Zeng <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Descriptive statistics of metabolic networks in the eleven organisms.</p>
               </caption>
               <tblbdy cols="13">
                  <r>
                     <c cspan="3" ca="center">
                        <p>
                           <b>DOMAIN, KINGDOM AND PHYLUM</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>ORGANISM</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>NUM NODES</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>NUM EDGES</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>SINGLE EDGES</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MUTUAL EDGES</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>C<sub>OUT</sub></b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>C<sub>IN</sub></b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>B</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>L</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>D</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="13">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="3" ca="center">
                        <p>Eukarya</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>S. cerevisiae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>748</p>
                     </c>
                     <c ca="center">
                        <p>1072</p>
                     </c>
                     <c ca="center">
                        <p>396</p>
                     </c>
                     <c ca="center">
                        <p>338</p>
                     </c>
                     <c ca="center">
                        <p>0.066</p>
                     </c>
                     <c ca="center">
                        <p>0.062</p>
                     </c>
                     <c ca="center">
                        <p>0.053</p>
                     </c>
                     <c ca="center">
                        <p>12.147</p>
                     </c>
                     <c ca="center">
                        <p>49</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Bacteria</p>
                     </c>
                     <c ca="center">
                        <p>Proteobacteria</p>
                     </c>
                     <c ca="center">
                        <p>gamma</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>E. coli</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>893</p>
                     </c>
                     <c ca="center">
                        <p>1365</p>
                     </c>
                     <c ca="center">
                        <p>459</p>
                     </c>
                     <c ca="center">
                        <p>453</p>
                     </c>
                     <c ca="center">
                        <p>0.060</p>
                     </c>
                     <c ca="center">
                        <p>0.070</p>
                     </c>
                     <c ca="center">
                        <p>0.070</p>
                     </c>
                     <c ca="center">
                        <p>9.281</p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <it>V. cholerae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>738</p>
                     </c>
                     <c ca="center">
                        <p>1076</p>
                     </c>
                     <c ca="center">
                        <p>370</p>
                     </c>
                     <c ca="center">
                        <p>353</p>
                     </c>
                     <c ca="center">
                        <p>0.057</p>
                     </c>
                     <c ca="center">
                        <p>0.055</p>
                     </c>
                     <c ca="center">
                        <p>0.045</p>
                     </c>
                     <c ca="center">
                        <p>8.236</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>beta</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>R. solanacearum</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>864</p>
                     </c>
                     <c ca="center">
                        <p>1238</p>
                     </c>
                     <c ca="center">
                        <p>406</p>
                     </c>
                     <c ca="center">
                        <p>416</p>
                     </c>
                     <c ca="center">
                        <p>0.044</p>
                     </c>
                     <c ca="center">
                        <p>0.044</p>
                     </c>
                     <c ca="center">
                        <p>0.049</p>
                     </c>
                     <c ca="center">
                        <p>10.358</p>
                     </c>
                     <c ca="center">
                        <p>43</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Firmicutes</p>
                     </c>
                     <c ca="center">
                        <p>Bacillales</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>B. subtilis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>787</p>
                     </c>
                     <c ca="center">
                        <p>1151</p>
                     </c>
                     <c ca="center">
                        <p>401</p>
                     </c>
                     <c ca="center">
                        <p>375</p>
                     </c>
                     <c ca="center">
                        <p>0.061</p>
                     </c>
                     <c ca="center">
                        <p>0.063</p>
                     </c>
                     <c ca="center">
                        <p>0.047</p>
                     </c>
                     <c ca="center">
                        <p>10.020</p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Lactobacillales</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>L. lactis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>545</p>
                     </c>
                     <c ca="center">
                        <p>778</p>
                     </c>
                     <c ca="center">
                        <p>280</p>
                     </c>
                     <c ca="center">
                        <p>249</p>
                     </c>
                     <c ca="center">
                        <p>0.044</p>
                     </c>
                     <c ca="center">
                        <p>0.043</p>
                     </c>
                     <c ca="center">
                        <p>0.068</p>
                     </c>
                     <c ca="center">
                        <p>9.277</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Actinobacteria</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>S. coelicolor</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>814</p>
                     </c>
                     <c ca="center">
                        <p>1154</p>
                     </c>
                     <c ca="center">
                        <p>406</p>
                     </c>
                     <c ca="center">
                        <p>374</p>
                     </c>
                     <c ca="center">
                        <p>0.046</p>
                     </c>
                     <c ca="center">
                        <p>0.047</p>
                     </c>
                     <c ca="center">
                        <p>0.047</p>
                     </c>
                     <c ca="center">
                        <p>15.062</p>
                     </c>
                     <c ca="center">
                        <p>66</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Archaea</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>M. acetivorans</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>489</p>
                     </c>
                     <c ca="center">
                        <p>633</p>
                     </c>
                     <c ca="center">
                        <p>209</p>
                     </c>
                     <c ca="center">
                        <p>212</p>
                     </c>
                     <c ca="center">
                        <p>0.029</p>
                     </c>
                     <c ca="center">
                        <p>0.033</p>
                     </c>
                     <c ca="center">
                        <p>0.026</p>
                     </c>
                     <c ca="center">
                        <p>11.350</p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <it>T. acidophilum</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>458</p>
                     </c>
                     <c ca="center">
                        <p>593</p>
                     </c>
                     <c ca="center">
                        <p>197</p>
                     </c>
                     <c ca="center">
                        <p>198</p>
                     </c>
                     <c ca="center">
                        <p>0.034</p>
                     </c>
                     <c ca="center">
                        <p>0.036</p>
                     </c>
                     <c ca="center">
                        <p>0.030</p>
                     </c>
                     <c ca="center">
                        <p>10.597</p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Crenarchaeota</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>S. solfataricus</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>586</p>
                     </c>
                     <c ca="center">
                        <p>730</p>
                     </c>
                     <c ca="center">
                        <p>256</p>
                     </c>
                     <c ca="center">
                        <p>237</p>
                     </c>
                     <c ca="center">
                        <p>0.022</p>
                     </c>
                     <c ca="center">
                        <p>0.021</p>
                     </c>
                     <c ca="center">
                        <p>0.018</p>
                     </c>
                     <c ca="center">
                        <p>8.053</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <it>S. tokodaii</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>522</p>
                     </c>
                     <c ca="center">
                        <p>651</p>
                     </c>
                     <c ca="center">
                        <p>229</p>
                     </c>
                     <c ca="center">
                        <p>211</p>
                     </c>
                     <c ca="center">
                        <p>0.021</p>
                     </c>
                     <c ca="center">
                        <p>0.024</p>
                     </c>
                     <c ca="center">
                        <p>0.017</p>
                     </c>
                     <c ca="center">
                        <p>8.424</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The column marked "Num nodes" lists the number of metabolites that are involved in at least one chemical reaction in the organism. The column marked "Num edges" lists the total number of directed edges (chemical reactions) in the organism. This number has two parts: the number of irreversible reactions ("Single edges") and the number of reversible reactions ("Mutual edges"), each of which contributes two directed edges, so that "Num edges" = "Single edges" + 2 &#215; "Mutual edges". The column marked "<it>C</it><sub><it>out</it></sub>" lists the average clustering coefficient calculated from the nearest neighbors in the out-component. The column marked "<it>C</it><sub><it>in</it></sub>" lists the average clustering coefficient calculated from the nearest neighbors in the in-component. The column marked "B" lists the average betweenness of the network, the column marked "L" lists the average path length of the network, and the column marked "D" lists the diameter of the network.</p>
               </tblfn>
            </tbl>
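            <p>The edge-count relation in the table footnote can be checked directly against the two rows shown above. The following is only an illustrative sketch: the function name <it>total_directed_edges</it> is hypothetical, and the assignment of the numbers 730/256/237 and 651/229/211 to the "Num edges", "Single edges" and "Mutual edges" columns is an assumption read off the column order of the table.</p>

```python
def total_directed_edges(single_edges, mutual_edges):
    """Each irreversible reaction adds one arc; each reversible one adds two."""
    return single_edges + 2 * mutual_edges


# (Num edges, Single edges, Mutual edges) as read from the two rows above;
# the mapping of numbers to columns is an assumption of this sketch.
rows = {
    "S. solfataricus": (730, 256, 237),
    "S. tokodaii": (651, 229, 211),
}

for organism, (num_edges, single, mutual) in rows.items():
    consistent = total_directed_edges(single, mutual) == num_edges
    print(organism, total_directed_edges(single, mutual), consistent)
```

            <p>Both rows satisfy the relation, which is a quick sanity check that reversible reactions are counted as two arcs.</p>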
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Five network indices (Clustering Coefficients (C<sub>out</sub>, C<sub>in</sub>), Betweenness (B), Average Path Length (L) and Diameter (D)) of the metabolic networks in the eleven organisms</p>
               </caption>
               <text>
                  <p>Five network indices (Clustering Coefficients (C<sub>out</sub>, C<sub>in</sub>), Betweenness (B), Average Path Length (L) and Diameter (D)) of the metabolic networks in the eleven organisms.</p>
               </text>
               <graphic file="1471-2105-6-8-2"/>
            </fig>
            <p>To avoid the confounding effects of different network sizes, we calculated the so-called concentrations of three-node and four-node subgraphs, i.e. the number of appearances of a subgraph divided by the number of nodes with edges or arcs (directed edges); see Methods for details. The concentration of subgraphs is an objective measure of the extent of clustering and modularity of the network <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. The concentrations of subgraphs in <it>S. cerevisiae </it>and the six Bacterial metabolic networks are much higher than those in the Archaeal metabolic networks (Fig. <figr fid="F3">3</figr>).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Concentrations (number of appearances divided by number of nodes with edges/arcs) of three-node and four-node subgraphs</p>
               </caption>
               <text>
                  <p>Concentrations (number of appearances divided by number of nodes with edges/arcs) of three-node and four-node subgraphs.</p>
               </text>
               <graphic file="1471-2105-6-8-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Group II measures: degree distributions</p>
            </st>
            <sec>
               <st>
                  <p>Marginal degree distributions</p>
               </st>
               <p>Recently, a variety of real-life networks have been found to share the "scale-free" property, i.e. the marginal degree distribution follows a power-law distribution <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. Our analysis shows that the outgoing and incoming marginal degree distributions in metabolic networks also follow a power-law distribution. A simple linear model fits the log-transformed data well (except for the incoming degree distributions of most of the Archaea), which indicates that, in general, the power-law model is appropriate to capture the structure of the degree data (Fig. <figr fid="F4">4</figr>). Parameters were estimated by the least squares method. The results, together with the goodness-of-fit measure <it>R</it><sup>2 </sup>and 95% individual confidence intervals, are summarized in Table <tblr tid="T2">2</tblr> and Table <tblr tid="T3">3</tblr>. The estimated power-law index <it>&#947; </it>lies between -0.39 and -0.24 in all cases, and the estimated log-transformed scaling parameter log<it>&#945; </it>ranges from 2.00 to 2.53. This indicates that the marginal degree distribution, which is a low-level (local) measure of topology, shows some distinction between domains but is not enough to differentiate networks from different domains effectively. Overall, the metabolic networks in most of the species we studied seem to follow power-law distributions and are thus "scale-free". The fact that the incoming degree distributions of most Archaeal species we studied do not follow the power law well (Fig. <figr fid="F4">4B</figr>) suggests that networks in Archaeal species tend to be less "scale-free" and more "random-like" than those of the non-Archaeal species.</p>
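               <p>The fitting step can be sketched as an ordinary least squares regression on log-transformed data, following the model log <it>P</it>(<it>K</it>) = <it>&#947; </it>log(<it>K</it>) + log(<it>&#945;</it>). This is a minimal illustration and not the authors' code: the function name <it>fit_power_law</it>, the use of base-10 logarithms and the synthetic input data are all assumptions made for the example.</p>

```python
import math


def fit_power_law(degrees, frequencies):
    """Least squares fit of log10 P(K) = gamma * log10(K) + log_alpha.

    Returns (gamma, log_alpha, r_squared). The base-10 logarithm is an
    assumption of this sketch; the source does not state the base.
    """
    xs = [math.log10(k) for k in degrees]
    ys = [math.log10(p) for p in frequencies]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    gamma = sxy / sxx                       # slope: power-law index
    log_alpha = mean_y - gamma * mean_x     # intercept: log scaling parameter
    ss_res = sum((y - (gamma * x + log_alpha)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    r_squared = 1.0 - ss_res / ss_tot       # goodness of fit
    return gamma, log_alpha, r_squared


# Synthetic (hypothetical) degree data that follow an exact power law.
example_degrees = [1, 2, 4, 8, 16]
example_frequencies = [100.0 * k ** -2.0 for k in example_degrees]
gamma, log_alpha, r_squared = fit_power_law(example_degrees, example_frequencies)
print(round(gamma, 3), round(log_alpha, 3), round(r_squared, 3))
```

               <p>On real degree data the fit is not exact, and a low <it>R</it><sup>2</sup> (as reported for the incoming degrees of most Archaea) signals that the straight-line model on the log-log scale is a poor description.</p>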
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p>(A) Log transformed marginal outgoing degree distributions (B) Log transformed marginal incoming degree distributions in the eleven organisms</p>
                  </caption>
                  <text>
                     <p>(A) Log transformed marginal outgoing degree distributions (B) Log transformed marginal incoming degree distributions in the eleven organisms</p>
                  </text>
                  <graphic file="1471-2105-6-8-4"/>
               </fig>
               <tbl id="T2">
                  <title>
                     <p>Table 2</p>
                  </title>
                  <caption>
                     <p>Parameter estimates of <it>&#947; </it>and log<it>&#945; </it>in the outgoing degree distribution model.</p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>
                              <b>R<sup>2</sup></b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b><it>&#947;</it>, 95% C.I.</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>log<it>&#945;</it>, 95% C.I.</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>S. cerevisiae </it></b>(748)</p>
                        </c>
                        <c ca="left">
                           <p>0.96</p>
                        </c>
                        <c ca="center">
                           <p>-0.39, [-0.46, -0.31]</p>
                        </c>
                        <c ca="left">
                           <p>2.53, [2.29, 2.78]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>E. coli </it></b>(893)</p>
                        </c>
                        <c ca="left">
                           <p>0.92</p>
                        </c>
                        <c ca="center">
                           <p>-0.36, [-0.43, -0.28]</p>
                        </c>
                        <c ca="left">
                           <p>2.51, [2.29, 2.74]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>V. cholerae </it></b>(738)</p>
                        </c>
                        <c ca="left">
                           <p>0.91</p>
                        </c>
                        <c ca="center">
                           <p>-0.36, [-0.44, -0.28]</p>
                        </c>
                        <c ca="left">
                           <p>2.45, [2.22, 2.68]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>R. solanacearum </it></b>(864)</p>
                        </c>
                        <c ca="left">
                           <p>0.96</p>
                        </c>
                        <c ca="center">
                           <p>-0.37, [-0.43, -0.31]</p>
                        </c>
                        <c ca="left">
                           <p>2.50, [2.32, 2.68]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>B. subtilis </it></b>(787)</p>
                        </c>
                        <c ca="left">
                           <p>0.92</p>
                        </c>
                        <c ca="center">
                           <p>-0.36, [-0.44, -0.28]</p>
                        </c>
                        <c ca="left">
                           <p>2.46, [2.23, 2.68]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>L. lactis </it></b>(545)</p>
                        </c>
                        <c ca="left">
                           <p>0.95</p>
                        </c>
                        <c ca="center">
                           <p>-0.38, [-0.45, -0.31]</p>
                        </c>
                        <c ca="left">
                           <p>2.39, [2.20, 2.58]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>S. coelicolor </it></b>(814)</p>
                        </c>
                        <c ca="left">
                           <p>0.95</p>
                        </c>
                        <c ca="center">
                           <p>-0.36, [-0.43, -0.30]</p>
                        </c>
                        <c ca="left">
                           <p>2.47, [2.29, 2.65]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>S. solfataricus </it></b>(586)</p>
                        </c>
                        <c ca="left">
                           <p>0.92</p>
                        </c>
                        <c ca="center">
                           <p>-0.33, [-0.43, -0.23]</p>
                        </c>
                        <c ca="left">
                           <p>2.17, [1.86, 2.49]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>S. tokodaii </it></b>(445)</p>
                        </c>
                        <c ca="left">
                           <p>0.94</p>
                        </c>
                        <c ca="center">
                           <p>-0.34, [-0.42, -0.25]</p>
                        </c>
                        <c ca="left">
                           <p>2.15, [1.88, 2.42]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>T. acidophilum </it></b>(458)</p>
                        </c>
                        <c ca="left">
                           <p>0.97</p>
                        </c>
                        <c ca="center">
                           <p>-0.37, [-0.44, -0.31]</p>
                        </c>
                        <c ca="left">
                           <p>2.25, [2.05, 2.45]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>M. acetivorans </it></b>(489)</p>
                        </c>
                        <c ca="left">
                           <p>0.86</p>
                        </c>
                        <c ca="center">
                           <p>-0.33, [-0.46, -0.20]</p>
                        </c>
                        <c ca="left">
                           <p>2.13, [1.73, 2.53]</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>Model: log <it>P</it>(<it>K</it><sub><it>i</it></sub>) = <it>&#947; </it>log(<it>K</it><sub><it>i</it></sub>) + log(<it>&#945;</it>) + <it>&#949;</it><sub><it>i </it></sub>(<it>i </it>= 1,2,...,<it>n</it>). Parameters were estimated by the least squares method.</p>
                  </tblfn>
               </tbl>
               <tbl id="T3">
                  <title>
                     <p>Table 3</p>
                  </title>
                  <caption>
                     <p>Parameter estimates of <it>&#947; </it>and log<it>&#945; </it>in the incoming degree distribution model.</p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>
                              <b>R<sup>2</sup></b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b><it>&#947;</it>, 95% C.I.</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>log<it>&#945;</it>, 95% C.I.</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>S. cerevisiae </it></b>(748)</p>
                        </c>
                        <c ca="left">
                           <p>0.95</p>
                        </c>
                        <c ca="center">
                           <p>-0.35, [-0.42, -0.29]</p>
                        </c>
                        <c ca="left">
                           <p>2.38, [2.18, 2.59]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>E. coli </it></b>(893)</p>
                        </c>
                        <c ca="left">
                           <p>0.90</p>
                        </c>
                        <c ca="center">
                           <p>-0.35, [-0.42, -0.28]</p>
                        </c>
                        <c ca="left">
                           <p>2.50, [2.29, 2.70]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>V. cholerae </it></b>(738)</p>
                        </c>
                        <c ca="left">
                           <p>0.91</p>
                        </c>
                        <c ca="center">
                           <p>-0.36, [-0.44, -0.28]</p>
                        </c>
                        <c ca="left">
                           <p>2.45, [2.22, 2.68]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>R. solanacearum </it></b>(864)</p>
                        </c>
                        <c ca="left">
                           <p>0.96</p>
                        </c>
                        <c ca="center">
                           <p>-0.37, [-0.42, -0.31]</p>
                        </c>
                        <c ca="left">
                           <p>2.50, [2.32, 2.68]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>B. subtilis </it></b>(787)</p>
                        </c>
                        <c ca="left">
                           <p>0.92</p>
                        </c>
                        <c ca="center">
                           <p>-0.36, [-0.43, -0.28]</p>
                        </c>
                        <c ca="left">
                           <p>2.46, [2.23, 2.68]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>L. lactis </it></b>(545)</p>
                        </c>
                        <c ca="left">
                           <p>0.95</p>
                        </c>
                        <c ca="center">
                           <p>-0.38, [-0.45, -0.31]</p>
                        </c>
                        <c ca="left">
                           <p>2.40, [2.20, 2.58]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>S. coelicolor </it></b>(814)</p>
                        </c>
                        <c ca="left">
                           <p>0.95</p>
                        </c>
                        <c ca="center">
                           <p>-0.36, [-0.43, -0.30]</p>
                        </c>
                        <c ca="left">
                           <p>2.47, [2.29, 2.65]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>S. solfataricus </it></b>(586)</p>
                        </c>
                        <c ca="left">
                           <p>0.45</p>
                        </c>
                        <c ca="center">
                           <p>-0.24, [-0.41, -0.07]</p>
                        </c>
                        <c ca="left">
                           <p>2.21, [1.76, 2.68]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>S. tokodaii </it></b>(445)</p>
                        </c>
                        <c ca="left">
                           <p>0.46</p>
                        </c>
                        <c ca="center">
                           <p>-0.25, [-0.42, -0.08]</p>
                        </c>
                        <c ca="left">
                           <p>2.21, [1.76, 2.65]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>T. acidophilum </it></b>(458)</p>
                        </c>
                        <c ca="left">
                           <p>0.89</p>
                        </c>
                        <c ca="center">
                           <p>-0.30, [-0.41, -0.20]</p>
                        </c>
                        <c ca="left">
                           <p>2.00, [1.69, 2.32]</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><b><it>M. acetivorans </it></b>(489)</p>
                        </c>
                        <c ca="left">
                           <p>0.46</p>
                        </c>
                        <c ca="center">
                           <p>-0.25, [-0.43, -0.08]</p>
                        </c>
                        <c ca="left">
                           <p>2.20, [1.75, 2.65]</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>Model: log <it>P</it>(<it>K</it><sub><it>i</it></sub>) = <it>&#947; </it>log(<it>K</it><sub><it>i</it></sub>) + log(<it>&#945;</it>) + <it>&#949;</it><sub><it>i </it></sub>(<it>i </it>= 1,2,...,<it>n</it>). Parameters were estimated by the least squares method.</p>
                  </tblfn>
               </tbl>
               <p>As we have shown, the marginal degree distribution alone does not reveal the fundamental structural differences between the networks of the Archaeal and the non-Archaeal species. Simulation studies have shown that randomized networks preserving the marginal degree distribution can be quite different in terms of global (higher-level) topological properties such as the average clustering coefficient <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. In metabolic networks, we are unable to determine the preferred types of reactions from the marginal substrate or product degree distributions alone. Since the metabolic network is rebuilt from chemical reactions, the joint behavior of substrate and product in reactions should be more informative than the separate behavior of individual metabolites. Therefore, we calculated the joint degree distributions to gain more insight into the network organization.</p>
            </sec>
            <sec>
               <st>
                  <p>Joint degree distributions</p>
               </st>
               <p>The joint degree distribution describes the correlation between the connectivities of neighboring nodes. <it>N</it>(<it>K</it><sub>0</sub>, <it>K</it><sub>1</sub>) is defined as the number of edges connecting nodes of connectivity <it>K</it><sub>0 </sub>to those of connectivity <it>K</it><sub>1</sub>. For metabolic networks, which are directed, <it>N</it>(<it>K</it><sub><it>out</it></sub>, <it>K</it><sub><it>in</it></sub>) is the number of arcs in which a substrate (node) with out-connectivity <it>K</it><sub><it>out </it></sub>is transformed into a product with in-connectivity <it>K</it><sub><it>in</it></sub>. This quantity reflects intrinsic properties of the network and can be used to distinguish different types of networks. For instance, we can test whether <it>N</it>(<it>K</it><sub><it>out</it></sub>, <it>K</it><sub><it>in</it></sub>) of a particular network differs significantly from that of a random network. Specifically, we calculate <graphic file="1471-2105-6-8-i1.gif"/>, where <graphic file="1471-2105-6-8-i2.gif"/>(<it>K</it><sub><it>out</it></sub>, <it>K</it><sub><it>in</it></sub>) is the mean of the random variable <it>N</it>(<it>K</it><sub><it>out</it></sub>, <it>K</it><sub><it>in</it></sub>) over a large number (say, 1000) of random networks simulated by the edge-rewiring algorithm proposed by Maslov and Sneppen <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, and <graphic file="1471-2105-6-8-i3.gif"/>(<it>K</it><sub><it>out</it></sub>, <it>K</it><sub><it>in</it></sub>) is the estimated standard deviation of <it>N</it>(<it>K</it><sub><it>out</it></sub>, <it>K</it><sub><it>in</it></sub>) over the same random networks. The <it>p</it>-value can then be obtained by comparing <it>Z </it>to a standard normal distribution. Comparing with "properly" randomized network ensembles allows us to concentrate on those statistically significant patterns of the complex network that are likely to reflect the design principle(s) <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>.</p>
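               <p>The Z-score calculation described above can be sketched as follows. This is an illustration under stated assumptions, not the Matlab program of Maslov and Sneppen: the function names, the toy network, the number of swaps per randomization (ten per arc) and the two-sided form of the <it>p</it>-value are all choices made for this example. The rewiring step swaps the products of two randomly chosen arcs and rejects swaps that would create a self-loop or a duplicate arc, so every node keeps its in-degree and out-degree.</p>

```python
import math
import random
from collections import Counter
from statistics import mean, stdev


def joint_degree_counts(edges):
    """Count arcs by (out-degree of substrate, in-degree of product)."""
    out_deg = Counter(src for src, _ in edges)
    in_deg = Counter(dst for _, dst in edges)
    return Counter((out_deg[src], in_deg[dst]) for src, dst in edges)


def rewire(edges, n_swaps, rng):
    """Degree-preserving randomization by swapping the targets of arc pairs."""
    edges = list(edges)
    present = set(edges)
    for _ in range(n_swaps):
        i, j = rng.sample(range(len(edges)), 2)
        (a, b), (c, d) = edges[i], edges[j]
        new1, new2 = (a, d), (c, b)
        # Reject swaps that would create a self-loop or a duplicate arc.
        if a == d or c == b or new1 in present or new2 in present:
            continue
        present -= {edges[i], edges[j]}
        present |= {new1, new2}
        edges[i], edges[j] = new1, new2
    return edges


def z_scores(edges, n_random=200, seed=0):
    """Z = (N_observed - mean of N_random) / sd of N_random, per degree pair."""
    rng = random.Random(seed)
    observed = joint_degree_counts(edges)
    samples = [joint_degree_counts(rewire(edges, 10 * len(edges), rng))
               for _ in range(n_random)]
    cells = set(observed)
    for sample in samples:
        cells |= set(sample)
    scores = {}
    for cell in cells:
        values = [sample[cell] for sample in samples]
        sd = stdev(values)
        if sd > 0:  # Z is undefined when the random ensemble never varies
            scores[cell] = (observed[cell] - mean(values)) / sd
    return scores


def two_sided_p_value(z):
    """Two-sided tail probability of a standard normal variable."""
    return math.erfc(abs(z) / math.sqrt(2.0))


# Hypothetical toy network: arcs point from substrate to product.
toy_edges = [("a", "b"), ("a", "c"), ("a", "d"), ("b", "c"),
             ("c", "d"), ("d", "e"), ("e", "a"), ("e", "b")]
for cell, z in sorted(z_scores(toy_edges).items()):
    print(cell, round(z, 2), round(two_sided_p_value(z), 3))
```

               <p>Because the rewiring preserves every node's in-degree and out-degree, any significant Z-score reflects how degrees are paired across reactions rather than the marginal degree distributions themselves.</p>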
               <p>We calculated statistically significant correlation profiles (Z-score profiles, see Methods for details) for the metabolic network in each organism (Fig. <figr fid="F5">5</figr>). The Z-score profiles of the four Archaeal species are similar to each other but quite different from those of <it>S. cerevisiae </it>and the six Bacterial species. Although the dark red regions of the Z-score profiles in the Archaeal species are quite different in scale, they all seem to differ significantly, and in a similar way, from the random network preserving the corresponding marginal degree distribution (<it>p</it>-value &lt; 0.1). Looking into the correlation profiles more carefully, we found that the number of statistically significant positive <graphic file="1471-2105-6-8-i4.gif"/>(<it>K</it><sub><it>out</it></sub>, <it>K</it><sub><it>in</it></sub>) increases in the order of <it>S. cerevisiae</it>, the six Bacterial species and the four Archaeal species. A significant Z-score for an observed <it>N</it>(<it>K</it><sub><it>out</it></sub>, <it>K</it><sub><it>in</it></sub>) implies that chemical reactions between substrates with out-degree <it>K</it><sub><it>out </it></sub>and products with in-degree <it>K</it><sub><it>in </it></sub>occur significantly more or less often than in the randomized networks. We define substrates whose <it>K</it><sub><it>out </it></sub>&#8805; 2 or products whose <it>K</it><sub><it>in </it></sub>&#8805; 2 as versatile metabolites. Thus, the above trend implies that the preference to employ reactions involving versatile metabolites increases in the order of <it>S. cerevisiae</it>, the six Bacterial species and the four Archaeal species. Correspondingly, the variety of metabolites decreases in the same order, and so does the number of distinct enzymes (the variety of enzymes), because binding between metabolites and enzymes is highly specific. This is consistent with the biological fact that <it>S. cerevisiae </it>(Eukaryote) encodes a greater variety of enzymes than Bacterial and Archaeal species.</p>
               <fig id="F5">
                  <title>
                     <p>Figure 5</p>
                  </title>
                  <caption>
                     <p>Statistical significance of correlation (Z-scores) present in the metabolic networks</p>
                  </caption>
                  <text>
                     <p><b>Statistical significance of correlation (Z-scores) present in the metabolic networks. </b>To improve statistics, the connectivities in all eleven panels of this figure were logarithmically binned into two bins per decade. Statistically significant correlation profiles are generated using the Matlab program developed by Maslov and Sneppen [13].</p>
                  </text>
                  <graphic file="1471-2105-6-8-5"/>
               </fig>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Group III measure: Network Motif</p>
            </st>
            <p>Network motifs are defined as recurring, non-random building blocks of a network <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>. Just as a sequence motif is an over-represented and biologically meaningful DNA or protein sub-sequence, a network motif is an over-represented and biologically meaningful subgraph.</p>
            <p>Network motifs have been shown to be informative of network design principle(s) and network structure. It was found that over 80% of the nodes in the <it>E. coli </it>transcription regulation network are covered by network motifs <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Dobrin et al. <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> recently discovered that in the <it>E. coli </it>transcriptional regulatory network, "individual motifs aggregate into homologous motif clusters and a supercluster forming the backbone of the network and play a central role in defining its global topological organization." More importantly, network motifs capture information that is likely to be missed by the correlation profiles, because a motif describes the number of appearances of a certain configuration of multiple nodes, and they therefore complement the correlation profiles <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. One might argue that there is a certain amount of overlap between the information the two capture, but the motif profile does not capture the degree information of the connecting nodes, which may be the most powerful feature of the correlation profiles.</p>
            <p>We searched for all 13 three-node subgraphs and all 199 four-node subgraphs in the metabolic networks of the eleven species. The three-node motif profiles found in <it>S. cerevisiae </it>and the six Bacterial species are identical, whereas no three-node motif was found in any of the four Archaeal networks (Fig. <figr fid="F6">6</figr>). Likewise, no four-node motif is shared between the Archaeal species and the <it>S. cerevisiae</it>/Bacterial species, whereas two four-node motifs (id4702, id4950) are shared among the latter (<supplr sid="S1">Additional file 1</supplr>). Of the 13 possible three-node subgraphs, six have one pair of nodes not directly connected. An abundance of such subgraphs will lower the extent of clustering and modularity of the network. As expected, all three-node motifs identified in <it>S. cerevisiae </it>and the six Bacterial species form triangles (Fig. <figr fid="F6">6</figr>). This may explain our main finding that metabolic networks in non-Archaeal species are more clustered and modular than those in Archaeal species.</p>
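            <p>The counts of three-node subgraphs quoted above (13 in total, six of them with one pair of nodes not directly connected) can be reproduced by brute-force enumeration. The sketch below is illustrative only and is unrelated to the motif-detection program of Milo et al.; all function names are hypothetical. It lists every set of arcs on three labelled nodes, keeps the connected ones, and merges those that differ only by a relabelling of the nodes.</p>

```python
from itertools import permutations, product

NODES = (0, 1, 2)
POSSIBLE_ARCS = [(u, v) for u in NODES for v in NODES if u != v]  # 6 arcs


def canonical(arcs):
    """Smallest relabelled arc tuple over all node permutations."""
    forms = []
    for perm in permutations(NODES):
        forms.append(tuple(sorted((perm[u], perm[v]) for u, v in arcs)))
    return min(forms)


def linked_pairs(arcs):
    """Unordered node pairs joined by at least one arc."""
    return {frozenset(arc) for arc in arcs}


def is_connected(arcs):
    """On three nodes, two linked pairs always share a node."""
    return len(linked_pairs(arcs)) >= 2


def is_triangle(arcs):
    """All three node pairs are directly connected."""
    return len(linked_pairs(arcs)) == 3


classes = set()
for mask in product([False, True], repeat=len(POSSIBLE_ARCS)):
    arcs = [arc for arc, keep in zip(POSSIBLE_ARCS, mask) if keep]
    if is_connected(arcs):
        classes.add(canonical(arcs))

open_classes = [c for c in classes if not is_triangle(c)]
print(len(classes), len(open_classes))  # prints: 13 6
```

            <p>The remaining seven classes are triangles, which is the only kind of three-node motif reported for <it>S. cerevisiae </it>and the six Bacterial species.</p>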
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Three-node motifs found in the metabolic networks in different species</p>
               </caption>
               <text>
                  <p><b>Three-node motifs found in the metabolic networks in different species. </b>The number of connecting nodes for each network is shown. For each motif, the numbers of appearances in real networks (<it>N</it><sub><it>real</it></sub>) and in randomized networks (<it>N</it><sub><it>rand </it></sub>&#177; <it>SD</it>, all values rounded) are shown. The <it>p</it>-values of all motifs are less than 0.01, as determined by comparison with 1000 randomized networks. Each motif occurs at least four times in one network. Motifs were detected and generated using the program developed by Milo et al. (2002) and the motif dictionary therein [15].</p>
               </text>
               <graphic file="1471-2105-6-8-6"/>
            </fig>
            <suppl id="S1">
               <title>
                  <p>Additional File 1</p>
               </title>
               <text>
                  <p><b>Four-node motifs found in the metabolic networks in different species. </b>The number of connecting nodes for each network is shown. For each motif, the numbers of appearances in real networks (<it>N</it><sub><it>real</it></sub>) and in randomized networks (<it>N</it><sub><it>rand </it></sub>&#177; <it>SD</it>, all values rounded) are shown. The <it>p</it>-values of all motifs are less than 0.01, as determined by comparison with 1000 randomized networks. Each motif occurs at least four times in one network. Other restrictions apply. Motifs were detected and generated using the program of Milo et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and the motif dictionary therein.</p>
               </text>
               <file name="1471-2105-6-8-S1.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>By comparing network structural properties beyond simple network indices, we were able to gain more insight into the structural differences across the three domains of life. Having shown that the metabolic networks are "scale-free", we further showed that the metabolic networks in the four Archaeal species are closer to an "exponential random network" [9:Ch2, <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>] than those in <it>S. cerevisiae </it>and the six Bacterial species. The reasons are the following:</p>
         <p>First, the Archaeal metabolic networks are visually more homogeneous among themselves than their counterparts in the non-Archaeal species. In random networks, any pair of nodes is equally likely to be connected, so the network topology should look homogeneous provided that the network is large enough. The "scale-free" network, on the other hand, features a highly modular and heterogeneous topology, since the marginal degree is power-law distributed <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. Moreover, the marginal degree distributions of the metabolic networks in non-Archaeal species fit the power-law model better than those in Archaeal species (Table <tblr tid="T2">2</tblr> and Table <tblr tid="T3">3</tblr>).</p>
         <p>Second, the average clustering coefficient and average betweenness of Archaeal metabolic networks are much smaller than those in <it>S. cerevisiae </it>and the six Bacterial species. The same is true for the concentrations of three-node and four-node subgraphs. As pointed out by Watts and Strogatz, real-life networks show strong clustering or network transitivity, while exponential random networks do not <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
         <p>Third, no three-node motifs and fewer four-node motifs were found in Archaeal metabolic networks than in non-Archaeal metabolic networks. In particular, the ubiquitous feed-forward loop (FFL) motif, which is found in networks ranging from biology (including the metabolic networks in <it>S. cerevisiae </it>and the six Bacterial species in this study) to neurology and engineering, was not found in any of the four Archaeal metabolic networks (Fig. <figr fid="F6">6</figr>). Since motifs are subgraphs that are statistically significant relative to "properly" randomized network ensembles, finding no motifs or fewer motifs than usual in a real-life network indicates that its structure is closer to that of a random network. Milo et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> showed that the concentration of the FFL motif is insensitive to network size within the <it>E. coli </it>transcription regulation network, but diminishes to zero in increasingly large random networks. This also supports the view that Archaeal metabolic networks are closer to randomized network ensembles than other real-life networks are.</p>
         <p>The metabolic networks in Archaea are both "random-like" and "scale-free", which might exert a profound influence on their adaptability to hostile environments. Archaeal species are typically restricted to marginal habitats such as hot springs or areas of low oxygen concentration, and can assimilate different kinds of inorganic carbon and nitrogen sources. Indeed, the chemical structure and composition of macromolecules such as proteins and lipids make significant contributions to an organism's adaptability to its environment. The seemingly <it>ad hoc </it>network organization (both "random-like" and "scale-free") in Archaeal species might also have enabled them to survive in those extreme physiological conditions. Archaeal species might employ some biologically significant subgraphs (rather than statistically significant motifs) that cannot be detected by the current motif-searching algorithm <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. This would make the Archaeal metabolic networks appear random in a statistical sense (not statistically significantly different from random networks) but not in a biological sense.</p>
         <p>Our comparison results showed that many network structural properties measured in Archaeal species are different from those of non-Archaeal species. However, hidden anthropogenic factors might account for some of the differences observed. Specifically, the drastic differences in topological profiles between the metabolic networks of Archaeal and non-Archaeal species may be partially explained by the fact that metabolic pathways have been studied significantly less extensively in Archaeal species <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. The robustness of topological profiles against random perturbations can alleviate this impact to a certain extent but cannot eliminate it <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>Our network analysis results showed that, in most of the higher-level (global) topological properties measured, the metabolic networks in the four Archaeal species are similar to each other but significantly different from those in <it>S. cerevisiae </it>and the six Bacterial species. This provides further evidence that the metabolic network structures, and consequently the design principle(s), in the four Archaeal species are very different from those in <it>S. cerevisiae </it>(Eukaryote) and the six Bacterial species. Our finding that the metabolic networks in Archaeal species possess many properties of the exponential random network calls for a better understanding of the design principle(s) in biological networks, which may be revealed by further systematic analyses. For example, one could locate and align conserved pathways such as glycolysis between <it>E. coli </it>or <it>S. cerevisiae </it>and Archaeal species to understand the functional mechanisms of Archaeal metabolic networks.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Data source</p>
            </st>
            <p>Chemical reaction data were obtained from the metabolic database of Ma and Zeng <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, which consists of five related tables: <it>reaction</it>, <it>enzyme</it>, <it>react</it>, <it>connect </it>and <it>organism</it>. We compiled a new table from this database that excludes any inconsistent or redundant connections between metabolites (details below). SQL was used to query the database.</p>
         </sec>
         <sec>
            <st>
               <p>Identify and remove inconsistency</p>
            </st>
            <p>Inconsistent connections refer to pairs of metabolites that have conflicting reversibility annotations. They arise because a pair of metabolites can take part in more than one reaction, and the reversibility of these reactions can differ. For example, NAD<sup>+ </sup>and Nicotinamide are a pair of metabolites in two reactions: 1) <b>NAD</b><sup>+ </sup>+ L-Arginine = <b>Nicotinamide </b>+ N<sub>2 </sub>(ADP-D-ribosyl)-L-arginine 2) <b>NAD</b><sup>+ </sup>+ H<sub>2</sub>O -> <b>Nicotinamide </b>+ ADPribose. (Note that NAD<sup>+ </sup>does not act as a "current" metabolite here, and hence connections established through it should not be removed.) Reaction 1 is reversible while reaction 2 is not. We annotated an edge between the two metabolites as long as there was at least one reversible reaction in which both of them were involved. For example, the connection between <b>NAD</b><sup>+ </sup>and <b>Nicotinamide </b>is an edge (undirected connection). This step can be summarized as "edge &#8592; edge + arc".</p>
         </sec>
         <sec>
            <st>
               <p>Identify and remove redundancy</p>
            </st>
            <p>There are also numerous redundant connections, where the same pair of metabolites switch their roles between substrate and product in two or more different irreversible reactions. For example: 1) UDPglucose + <b>N-Acylsphingosine </b>-> UDP + <b>Glucosylceramide </b>2) <b>Glucosylceramide </b>+ H<sub>2</sub>O -> D-Glucose + <b>N-Acylsphingosine</b>. (<b>N-Acylsphingosine </b>and <b>Glucosylceramide </b>are a pair of metabolites that switch their roles in two irreversible reactions.) In case of redundancy, we annotated an edge between the pair of metabolites rather than two arcs, because they can be converted to each other through the two reactions. This step can be summarized as "edge &#8592; arc + arc".</p>
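            <p>The two annotation rules above can be illustrated with a minimal Python sketch. It is not the SQL used in this study: the function name <it>merge_connections </it>and the (substrate, product, reversible) tuple layout are assumptions made for illustration only.</p>

```python
def merge_connections(reactions):
    """Collapse reaction-level links into one connection per metabolite pair.

    reactions: iterable of (substrate, product, reversible) tuples
               (a hypothetical layout, not the schema of the source database).
    Returns a dict whose values are "edge" (undirected) or "arc" (directed).
    Edge keys are alphabetically sorted pairs; arc keys are (source, target).
    """
    reversible_pairs = set()
    directed = set()
    for substrate, product, reversible in reactions:
        if reversible:
            reversible_pairs.add(tuple(sorted((substrate, product))))
        else:
            directed.add((substrate, product))

    connections = {}
    for pair in reversible_pairs:
        connections[pair] = "edge"
    for source, target in directed:
        pair = tuple(sorted((source, target)))
        if pair in reversible_pairs:
            # Inconsistency rule (edge + arc gives edge): the reversible link wins.
            continue
        if (target, source) in directed:
            # Redundancy rule (arc + arc gives edge): two opposite arcs merge.
            connections[pair] = "edge"
        else:
            connections[(source, target)] = "arc"
    return connections


example = merge_connections([
    ("NAD+", "Nicotinamide", True),
    ("NAD+", "Nicotinamide", False),
    ("N-Acylsphingosine", "Glucosylceramide", False),
    ("Glucosylceramide", "N-Acylsphingosine", False),
])
```

            <p>In this sketch both example pairs from the text end up annotated as edges, while a pair linked only by a single irreversible reaction would remain an arc.</p>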
         </sec>
         <sec>
            <st>
               <p>Definitions of some network topological measurements</p>
            </st>
            <sec>
               <st>
                  <p>Clustering coefficient (C)</p>
               </st>
               <p>We define two kinds of clustering coefficients for each node in the directed metabolic networks, i.e. <it>C</it><sub><it>in </it></sub>and <it>C</it><sub><it>out</it></sub>. <it>C</it><sub><it>in </it></sub>measures the average clustering coefficient of the node representing the product that can be generated from its first-order "nearest neighbors" through chemical reactions. <it>C</it><sub><it>out </it></sub>measures the average clustering coefficient of the node that generates its first-order "nearest neighbors" through chemical reactions. The larger the coefficients, the more clustered and modular the network appears to be.</p>
            </sec>
            <sec>
               <st>
                  <p>Betweenness (B)</p>
               </st>
               <p>The betweenness for any node <it>n</it><sub><it>i </it></sub>in the network is defined as <graphic file="1471-2105-6-8-i5.gif"/>, where <it>g</it><sub><it>jk </it></sub>is the number of shortest paths between node <it>j </it>and node <it>k</it>, <it>g</it><sub><it>jk</it></sub>(<it>n</it><sub><it>i</it></sub>) is the number of those shortest paths that contain node <it>n</it><sub><it>i</it></sub>, and <it>g </it>is the total number of nodes with edges/arcs. <it>C</it><sub><it>B</it></sub>(<it>n</it><sub><it>i</it></sub>) needs to be multiplied by two in the case of a directed network <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. The average betweenness is defined as <graphic file="1471-2105-6-8-i6.gif"/>. A higher value of betweenness indicates that the network is more clustered and modular.</p>
            </sec>
            <sec>
               <st>
                  <p>Average path length (L)</p>
               </st>
               <p>Watts and Strogatz <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> defined the average path length as <graphic file="1471-2105-6-8-i7.gif"/>, where <it>d</it>(<it>j</it>, <it>k</it>) is the shortest path length between node <it>j </it>and node <it>k </it>(distance), <it>V </it>represents the set of all nodes with edges/arcs of the graph, and <it>g </it>is the number of nodes with edges/arcs.</p>
            </sec>
            <sec>
               <st>
                  <p>Diameter (D)</p>
               </st>
               <p>The diameter of the directed graph <it>G </it>is the longest geodesic between any pair of nodes, where the geodesic is the shortest path between a pair of nodes. Pajek <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> was used to calculate the average betweenness, average path length and diameter.</p>
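               <p>As an illustration of these two distance-based measures, the following Python sketch computes geodesics by breadth-first search. It is not the Pajek implementation; the function names are hypothetical, an undirected edge is assumed to be supplied as two opposite arcs, and the average is assumed to be taken over reachable ordered pairs only, which may differ from the normalization in the formula above.</p>

```python
from collections import deque


def shortest_path_lengths(arcs, source):
    """Breadth-first search: geodesic length from source to each reachable node."""
    successors = {}
    for tail, head in arcs:
        successors.setdefault(tail, []).append(head)
    distance = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbour in successors.get(node, []):
            if neighbour not in distance:
                distance[neighbour] = distance[node] + 1
                queue.append(neighbour)
    return distance


def path_length_and_diameter(arcs):
    """Return (average geodesic length, diameter) over reachable ordered pairs."""
    nodes = {node for arc in arcs for node in arc}
    lengths = []
    for source in nodes:
        for target, d in shortest_path_lengths(arcs, source).items():
            if target != source:
                lengths.append(d)
    if not lengths:
        return 0.0, 0
    return sum(lengths) / len(lengths), max(lengths)


# A three-node directed chain a -> b -> c.
average_length, diameter = path_length_and_diameter([("a", "b"), ("b", "c")])
```

               <p>Unreachable pairs are simply skipped in this sketch; how such pairs are treated affects both measures in sparse directed networks.</p>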
            </sec>
         </sec>
         <sec>
            <st>
               <p>Concentration of subgraphs (S)</p>
            </st>
            <p>Wasserman and Faust <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> defined the subgraph as follows: a graph <it>G</it><sub><it>s </it></sub>is a subgraph of <it>G </it>if the set of nodes of <it>G</it><sub><it>s </it></sub>is a subset of the set of nodes of <it>G</it>, and the set of lines in <it>G</it><sub><it>s </it></sub>is a subset of the lines in the graph <it>G</it>. Let <it>M </it>be the number of subgraphs, and <it>N </it>be the number of nodes with edges or arcs. Then the "concentration of subgraphs" is defined as <it>S </it>= <it>M/N</it>. A high value of <it>S </it>indicates that the network is more clustered and modular. Mfinder1.1 <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> was used to calculate both <it>M </it>and <it>N</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Marginal degree distribution calculations</p>
            </st>
            <p>The marginal degree distribution of each network is calculated from the Boolean adjacency matrix <it>A</it>, a matrix of 0s and 1s. An entry of 0 means that there is no connection between the two nodes, and an entry of 1 means that there is one. The outgoing degree of the node <it>i</it>, <it>k</it><sub><it>out</it>(<it>i</it>) </sub>is defined as <graphic file="1471-2105-6-8-i8.gif"/>, where <graphic file="1471-2105-6-8-i9.gif"/>. The incoming degree of the node <it>i</it>, <it>k</it><sub><it>in</it>(<it>i</it>) </sub>is defined as <graphic file="1471-2105-6-8-i10.gif"/>.</p>
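            <p>A minimal Python sketch of this calculation is given below. It assumes that <it>A</it>[<it>i</it>][<it>j</it>] = 1 denotes an arc from node <it>i </it>to node <it>j</it>, so that outgoing degrees are row sums and incoming degrees are column sums; this orientation, the function names and the exclusion of zero-degree nodes from the distribution are assumptions made for illustration.</p>

```python
from collections import Counter


def marginal_degrees(adjacency):
    """Return (out_degrees, in_degrees) for a square 0/1 adjacency matrix.

    Assumes adjacency[i][j] == 1 means an arc from node i to node j.
    """
    n = len(adjacency)
    out_degrees = [sum(adjacency[i]) for i in range(n)]
    in_degrees = [sum(adjacency[j][i] for j in range(n)) for i in range(n)]
    return out_degrees, in_degrees


def degree_distribution(degrees):
    """Empirical P(k): fraction of nodes with degree k, among nodes with k >= 1."""
    counts = Counter(k for k in degrees if k > 0)
    total = sum(counts.values())
    return {k: count / total for k, count in sorted(counts.items())}


# Toy network with arcs 0 -> 1, 0 -> 2 and 1 -> 2.
toy_matrix = [
    [0, 1, 1],
    [0, 0, 1],
    [0, 0, 0],
]
toy_out, toy_in = marginal_degrees(toy_matrix)
toy_out_distribution = degree_distribution(toy_out)
```

            <p>Zero-degree nodes are dropped here only because a later logarithmic transformation of the degree is undefined at zero.</p>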
         </sec>
         <sec>
            <st>
               <p>Simple regression analyses of marginal degree distributions</p>
            </st>
            <p>The power-law degree model was first log-transformed into a linear model, i.e. log <it>P</it>(<it>K</it><sub><it>i</it></sub>) = <it>&#947; </it>log(<it>K</it><sub><it>i</it></sub>) + log(<it>&#945;</it>) + <it>&#949;</it><sub><it>i </it></sub>(<it>i </it>= <it>1,2,...,n</it>), where <it>&#947; </it>and <it>&#945; </it>are parameters, <it>&#949;</it><sub><it>i </it></sub>is the residual, <it>K</it><sub><it>i </it></sub>is the degree and <it>P</it>(<it>K</it><sub><it>i</it></sub>) is the corresponding probability. Based on the fitted linear model, we made statistical inferences, including parameter estimation and individual confidence intervals on the estimates, using the least-squares method.</p>
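            <p>The regression step can be sketched in Python as an ordinary least-squares fit on the log-transformed data. This is an illustration rather than the code used in this study: the function name <it>fit_power_law </it>is hypothetical, natural logarithms are assumed (the slope does not depend on the base), and the confidence intervals mentioned above are not computed.</p>

```python
import math


def fit_power_law(degrees, probabilities):
    """Least-squares fit of log P(K) = gamma * log K + log alpha.

    Returns (gamma, alpha, r_squared). All degrees and probabilities
    must be strictly positive so that their logarithms are defined.
    """
    x = [math.log(k) for k in degrees]
    y = [math.log(p) for p in probabilities]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    gamma = sxy / sxx
    intercept = mean_y - gamma * mean_x
    residual_ss = sum((yi - (intercept + gamma * xi)) ** 2 for xi, yi in zip(x, y))
    total_ss = sum((yi - mean_y) ** 2 for yi in y)
    r_squared = 1.0 - residual_ss / total_ss
    return gamma, math.exp(intercept), r_squared


# Synthetic data following P(K) = 0.5 * K ** -2 exactly.
synthetic_degrees = [1, 2, 4, 8]
synthetic_probabilities = [0.5 * k ** -2 for k in synthetic_degrees]
gamma_hat, alpha_hat, fit_quality = fit_power_law(synthetic_degrees, synthetic_probabilities)
```

            <p>On noise-free synthetic data the estimates recover the generating parameters; on real degree distributions the coefficient of determination gives a rough indication of how well the power-law model fits.</p>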
         </sec>
         <sec>
            <st>
               <p>Correlation profile calculations</p>
            </st>
            <p>Statistically significant correlation profiles were calculated using Matlab code downloaded from Dr. Maslov's website <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. The adjacency matrix of the network is the input.</p>
         </sec>
         <sec>
            <st>
               <p>Motif profiles calculations</p>
            </st>
            <p>According to Milo et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, a subgraph is referred to as a motif if the following criteria are met: 1) Its empirical <it>p</it>-value is smaller than a pre-specified threshold, e.g. 0.01. 2) The number of appearances in real networks with distinct sets of nodes is larger than another pre-specified cut-off value, e.g. 4. 3) The number of appearances in real networks is significantly larger than that in randomized networks, i.e. <graphic file="1471-2105-6-8-i11.gif"/>. <it>N</it><sub><it>real </it></sub>and <it>N</it><sub><it>rand </it></sub>represent the number of certain subgraphs detected in the real-life network and in the randomized networks, respectively. The third criterion avoids the situation where some common subgraphs are detected as motifs even though they show only slight differences between <it>N</it><sub><it>real </it></sub>and <it>N</it><sub><it>rand</it></sub>, because their counts have a narrow spread in randomized networks <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>. Motif profiles were generated using the Mfinder program. This program and the motif dictionary were downloaded from the website of Dr. Uri Alon's group <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>.</p>
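            <p>The three criteria can be combined into a single test, sketched below in Python. This is not the Mfinder code. The function name and its parameters are hypothetical; the empirical <it>p</it>-value is assumed to be the fraction of randomized networks in which the subgraph appears at least as often as in the real network; the cut-off of criterion 2 is read as "at least", following the figure legend; and the default excess ratio of 0.1 in criterion 3 is an assumption standing in for the threshold given in the formula above.</p>

```python
from statistics import mean


def is_motif(n_real, n_rand_counts, n_distinct,
             p_threshold=0.01, min_distinct=4, min_excess_ratio=0.1):
    """Apply the three motif criteria to one subgraph type.

    n_real:        appearances of the subgraph in the real network
    n_rand_counts: appearances in each randomized network
    n_distinct:    appearances in the real network on distinct node sets
    The three keyword defaults are illustrative assumptions.
    """
    mean_rand = mean(n_rand_counts)
    # Criterion 1: empirical p-value below the threshold.
    at_least_as_many = sum(1 for count in n_rand_counts if count >= n_real)
    p_value = at_least_as_many / len(n_rand_counts)
    significant = p_threshold > p_value
    # Criterion 2: enough appearances on distinct sets of nodes.
    frequent = n_distinct >= min_distinct
    # Criterion 3: real count exceeds the random mean by a relative margin.
    large_excess = n_real - mean_rand > min_excess_ratio * mean_rand
    return significant and frequent and large_excess


# 1000 randomized networks whose counts cycle through 9 to 12.
random_counts = [10, 12, 9, 11] * 250
```

            <p>With these inputs a subgraph observed 40 times on distinct node sets would pass all three tests, whereas one observed 11 times would fail the first criterion because many randomized networks reach that count.</p>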
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>DZ and ZSQ conceived and designed the study; DZ wrote the computer code, analyzed the data and drafted the manuscript. Both authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Drs. Hong-Wu Ma and An-Ping Zeng for their compiled metabolic database, Dr. Kerby A. Shedden for valuable discussion, and the two anonymous reviewers for their constructive comments.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The universal ancestor</p>
            </title>
            <aug>
               <au>
                  <snm>Woese</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>6854</fpage>
            <lpage>6859</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">22660</pubid>
                  <pubid idtype="pmpid" link="fulltext">9618502</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.12.6854</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Phylogenetic classification and the universal tree</p>
            </title>
            <aug>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1999</pubdate>
            <volume>284</volume>
            <fpage>2124</fpage>
            <lpage>2129</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.284.5423.2124</pubid>
                  <pubid idtype="pmpid" link="fulltext">10381871</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Mosaic bacterial chromosomes: a challenge en route to a tree of genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Martin</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Bioessays</source>
            <pubdate>1999</pubdate>
            <volume>21</volume>
            <fpage>99</fpage>
            <lpage>104</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10193183</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Molecular networks: the top-down view</p>
            </title>
            <aug>
               <au>
                  <snm>Bray</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>301</volume>
            <fpage>1864</fpage>
            <lpage>1865</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1089118</pubid>
                  <pubid idtype="pmpid" link="fulltext">14512614</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Comparable system-level organization of Archaea and Eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Podani</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>ZN</fnm>
               </au>
               <au>
                  <snm>Jeong</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tombor</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Barab&#225;si</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Szathm&#225;ry</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>54</fpage>
            <lpage>56</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng708</pubid>
                  <pubid idtype="pmpid" link="fulltext">11528391</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Phylogenetic comparison of metabolic capacities of organisms at genome level</p>
            </title>
            <aug>
               <au>
                  <snm>Ma</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>AP</fnm>
               </au>
            </aug>
            <source>Mol Phylogenet Evol</source>
            <pubdate>2004</pubdate>
            <volume>31</volume>
            <fpage>204</fpage>
            <lpage>213</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ympev.2003.08.011</pubid>
                  <pubid idtype="pmpid" link="fulltext">15019620</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Scale-free networks in biology: new insights into the fundamentals of evolution?</p>
            </title>
            <aug>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Karev</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Bioessays</source>
            <pubdate>2002</pubdate>
            <volume>24</volume>
            <fpage>105</fpage>
            <lpage>109</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/bies.10059</pubid>
                  <pubid idtype="pmpid" link="fulltext">11835273</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Statistical mechanics of complex networks</p>
            </title>
            <aug>
               <au>
                  <snm>Albert</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Barab&#225;si</snm>
                  <fnm>AL</fnm>
               </au>
            </aug>
            <source>Rev Mod Phys</source>
            <pubdate>2002</pubdate>
            <volume>74</volume>
            <fpage>47</fpage>
            <lpage>97</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1103/RevModPhys.74.47</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <aug>
               <au>
                  <snm>Bornholdt</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>HG</fnm>
               </au>
            </aug>
            <source>Handbook of Graphs and Networks: From the Genome to the Internet</source>
            <publisher>Weinheim: Wiley-Vch</publisher>
            <pubdate>2003</pubdate>
         </bibl>
         <bibl id="B10">
            <aug>
               <au>
                  <snm>Pemmaraju</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Skiena</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Computational discrete mathematics: Combinatorics and Graph Theory with Mathematica&#174;</source>
            <publisher>Cambridge: Cambridge University Press</publisher>
            <pubdate>2003</pubdate>
         </bibl>
         <bibl id="B11">
            <title>
               <p>The structure and function of complex networks</p>
            </title>
            <aug>
               <au>
                  <snm>Newman</snm>
                  <fnm>MEJ</fnm>
               </au>
            </aug>
            <source>SIAM Review</source>
            <pubdate>2003</pubdate>
            <volume>45</volume>
            <fpage>167</fpage>
            <lpage>256</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1137/S003614450342480</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Collective dynamics of 'small-world' networks</p>
            </title>
            <aug>
               <au>
                  <snm>Watts</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Strogatz</snm>
                  <fnm>SH</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1998</pubdate>
            <volume>393</volume>
            <fpage>440</fpage>
            <lpage>442</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/30918</pubid>
                  <pubid idtype="pmpid" link="fulltext">9623998</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Specificity and stability in topology of protein networks</p>
            </title>
            <aug>
               <au>
                  <snm>Maslov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sneppen</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <fpage>910</fpage>
            <lpage>913</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1065103</pubid>
                  <pubid idtype="pmpid" link="fulltext">11988575</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Network motifs in the transcriptional regulation network of Escherichia coli</p>
            </title>
            <aug>
               <au>
                  <snm>Shen-Orr</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Milo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mangan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Alon</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <fpage>64</fpage>
            <lpage>68</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng881</pubid>
                  <pubid idtype="pmpid" link="fulltext">11967538</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Network motifs: simple building blocks of complex networks</p>
            </title>
            <aug>
               <au>
                  <snm>Milo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Shen-Orr</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Itzkovitz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kashtan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Chklovskii</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Alon</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>824</fpage>
            <lpage>827</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.298.5594.824</pubid>
                  <pubid idtype="pmpid" link="fulltext">12399590</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Aggregation of topological motifs in the Escherichia coli transcriptional regulatory network</p>
            </title>
            <aug>
               <au>
                  <snm>Dobrin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Beg</snm>
                  <fnm>QK</fnm>
               </au>
               <au>
                  <snm>Barab&#225;si</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>ZN</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>10</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">357809</pubid>
                  <pubid idtype="pmpid" link="fulltext">15018656</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-5-10</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Eukaryote genome duplication &#8211; where's the evidence?</p>
            </title>
            <aug>
               <au>
                  <snm>Skrabanek</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>1998</pubdate>
            <volume>8</volume>
            <fpage>694</fpage>
            <lpage>700</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-437X(98)80039-7</pubid>
                  <pubid idtype="pmpid">9914206</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Molecular evidence for an ancient duplication of the entire yeast genome</p>
            </title>
            <aug>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Shields</snm>
                  <fnm>DC</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>387</volume>
            <fpage>708</fpage>
            <lpage>713</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/42711</pubid>
                  <pubid idtype="pmpid" link="fulltext">9192896</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Uetz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Giot</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Cagney</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mansfield</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Judson</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Knight</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Lockshon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Narayan</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Srinivasan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pochart</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Qureshi-Emili</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Godwin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Conover</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kalbfleisch</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Vijayadamodar</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fields</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rothberg</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <fpage>623</fpage>
            <lpage>627</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35001009</pubid>
                  <pubid idtype="pmpid" link="fulltext">10688190</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>A comprehensive two-hybrid analysis to explore the yeast protein interactome</p>
            </title>
            <aug>
               <au>
                  <snm>Ito</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Chiba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ozawa</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yoshida</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sakaki</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>4569</fpage>
            <lpage>4574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">31875</pubid>
                  <pubid idtype="pmpid" link="fulltext">11283351</pubid>
                  <pubid idtype="doi">10.1073/pnas.061034498</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Systematic genetic analysis with ordered arrays of yeast deletion mutants</p>
            </title>
            <aug>
               <au>
                  <snm>Tong</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Evangelista</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Parsons</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bader</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Page</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Raghibizadeh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hogue</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Bussey</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Andrews</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Tyers</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Boone</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>294</volume>
            <fpage>2364</fpage>
            <lpage>2368</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1065810</pubid>
                  <pubid idtype="pmpid" link="fulltext">11743205</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Transcriptional regulatory networks in <it>Saccharomyces cerevisiae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Rinaldi</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Odom</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gerber</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Hannett</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Harbison</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Zeitlinger</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jennings</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Murray</snm>
                  <fnm>HL</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Ren</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wyrick</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Tagne</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Volkert</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Fraenkel</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Gifford</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>799</fpage>
            <lpage>804</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1075090</pubid>
                  <pubid idtype="pmpid" link="fulltext">12399584</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Metabolic pathways in the post-genomic era</p>
            </title>
            <aug>
               <au>
                  <snm>Papin</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Price</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Wiback</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Fell</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Palsson</snm>
                  <fnm>BO</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>2003</pubdate>
            <volume>28</volume>
            <fpage>250</fpage>
            <lpage>258</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0968-0004(03)00064-1</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The small world of metabolism</p>
            </title>
            <aug>
               <au>
                  <snm>Fell</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2000</pubdate>
            <volume>18</volume>
            <fpage>1121</fpage>
            <lpage>1122</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/81025</pubid>
                  <pubid idtype="pmpid" link="fulltext">11062388</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>The large-scale organization of metabolic networks</p>
            </title>
            <aug>
               <au>
                  <snm>Jeong</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tombor</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Albert</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>ZN</fnm>
               </au>
               <au>
                  <snm>Barab&#225;si</snm>
                  <fnm>AL</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>407</volume>
            <fpage>651</fpage>
            <lpage>654</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35036627</pubid>
                  <pubid idtype="pmpid" link="fulltext">11034217</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>The connectivity structure, giant strong component and centrality of metabolic networks</p>
            </title>
            <aug>
               <au>
                  <snm>Ma</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>AP</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <fpage>1423</fpage>
            <lpage>1430</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg177</pubid>
                  <pubid idtype="pmpid" link="fulltext">12874056</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <aug>
               <au>
                  <snm>Wasserman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Faust</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Social Network Analysis: Methods and Applications</source>
            <publisher>Cambridge: Cambridge University Press</publisher>
            <pubdate>1994</pubdate>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Reconstruction of metabolic networks from genome data and analysis of their global structure for various organisms</p>
            </title>
            <aug>
               <au>
                  <snm>Ma</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>AP</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <fpage>270</fpage>
            <lpage>277</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/19.2.270</pubid>
                  <pubid idtype="pmpid" link="fulltext">12538249</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Functional and topological characterization of protein interaction networks</p>
            </title>
            <aug>
               <au>
                  <snm>Yook</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>ZN</fnm>
               </au>
               <au>
                  <snm>Barab&#225;si</snm>
                  <fnm>AL</fnm>
               </au>
            </aug>
            <source>Proteomics</source>
            <pubdate>2004</pubdate>
            <volume>4</volume>
            <fpage>928</fpage>
            <lpage>942</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/pmic.200300636</pubid>
                  <pubid idtype="pmpid" link="fulltext">15048975</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The topology of transcription regulatory network in the yeast, <it>Saccharomyces cerevisiae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Farkas</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Jeong</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Vicsek</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Barab&#225;si</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>ZN</fnm>
               </au>
            </aug>
            <source>Physica A</source>
            <pubdate>2003</pubdate>
            <volume>318</volume>
            <fpage>601</fpage>
            <lpage>612</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-4371(02)01731-4</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Lethality and centrality in protein networks</p>
            </title>
            <aug>
               <au>
                  <snm>Jeong</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Mason</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Barab&#225;si</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>ZN</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>411</volume>
            <fpage>41</fpage>
            <lpage>42</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35075138</pubid>
                  <pubid idtype="pmpid" link="fulltext">11333967</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Novel archaeal alanine:glyoxylate aminotransferase from <it>Thermococcus litoralis</it></p>
            </title>
            <aug>
               <au>
                  <snm>Sakuraba</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kawakami</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Takahashi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Ohshima</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2004</pubdate>
            <volume>186</volume>
            <fpage>5513</fpage>
            <lpage>5518</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">490878</pubid>
                  <pubid idtype="pmpid" link="fulltext">15292154</pubid>
                  <pubid idtype="doi">10.1128/JB.186.16.5513-5518.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Pajek &#8211; Program for large network analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Batagelj</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Mrvar</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Connections</source>
            <pubdate>1998</pubdate>
            <volume>21</volume>
            <fpage>47</fpage>
            <lpage>57</lpage>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Dr. Maslov's Matlab Programs for Random Rewiring and Correlation Profiles of a Complex Network</p>
            </title>
            <url>http://www.cmth.bnl.gov/~maslov/matlab.htm</url>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Dr. Uri Alon Lab Homepage</p>
            </title>
            <url>http://www.weizmann.ac.il/mcb/UriAlon/</url>
         </bibl>
      </refgrp>
   </bm>
</art>
