<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>1471-2091-12-42</ui><ji>1471-2091</ji><fm>
<dochead>Research article</dochead>
<bibl>
<title>
<p>
<it>BKM-react</it>, an integrated biochemical reaction database</p>
</title>
<aug>
<au id="A1" ce="yes"><snm>Lang</snm><fnm>Maren</fnm><insr iid="I1"/><email>M.Lang@tu-bs.de</email></au>
<au id="A2" ce="yes"><snm>Stelzer</snm><fnm>Michael</fnm><insr iid="I1"/><email>M.Stelzer@tu-bs.de</email></au>
<au ca="yes" id="A3"><snm>Schomburg</snm><fnm>Dietmar</fnm><insr iid="I1"/><email>D.Schomburg@tu-bs.de</email></au>
</aug>
<insg>
<ins id="I1"><p>Department of Bioinformatics and Biochemistry, Institute for Biochemistry and Biotechnology, Technische Universit&#228;t Braunschweig, Langer Kamp 19 B, 38106 Braunschweig, Germany</p></ins>
</insg>
<source>BMC Biochemistry</source>
<issn>1471-2091</issn>
<pubdate>2011</pubdate>
<volume>12</volume>
<issue>1</issue>
<fpage>42</fpage>
<url>http://www.biomedcentral.com/1471-2091/12/42</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2091-12-42</pubid><pubid idtype="pmpid">21824409</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>10</day><month>1</month><year>2011</year></date></rec><acc><date><day>8</day><month>8</month><year>2011</year></date></acc><pub><date><day>8</day><month>8</month><year>2011</year></date></pub></history>
<cpyrt><year>2011</year><collab>Lang et al; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>The systematic, complete and correct reconstruction of genome-scale metabolic networks or metabolic pathways is one of the most challenging tasks in systems biology research. An essential requirement is the access to the complete biochemical knowledge - especially on the biochemical reactions. This knowledge is extracted from the scientific literature and collected in biological databases. Since the available databases differ in the number of biochemical reactions and the annotation of the reactions, an integrated knowledge resource would be of great value.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>We developed a comprehensive non-redundant reaction database containing known enzyme-catalyzed and spontaneous reactions. Currently, it comprises 18,172 unique biochemical reactions. As source databases the biochemical databases <it>BRENDA</it>, <it>KEGG</it>, and <it>MetaCyc </it>were used. Reactions of these databases were matched and integrated by aligning substrates and products. For the latter a two-step comparison using their structures (<it>via InChIs</it>) and names was performed. Each biochemical reaction given as a reaction equation occurring in at least one of the databases was included.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>An integrated non-redundant reaction database has been developed and is made available to users. The database can significantly facilitate and accelerate the construction of accurate biochemical models.</p>
</sec>
</sec>
</abs>
</fm><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>For the construction of cellular models, the development of organism-specific reaction networks is essential. A number of sources for biochemical reactions exist, as the databases <it>BRENDA </it>
<abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>, <it>KEGG </it>
<abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp>, and <it>MetaCyc </it>
<abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp>. In general, the integration of biological databases is not trivial <abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp>. Due to the fact that the completeness of reaction data differs between the databases, it becomes important to combine the available reaction information of the used source databases in form of an integrated reaction database.</p>
<p>So a combination will lead to more complete and reliable metabolic networks. Therefore it is necessary to find identical reactions between the recognized databases. As different compound names and compound IDs, as well as reaction IDs, are in use within the described biochemical reactions a comparison is far from straightforward. A major obstacle results from the use of generic compound names, <it>e.g. 'an aldehyde' </it>or <it>'an alcohol'</it>. Furthermore some reactions even occur in the same database twice with different reaction IDs.</p>
<p>Integrated databases exist for diverse biological topics. The <it>TRANSPATH<sup>&#174; </sup>
</it>database for example is an integrated database which deals with signal transduction information <abbrgrp>
<abbr bid="B5">5</abbr>
</abbrgrp>. As an example for an integrated metabolic database system the database <it>BioSilico </it>can be mentioned here <abbrgrp>
<abbr bid="B6">6</abbr>
</abbrgrp>. For creation of this database, information of the metabolic databases <it>KEGG</it>, <it>ENZYME </it>
<abbrgrp>
<abbr bid="B7">7</abbr>
</abbrgrp>, <it>EcoCyc </it>
<abbrgrp>
<abbr bid="B8">8</abbr>
</abbrgrp>, and <it>MetaCyc </it>was combined, the latter two building parts of <it>BioCyc </it>
<abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp>. The database <it>BioSilico </it>includes information on enzymes, biochemical compounds, and reactions. Radrich <it>et al. </it>
<abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp> provide a semi-automated tool for the process of genome-scale network reconstruction demonstrated on the basis of data for <it>Arabidopsis thaliana</it>. Their integrated data set is built on the two sources <it>KEGG </it>and <it>AraCyc </it>
<abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>. Furthermore a reaction database on human biological pathways and processes named <it>Reactome </it>
<abbrgrp>
<abbr bid="B12">12</abbr>
</abbrgrp> exists as well as an annotated reaction database called <it>Rhea </it>
<abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp>, basically a modified version of the reactions defined in the <it>IUBMB </it>enzyme list <abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp>. A collection of biochemical reactions and pathways in printed form contains the book <it>Biochemical Pathways: An Atlas of Biochemistry and Molecular Biology </it>
<abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Methods</p>
</st>
<p>In this work information from the biological databases <it>BRENDA </it>
<abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>, <it>KEGG </it>
<abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp>, and <it>MetaCyc </it>
<abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp> was used (May 2011). Reaction comparisons were done by an <it>in silico </it>approach in which two steps, first a comparison of reactant structures using <it>InChIs </it>(linearized chemical structure descriptors <abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp>) and, second, a compound name comparison (incl. synonyms), were combined. An <it>InChI </it>structure coding was generated based on an original <it>Molfile </it>(contains molecular structure information <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>) by using a special converting tool (<it>InChI version 1 (software version 1.03) for Standard and Non-Standard InChI/InChIKey </it>
<abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp>). By using only relevant layers of an self-generated <it>InChI</it>, a higher matching rate was achieved. For this purpose we dropped the <it>InChI </it>layers dependent on the ionisation state so that <it>e.g. </it>acetic acid and the acetate ion were considered to be the same compound. Reactions without <it>EC </it>numbers were included as well as those reactions with incomplete <it>EC </it>numbers. Spontaneous reactions without <it>EC </it>number were labelled <it>SPONTANEOUS</it>. Before the comparison, the compounds water (H<sub>2</sub>O) and proton (H<sup>+</sup>) were removed from the reactions. Additionally, a stoichiometry check was executed. This information was added as attribute to the reactions in the database as a quality measure. Stoichiometrically imbalanced reactions were marked as <it>incomplete </it>in the column <it>Stoichiometry</it>, except in cases where only a proton or water is missing. In two supplemental columns the incomplete cases are differentiated into <it>Missing Substrate </it>and <it>Missing Product</it>.</p>
<p>For the compound name based comparison step all found synonyms were used as well as generated '<it>DAYLIGHT names</it>' (<it>Chemical Information Systems, Inc</it>. <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>). We applied a special conversion that removed most of the common sources of differences in equivalent compound names like hyphens, parentheses, <it>etc</it>. Most of the special characters, except '+' and apostrophe ('), were deleted. For identifying common reactions, all available synonyms and '<it>DAYLIGHT names</it>' (see above) of the compounds are included in form of a link table containing assigned compound IDs. Where possible, <it>KEGG </it>glycan IDs (<it>G </it>number) were exchanged by their corresponding compound IDs (<it>C </it>number). Reactions with NAD(P)/H (<it>BRENDA</it>) and NADP/H_OR_NO_P (<it>MetaCyc</it>) were split into two reactions, one with NADH, the other with NADPH. The reaction ID of the form without phosphate was labelled as the original but with <it>_WOP </it>(= <ul>W</ul>ith<ul>O</ul>ut <ul>P</ul>hosphate) at the end.</p>
<p>Data download, storage, and comparison was realized by <it>C++ </it>as well as <it>Python </it>scripts and embedded <it>MySQL </it>statements. By executing a cron-job in regular time points, the information about metabolites, enzymes, reactions, <it>Molfiles</it>, and <it>InChIs </it>was downloaded from the source databases and so kept up to date automatically.</p>
<p>The access to the integrated database is possible <it>via </it>the link to <it>BKM-react </it>
<abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp>, Figure <figr fid="F1">1A</figr>, or <it>via </it>the <it>BRENDA </it>website, making use of the <it>BRENDA </it>query engine. Figure <figr fid="F1">1B</figr> illustrates the access to the integrated non-redundant reaction database <abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp> &#8594; <it>Reaction &amp; Specificity &#8594; Biochemicals Reactions Aligned </it>(see arrow). Parameters for doing queries are presented in Figure <figr fid="F2">2A</figr> for the reaction table. Figure <figr fid="F2">2B</figr> shows an example for a query result. The downloadable content of the database consists of three tables, containing the compared reactions, the according compounds as well as a link table connecting both with each other.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Link to the integrated non-redundant reaction database</p></caption><text>
   <p><b>Link to the integrated non-redundant reaction database</b>. Website of <it>BKM-react </it>(A) and the <it>BRENDA </it>main menu with the link <it><ul>Biochemicals Reactions Aligned</ul></it>(B, see arrow).</p>
</text><graphic file="1471-2091-12-42-1" hint_layout="double"/></fig>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>
   <it>BKM-react</it>
</p></caption><text>
   <p><b><it>BKM-react</it></b>. Complete menu for the <it>MySQL </it>query of the reaction table (A) and example query (B).</p>
</text><graphic file="1471-2091-12-42-2" hint_layout="double"/></fig>
</sec>
<sec>
<st>
<p>Results and discussion</p>
</st>
<p>The combined database contains a unique list of reactions that occur in any of the compared databases <it>BRENDA </it>
<abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>, <it>KEGG </it>
<abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp>, and <it>MetaCyc </it>
<abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp> and the associations between equivalent reactions. Additionally these reactions are assigned to <it>KEGG </it>and <it>MetaCyc </it>pathways. Table <tblr tid="T1">1</tblr> lists the data used for the comparison. The largest number of reactions originates from the <it>BRENDA </it>database, followed by <it>MetaCyc</it>, and <it>KEGG</it>.</p>
<tbl id="T1"><title><p>Table 1</p></title><caption><p>Overview reaction sources and data</p></caption><tblbdy cols="9">
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <b>Different</b>
            </p>
            <p>
               <b><it>EC</it> numbers</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Incomplete</b>
            </p>
            <p>
               <b><it>EC</it> numbers</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Reaction IDs</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Reaction IDs</b>
            </p>
            <p>
               <b>without <it>EC</it> number</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Compound IDs</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Synonyms</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>
                  <it>Molfiles</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>
                  <it>InChIs</it>
               </b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="9">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>
                  <it>KEGG</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>3,761</p>
         </c>
         <c ca="center">
            <p>122</p>
         </c>
         <c ca="center">
            <p>8,452</p>
         </c>
         <c ca="center">
            <p>1,288</p>
         </c>
         <c ca="center">
            <p>6,522</p>
         </c>
         <c ca="center">
            <p>11,597</p>
         </c>
         <c ca="center">
            <p>6,327</p>
         </c>
         <c ca="center">
            <p>5,416</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>
                  <it>MetaCyc</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>4,159</p>
         </c>
         <c ca="center">
            <p>138</p>
         </c>
         <c ca="center">
            <p>9,343</p>
         </c>
         <c ca="center">
            <p>2,236</p>
         </c>
         <c ca="center">
            <p>6,095</p>
         </c>
         <c ca="center">
            <p>19,707</p>
         </c>
         <c ca="center">
            <p>6,035</p>
         </c>
         <c ca="center">
            <p>5,782</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>
                  <it>BRENDA</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>4,425</p>
         </c>
         <c ca="center">
            <p>207</p>
         </c>
         <c ca="center">
            <p>10,109</p>
         </c>
         <c ca="center">
            <p>55</p>
         </c>
         <c ca="center">
            <p>9,750</p>
         </c>
         <c ca="center">
            <p>20,922</p>
         </c>
         <c ca="center">
            <p>9,750</p>
         </c>
         <c ca="center">
            <p>5,242</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>The number of reactions in <it>BRENDA </it>is in fact close to 180,000. In this case only complete reactions with natural substrates were included.</p>
   </tblfn></tbl>
<p>A significantly improved matching of reactions was achieved by removing the compounds H<sup>+ </sup>and water (H<sub>2</sub>O) from the reactions before comparing them because the reactions in the databases are not always stoichiometrically balanced. The order of executing first the <it>InChI </it>comparison followed by the name comparison was chosen because identical synonyms may occur for different compounds. To rely on synonyms could therefore result in incorrect links. By using the reverse order more false positive matchings would appear.</p>
<p>One of the difficulties in the comparison consists in the - sometimes implied - stereochemistry not given in the compound name. Whereas cases like "alanine" being used for "<it>L</it>-alanine" are obviously to be expected, sometimes things become more complicated. For example, in <it>BRENDA </it>and <it>MetaCyc beta</it>-stereochemistry is implied for C5 of <it>D</it>-fructose-1,6-bisphosphate, being the major stereoisomer (see Figure <figr fid="F3">3A</figr> and <figr fid="F3">3B</figr>), the <it>KEGG </it>database includes in fact two different reactions, one with <it>beta</it>-stereochemistry at C5, the other with undefined stereochemistry (see Figure <figr fid="F4">4A</figr> and <figr fid="F4">4B</figr>) where pathway information is only assigned to the reaction with the full stereochemistry. In general metabolites with complete stereochemistry are favored in <it>BKM-react. </it>
</p>
<p>If no structural information is available, reactions are allowed to match by name comparison.</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Screenshots of the databases <it>BRENDA </it>and <it>MetaCyc</it></p></caption><text>
   <p><b>Screenshots of the databases <it>BRENDA </it>and <it>MetaCyc</it></b>. These reactions, <it>BRENDA </it>reaction <it>BR47724 </it>and the <it>MetaCyc </it>reaction <it>6PFRUCTPH</it>O<it>S-RXN </it>are matching the <it>KEGG </it>reaction <it>R04</it>7<it>79 </it>(Fig. 4 B) because of the complete <it>InChI </it>string for <it>beta-D</it>-Fructose 1,6-bisphosphate even if <it>MetaCyc </it>names it <it>D</it>-fructose-6-phosphate.</p>
</text><graphic file="1471-2091-12-42-3" hint_layout="single"/></fig>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Screenshots of the database <it>KEGG</it></p></caption><text>
   <p><b>Screenshots of the database <it>KEGG</it></b>. <it>KEGG R00756 </it>and <it>R04779</it>. The second reaction is the preferred one. <it>C04779 </it>possesses the complete <it>InChI </it>string and is therefore matched with the more complete described metabolites of the other databases.</p>
</text><graphic file="1471-2091-12-42-4" hint_layout="double"/></fig>
<p>This example shows a general problem in biochemical compound name comparison. The large majority of biochemists refer to <it>S</it>-alanine just by the name alanine although the name in principle is ambiguous or should be used for the racemate. In most cases we assume that for the standard amino acids the name without stereo-descriptor implicitly means <it>S</it>- (or <it>L</it>-, respectively). This holds true also for some other compound names where the stereo-descriptor is implicitly given. A related problem occurs at positions where the stereochemistry is ambiguous like in the case of C1 of <it>D</it>-glucose. In some cases the stereochemistry for this position is undefined in the <it>Molfiles </it>
<abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>, in others the more stable form (<it>e.g. beta </it>in the case of glucose) is used and defined.</p>
<p>Although all three databases offer their own <it>InChIs</it>, they are not directly comparable because <it>KEGG </it>uses the non-standard form of an <it>InChI</it>, whereby <it>MetaCyc </it>and <it>BRENDA </it>use the standard <it>InChI </it>format. So for a standardized comparison it is necessary to use self-generated <it>InChIs </it>based on <it>Molfiles</it>. For this purpose the official <it>IUPAC </it>converting tool was utilized <abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp>. A higher matching rate was achieved by using only essential layers (see <it>Methods </it>section) of an <it>InChI </it>string. A drawback is that not for each compound an <it>InChI </it>is available, <it>e.g. </it>for macromolecular reactants or for generic compounds.</p>
<p>A pairwise comparison of reactions revealed a high identity between <it>KEGG </it>&amp; <it>MetaCyc</it>. About 50% reactions were equal, out of which most were also found in <it>BRENDA </it>(Figure <figr fid="F5">5</figr>). Between <it>MetaCyc </it>&amp; <it>BRENDA </it>3,174 reactions were identified to be equal. Comparing <it>KEGG </it>&amp; <it>BRENDA</it>, even more reactions (3,617) could be assigned to each other.</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Distribution of unique reactions between the used databases</p></caption><text>
   <p><b>Distribution of unique reactions between the used databases</b>.</p>
</text><graphic file="1471-2091-12-42-5" hint_layout="single"/></fig>
<p>Table <tblr tid="T2">2</tblr> shows the assignment of diverse reactions between the databases which are equal. There are examples of reactions that have a 1:n relation because of redundant reactions occurring within the same database. In <it>KEGG </it>for example, metabolites are differentiated into glycans and compounds, respectively. This means that identical compounds may get two different IDs, starting with <it>G </it>and <it>C</it>. This results in reactions with different reaction IDs (no. 3 in Tab. 2). Sometimes there are synonyms or keto-enol tautomers which describe one reaction in various forms (no. 1 in Tab. 2) or other alternative writing styles (no. 2 in Tab. 2). Further <it>KEGG </it>uses one reaction-ID for the same reaction being catalysed by enzymes with different <it>EC </it>numbers, whereas <it>MetaCyc </it>often uses different reaction IDs in such cases (no. 1 in Tab. 2).</p>
<tbl id="T2"><title><p>Table 2</p></title><caption><p>Some instructive cases of different forms for identical reactions</p></caption><tblbdy cols="5">
      <r>
         <c ca="center">
            <p>
               <b>No.</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>
                  <it>KEGG</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>
                  <it>MetaCyc</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>
                  <it>BRENDA</it>
               </b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Definition</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>1</p>
         </c>
         <c ca="center">
            <p>
               <it>R04915</it>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Quinoline-3,4-diol + Oxygen &lt;=> Formylanthranilate + CO</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <it>R05719</it>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>3-Hydroxy-1H-quinolin-4-one + Oxygen &lt;=> Formylanthranilate + CO</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <it>1.13.11.47-RXN</it>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>3-hydroxy-1H-quinolin-4-one + oxygen = carbon monoxide + <it>N</it>-formylanthranilate</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <it>BR22597</it>
            </p>
         </c>
         <c ca="center">
            <p>3-hydroxy-1H-quinolin-4-one + O2 = N- formylanthranilate + CO</p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>
               <it>R00004</it>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Diphosphate + H2O &lt;=> 2 Orthophosphate</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <it>INORGPYROPHOSPHAT-RXN</it>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>diphosphate + H<sub>2</sub>O = 2 phosphate + H<sup>+</sup></p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <it>BR22749</it>
            </p>
         </c>
         <c ca="center">
            <p>diphosphate + H2O = 2 phosphate</p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>
               <it>R00010</it>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>alpha, alpha-Trehalose + H2O &lt;=> 2 D-Glucose (<it>C01083</it>)</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <it>R06103</it>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Trehalose + H2O &lt;=> 2 D-Glucose (<it>G00293</it>)</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <it>TREHALA-RXN</it>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>trehalose + H2O &#8594; 2 &#946;-D-glucose</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <it>BR15991</it>
            </p>
         </c>
         <c ca="center">
            <p>alpha, alpha-trehalose + H2O = 2 D-glucose</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <it>BS370856</it>
            </p>
         </c>
         <c ca="center">
            <p>alpha, alpha-trehalose + H2O = beta-Dglucose</p>
         </c>
      </r>
   </tblbdy></tbl>
<p>In Figure <figr fid="F5">5</figr> the distribution of equal reactions occurring in any of the three databases is illustrated. 2,890 of all reactions are contained in all three databases, corresponding to 34% of all <it>KEGG </it>reactions, 31% of all <it>MetaCyc </it>reactions, and 29% of the included <it>BRENDA </it>reactions, respectively. In the present version of the data set, 3,743 <it>KEGG </it>reactions, 5,038 <it>MetaCyc </it>reactions, and 4,606 <it>BRENDA </it>reactions occur only in the respective database (Figure <figr fid="F5">5</figr>). Altogether the non-redundant reaction database up to now contains 18,172 unique reactions and 20,358 <it>EC</it>/reaction combinations as some reactions are catalyzed by a number of different enzymes.</p>
<p>In Figure <figr fid="F6">6</figr> the fraction of all unique reactions belonging to the six main <it>EC </it>classes is shown. The largest fractions belong to <it>EC </it>classes 1 and 2, followed by class 3. Statistical data about the <it>EC </it>numbers occurring in the non-redundant reaction database are given in Table <tblr tid="T3">3</tblr>. Additionally to all <it>EC </it>numbers, complete and incomplete, the latter ones are listed separately. Furthermore it is distinguished between <it>EC </it>numbers representing at least one single reaction or more than one. A detailed look on the <it>EC </it>numbers with the highest number of reactions is given in Table <tblr tid="T4">4</tblr> together with the number of reactions.</p>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>Fraction [%] of unique reactions in main EC classes (1-6)</p></caption><text>
   <p><b>Fraction [%] of unique reactions in main EC classes (1-6)</b>.</p>
</text><graphic file="1471-2091-12-42-6" hint_layout="single"/></fig>
<tbl id="T3"><title><p>Table 3</p></title><caption><p>Statistics about <it>EC </it>numbers occurring in the integrated non-redundant reaction database</p></caption><tblbdy cols="3">
      <r>
         <c ca="center">
            <p>
               <b><it>EC </it>numbers</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Different <it>EC </it>numbers</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Incomplete <it>EC </it>numbers</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>in total</p>
         </c>
         <c ca="center">
            <p>4,288</p>
         </c>
         <c ca="center">
            <p>365</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>with > 1 reaction</p>
         </c>
         <c ca="center">
            <p>2,681</p>
         </c>
         <c ca="center">
            <p>185</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>with > 5 reactions</p>
         </c>
         <c ca="center">
            <p>561</p>
         </c>
         <c ca="center">
            <p>73</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>with > 10 reactions</p>
         </c>
         <c ca="center">
            <p>184</p>
         </c>
         <c ca="center">
            <p>49</p>
         </c>
      </r>
   </tblbdy></tbl>
<tbl id="T4"><title><p>Table 4</p></title><caption><p>Complete <it>EC </it>numbers with the highest number of reactions</p></caption><tblbdy cols="3">
      <r>
         <c ca="left">
            <p>
               <b><it>EC </it>number</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Enzyme</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Number of reactions</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1.14.14.1</p>
         </c>
         <c ca="center">
            <p>unspecific monooxygenase</p>
         </c>
         <c ca="center">
            <p>116</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>2.4.1.17</p>
         </c>
         <c ca="center">
            <p>glucuronosyltransferase</p>
         </c>
         <c ca="center">
            <p>80</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3.2.1.21</p>
         </c>
         <c ca="center">
            <p><it>beta</it>-glucosidase</p>
         </c>
         <c ca="center">
            <p>74</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1.1.1.100</p>
         </c>
         <c ca="center">
            <p>3-oxoacyl-[acyl-carrier-protein] reductase</p>
         </c>
         <c ca="center">
            <p>55</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3.5.1.4</p>
         </c>
         <c ca="center">
            <p>amidase</p>
         </c>
         <c ca="center">
            <p>46</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3.6.3.44</p>
         </c>
         <c ca="center">
            <p>xenobiotic-transporting ATPase</p>
         </c>
         <c ca="center">
            <p>46</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3.1.3.16</p>
         </c>
         <c ca="center">
            <p>phosphoprotein phosphatase</p>
         </c>
         <c ca="center">
            <p>44</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1.3.1.10</p>
         </c>
         <c ca="center">
            <p>enoyl-[acyl-carrier-protein] reductase (NADPH, B-specific)</p>
         </c>
         <c ca="center">
            <p>43</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3.2.1.1</p>
         </c>
         <c ca="center">
            <p><it>alpha</it>-amylase</p>
         </c>
         <c ca="center">
            <p>43</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1.1.1.50</p>
         </c>
         <c ca="center">
            <p>3<it>alpha</it>-hydroxysteroid dehydrogenase (B-specific)</p>
         </c>
         <c ca="center">
            <p>42</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>2.3.1.41</p>
         </c>
         <c ca="center">
            <p><it>beta</it>-ketoacyl-acyl-carrier-protein synthase I</p>
         </c>
         <c ca="center">
            <p>41</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3.6.1.9</p>
         </c>
         <c ca="center">
            <p>nucleotide diphosphatase</p>
         </c>
         <c ca="center">
            <p>39</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1.1.1.1</p>
         </c>
         <c ca="center">
            <p>alcohol dehydrogenase</p>
         </c>
         <c ca="center">
            <p>37</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>2.3.1.86</p>
         </c>
         <c ca="center">
            <p>fatty-acyl-CoA synthase</p>
         </c>
         <c ca="center">
            <p>37</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1.14.13.72</p>
         </c>
         <c ca="center">
            <p>methylsterol monooxygenase</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1.2.1.3</p>
         </c>
         <c ca="center">
            <p>aldehyde dehydrogenase (NAD<sup>+</sup>)</p>
         </c>
         <c ca="center">
            <p>34</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1.2.1.5</p>
         </c>
         <c ca="center">
            <p>aldehyde dehydrogenase [NAD(P)<sup>+</sup>]</p>
         </c>
         <c ca="center">
            <p>33</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3.2.1.24</p>
         </c>
         <c ca="center">
            <p><it>alpha</it>-mannosidase</p>
         </c>
         <c ca="center">
            <p>33</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>3.2.1.51</p>
         </c>
         <c ca="center">
            <p><it>alpha-L</it>-fucosidase</p>
         </c>
         <c ca="center">
            <p>33</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1.14.13.8</p>
         </c>
         <c ca="center">
            <p>flavin-containing monooxygenase</p>
         </c>
         <c ca="center">
            <p>32</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>1.4.3.3</p>
         </c>
         <c ca="center">
            <p><it>D</it>-amino-acid oxidase</p>
         </c>
         <c ca="center">
            <p>32</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p><it>Recommended names </it>of enzymes: source <it>BRENDA </it>database.</p>
   </tblfn></tbl>
<p>The only database with a similar goal is <it>BioSilico </it>
<abbrgrp>
<abbr bid="B6">6</abbr>
</abbrgrp>. One important difference consists of the fact that the assignment of identical reactions in our database is done by an actual comparison of the compounds structure in combination with synonyms whereas in <it>BioSilico</it>, the matching is only a simple assignment of reactions having the same <it>EC </it>number without redundancy check.</p>
<p>The number of reactions in the database described in this paper is far beyond that in <it>BioSilico</it>. Selecting three <it>EC </it>numbers by chance resulted in <it>e.g. EC </it>number <it>1.14.14.1 </it>&#8594; 4 reactions in <it>BioSilico vs</it>. 116 reactions in our reaction database, <it>EC </it>number <it>2.1.1.103 </it>&#8594; 1 reaction in <it>BioSilico vs</it>. 4 reactions in our database, <it>3.1.1.47 </it>&#8594; 1 reaction in <it>BioSilico vs</it>. 12 reactions in our database. The fact that in these examples not even all available <it>KEGG </it>reactions were found in <it>BioSilico </it>indicates that this database is not updated.</p>
<p>Additionally, our reaction database contains the information whether a reaction is stoichiometric incomplete or not. This test is performed before the removal of H<sup>+ </sup>and H<sub>2</sub>O. Non-balanced reactions are labeled in a separate table column. 2,803 out of 18,172 reactions are at present in this category. The labeling allows modelers to select only balanced reactions for the reconstruction of organism-specific models and networks.</p>
<p>The tool of Radrich <it>et al. </it>
<abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp> also includes a stoichiometric evaluation. Their method includes a name comparison where they compare the similarity of compound names. Further they use <it>SMILES </it>strings for a structural comparison. The tool was executed only for <it>Arabidopsis thaliana</it>, so no general comparison could be done. For this purpose the authors combined data of the databases <it>KEGG </it>and <it>AraCyc </it>
<abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>In this work we present an integrated and non-redundant reaction database implementing a combined approach of structure and name based comparison.</p>
<p>The tool, integrated into the <it>BRENDA </it>
<abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp> query engine but not confined to <it>BRENDA </it>data is offering a novel valuable tool that can be used <it>e.g. </it>for the construction of biological models. The resulting models will be much more complete than if only one of the databases is used as the three biological databases <it>BRENDA</it>, <it>KEGG </it>
<abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp>, and <it>MetaCyc </it>
<abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp> complement each other. Regular 6-monthly updates of this database will make guarantee actuality.</p>
</sec>
<sec>
<st>
<p>Availability and requirements</p>
</st>
<p>The integrated and non-redundant reaction database is accessible <it>via BKM-react </it>
<abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp> and the website of the <it>BRENDA </it>
<abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp> database: <it>BRENDA </it>website <abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp> &#8594; <it>Reaction &amp; Specificity </it>&#8594; <it>Biochemicals Reactions Aligned </it>(Figure <figr fid="F1">1</figr>). The complete dataset is additionally provided as a CSV-formatted download at the same site. Available is a reaction table, a table with all compounds occurring in the reactions, and an assignment table with the linkage between reactions and compounds.</p>
</sec>
<sec>
<st>
<p>List of abbreviations used</p>
</st>
<p>
<it>BRENDA</it>: <ul>
<it>BR</it>
</ul>aunschweig <ul>
<it>EN</it>
</ul>zyme <ul>
<it>DA</it>
</ul>tabase; <it>EC</it>: <ul>
<it>E</it>
</ul>nzyme <ul>
<it>C</it>
</ul>ommission; <it>InChI</it>: IUPAC <ul>
<it>In</it>
</ul>ternational <ul>
<it>Ch</it>
</ul>emical <ul>
<it>I</it>
</ul>dentifier; <it>IUBMB</it>: <ul>
<it>I</it>
</ul>nternational <ul>
<it>U</it>
</ul>nion of <ul>
<it>B</it>
</ul>iochemistry and <ul>
<it>M</it>
</ul>olecular <ul>
<it>B</it>
</ul>iology; <it>IUPAC</it>: <ul>
<it>I</it>
</ul>nternational <ul>
<it>U</it>
</ul>nion of <ul>
<it>P</it>
</ul>ure and <ul>
<it>A</it>
</ul>pplied <ul>
<it>C</it>
</ul>hemistry; <it>KEGG</it>: <ul>
<it>K</it>
</ul>yoto <ul>
<it>E</it>
</ul>ncyclopedia of <ul>
<it>G</it>
</ul>enes and <ul>
<it>G</it>
</ul>enomes; <it>SMILES</it>: <ul>
<it>S</it>
</ul>implified <ul>
<it>M</it>
</ul>olecular <ul>
<it>I</it>
</ul>nput <ul>
<it>L</it>
</ul>ine <ul>
<it>E</it>
</ul>ntry <ul>
<it>S</it>
</ul>ystem.</p>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>ML and MS executed the data acquisition and implemented the reaction comparison. ML and MS were involved in the construction of the integrated reaction database and the scientific evaluation. DS had the idea to develop the reaction database and supervised the development. ML, MS, and DS wrote the manuscript. All authors read and approved the final manuscript.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements and funding</p>
</st>
<p>The authors are grateful to Ron Caspi from <it>MetaCyc </it>for help with the implementation of the <it>MetaCyc </it>data, Adam Podstawka for technical support, Maurice Scheer for implementing the webinterface, and Ren&#233; Rex for providing a stoichiometry verification tool. Financial support: <it>European Union </it>(<it>FELICS </it>
<abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>, <it>SLING </it>
<abbrgrp>
<abbr bid="B23">23</abbr>
</abbrgrp>).</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>BRENDA, the enzyme information system in 2011</p></title><aug><au><snm>Scheer</snm><fnm>M</fnm></au><au><snm>Grote</snm><fnm>A</fnm></au><au><snm>Chang</snm><fnm>A</fnm></au><au><snm>Schomburg</snm><fnm>I</fnm></au><au><snm>Munaretto</snm><fnm>C</fnm></au><au><snm>Rother</snm><fnm>M</fnm></au><au><snm>S&#246;hngen</snm><fnm>C</fnm></au><au><snm>Stelzer</snm><fnm>M</fnm></au><au><snm>Thiele</snm><fnm>J</fnm></au><au><snm>Schomburg</snm><fnm>D</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>2011</pubdate><volume>39</volume><fpage>D670</fpage><lpage>D676</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkq1089</pubid><pubid idtype="pmcid">3013686</pubid><pubid idtype="pmpid" link="fulltext">21062828</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>KEGG for representation and analysis of molecular networks involving diseases and drugs</p></title><aug><au><snm>Kanehisa</snm><fnm>M</fnm></au><au><snm>Goto</snm><fnm>S</fnm></au><au><snm>Furumichi</snm><fnm>M</fnm></au><au><snm>Tanabe</snm><fnm>M</fnm></au><au><snm>Hirakawa</snm><fnm>M</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>2009</pubdate><volume>38</volume><fpage>D355</fpage><lpage>D360</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2808910</pubid><pubid idtype="pmpid" link="fulltext">19880382</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases</p></title><aug><au><snm>Caspi</snm><fnm>R</fnm></au><au><snm>Altman</snm><fnm>T</fnm></au><au><snm>Dale</snm><fnm>JM</fnm></au><au><snm>Dreher</snm><fnm>K</fnm></au><au><snm>Fulcher</snm><fnm>CA</fnm></au><au><snm>Gilham</snm><fnm>F</fnm></au><au><snm>Kaipa</snm><fnm>P</fnm></au><au><snm>Karthikeyan</snm><fnm>AS</fnm></au><au><snm>Kothari</snm><fnm>A</fnm></au><au><snm>Krummenacker</snm><fnm>M</fnm></au><au><snm>Latendresse</snm><fnm>M</fnm></au><au><snm>Mueller</snm><fnm>LA</fnm></au><au><snm>Paley</snm><fnm>S</fnm></au><au><snm>Popescu</snm><fnm>L</fnm></au><au><snm>Pujar</snm><fnm>A</fnm></au><au><snm>Shearer</snm><fnm>AG</fnm></au><au><snm>Zhang</snm><fnm>P</fnm></au><au><snm>Karp</snm><fnm>PD</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>2009</pubdate><volume>38</volume><fpage>D473</fpage><lpage>D479</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2808959</pubid><pubid idtype="pmpid" link="fulltext">19850718</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Integrating biological databases</p></title><aug><au><snm>Stein</snm><fnm>LD</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2003</pubdate><volume>4</volume><fpage>337</fpage><lpage>345</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">12728276</pubid></xrefbib></bibl><bibl id="B5"><title><p>TRANSPATH<sup>&#174;</sup>: an integrated database on signal transduction and a tool for array analysis</p></title><aug><au><snm>Krull</snm><fnm>M</fnm></au><au><snm>Voss</snm><fnm>N</fnm></au><au><snm>Choi</snm><fnm>C</fnm></au><au><snm>Pistor</snm><fnm>S</fnm></au><au><snm>Potapov</snm><fnm>A</fnm></au><au><snm>Wingender</snm><fnm>E</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>2003</pubdate><volume>31</volume><fpage>97</fpage><lpage>100</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkg089</pubid><pubid idtype="pmcid">165536</pubid><pubid idtype="pmpid" link="fulltext">12519957</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>BioSilico: an integrated metabolic database system</p></title><aug><au><snm>Hou</snm><fnm>BK</fnm></au><au><snm>Kim</snm><fnm>JS</fnm></au><au><snm>Jun</snm><fnm>JH</fnm></au><au><snm>Lee</snm><fnm>D-Y</fnm></au><au><snm>Kim</snm><fnm>YW</fnm></au><au><snm>Chae</snm><fnm>S</fnm></au><au><snm>Roh</snm><fnm>M</fnm></au><au><snm>In</snm><fnm>Y-H</fnm></au><au><snm>Lee</snm><fnm>SY</fnm></au></aug><source>Bioinformatics</source><pubdate>2004</pubdate><volume>20</volume><fpage>3270</fpage><lpage>3272</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bth363</pubid><pubid idtype="pmpid" link="fulltext">15201189</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>ExPASy - ENZYME</p></title><url>http://www.expasy.org/enzyme/</url></bibl><bibl id="B8"><title><p>EcoCyc: Encyclopedia of Escherichia coli K-12 Genes and Metabolism</p></title><url>http://ecocyc.org/</url></bibl><bibl id="B9"><title><p>BioCyc Home</p></title><url>http://biocyc.org/</url></bibl><bibl id="B10"><title><p>Integration of metabolic databases for the reconstruction of genome-scale metabolic networks</p></title><aug><au><snm>Radrich</snm><fnm>K</fnm></au><au><snm>Tsuruoka</snm><fnm>Y</fnm></au><au><snm>Dobson</snm><fnm>P</fnm></au><au><snm>Gevorgyan</snm><fnm>A</fnm></au><au><snm>Swainston</snm><fnm>N</fnm></au><au><snm>Baart</snm><fnm>G</fnm></au><au><snm>Schwartz</snm><fnm>J-M</fnm></au></aug><source>BMC Systems Biology</source><pubdate>2010</pubdate><volume>4</volume><fpage>114</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1752-0509-4-114</pubid><pubid idtype="pmcid">2930596</pubid><pubid idtype="pmpid" link="fulltext">20712863</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>TAIR - AraCyc</p></title><url>http://www.arabidopsis.org/biocyc/index.jsp</url></bibl><bibl id="B12"><title><p>Reactome knowledgebase of human biological pathways and processes</p></title><aug><au><snm>Matthews</snm><fnm>L</fnm></au><au><snm>Gopinath</snm><fnm>G</fnm></au><au><snm>Gillespie</snm><fnm>M</fnm></au><au><snm>Caudy</snm><fnm>M</fnm></au><au><snm>Croft</snm><fnm>D</fnm></au><au><snm>de Bono</snm><fnm>B</fnm></au><au><snm>Garapati</snm><fnm>P</fnm></au><au><snm>Hemish</snm><fnm>J</fnm></au><au><snm>Hermjakob</snm><fnm>H</fnm></au><au><snm>Jassal</snm><fnm>B</fnm></au><au><snm>Kanapin</snm><fnm>A</fnm></au><au><snm>Lewis</snm><fnm>S</fnm></au><au><snm>Mahajan</snm><fnm>S</fnm></au><au><snm>May</snm><fnm>B</fnm></au><au><snm>Schmidt</snm><fnm>E</fnm></au><au><snm>Vastrik</snm><fnm>I</fnm></au><au><snm>Wu</snm><fnm>G</fnm></au><au><snm>Birney</snm><fnm>E</fnm></au><au><snm>Stein</snm><fnm>L</fnm></au><au><snm>D&apos;Eustachio</snm><fnm>P</fnm></au></aug><source>Nucleic Acids Research</source><pubdate>2009</pubdate><volume>37</volume><fpage>D619</fpage><lpage>D622</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkn863</pubid><pubid idtype="pmcid">2686536</pubid><pubid idtype="pmpid" link="fulltext">18981052</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Rhea - Annotated reactions database</p></title><url>http://www.ebi.ac.uk/rhea//home.xhtml#</url></bibl><bibl id="B14"><title><p>IUBMB Nomenclature Home Page</p></title><url>http://www.chem.qmul.ac.uk/iubmb/</url></bibl><bibl id="B15"><title><p>Biochemical Pathways: An Atlas of Biochemistry and Molecular</p></title><aug><au><snm>Michal</snm><fnm>G</fnm></au><au><snm>Schomburg</snm><fnm>D</fnm></au></aug><source>Biology</source><publisher>Wiley-Interscience</publisher><edition>2</edition><pubdate>2011</pubdate><inpress/></bibl><bibl id="B16"><title><p>An open standard for chemical structure representation--the IUPAC chemical identifier</p></title><aug><au><snm>Stein</snm><fnm>SE</fnm></au><au><snm>Heller</snm><fnm>SR</fnm></au><au><snm>Tchekhovski</snm><fnm>D</fnm></au></aug><source>Nimes International Chemical Information Conference Proceedings, Nimes, France</source><pubdate>2003</pubdate><fpage>131</fpage><lpage>143</lpage></bibl><bibl id="B17"><title><p>Description of several chemical structure file formats used by computer programs developed at Molecular Design Limited</p></title><aug><au><snm>Dalby</snm><fnm>A</fnm></au><au><snm>Nourse</snm><fnm>JG</fnm></au><au><snm>Hounshell</snm><fnm>WD</fnm></au><au><snm>Gushurst</snm><fnm>AKI</fnm></au><au><snm>Grier</snm><fnm>DL</fnm></au><au><snm>Leland</snm><fnm>BA</fnm></au><au><snm>Laufer</snm><fnm>J</fnm></au></aug><source>Journal of Chemical Information and Computer Sciences</source><pubdate>1992</pubdate><volume>32</volume><fpage>244</fpage><lpage>255</lpage></bibl><bibl id="B18"><title><p>International Union of Pure and Applied Chemistry</p></title><url>http://www.iupac.org/inchi/download/index.html</url></bibl><bibl id="B19"><title><p>Daylight</p></title><url>http://www.daylight.com/</url></bibl><bibl id="B20"><title><p>BKM-react online</p></title><url>http://bkm-react.tu-bs.de/</url></bibl><bibl id="B21"><title><p>Enzyme Database - BRENDA</p></title><url>http://www.brenda-enzymes.org/</url></bibl><bibl id="B22"><title><p>FELICS | FELICS</p></title><url>http://www.felics.org</url></bibl><bibl id="B23"><title><p>SLING | sling-fp7.org</p></title><url>http://www.sling-fp7.org/</url></bibl></refgrp>
</bm></art>