<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-7-234</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Metabolomic database annotations <it>via </it>query of elemental compositions: Mass accuracy is insufficient even at less than 1 ppm</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Kind</snm>
               <fnm>Tobias</fnm>
               <insr iid="I1"/>
               <email>tkind@ucdavis.edu</email>
            </au>
            <au id="A2">
               <snm>Fiehn</snm>
               <fnm>Oliver</fnm>
               <insr iid="I1"/>
               <email>ofiehn@ucdavis.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>University of California Davis, Genome Center, 451 E. Health Sci Dr., Davis, CA 95616, USA</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>234</fpage>
         <url>http://www.biomedcentral.com/1471-2105/7/234</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16646969</pubid>
               <pubid idtype="doi">10.1186/1471-2105-7-234</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>22</day>
               <month>12</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>28</day>
               <month>4</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>28</day>
               <month>4</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Kind and Fiehn; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Metabolomic studies are targeted at identifying and quantifying all metabolites in a given biological context. Among the tools used for metabolomic research, mass spectrometry is one of the most powerful tools. However, metabolomics by mass spectrometry always reveals a high number of unknown compounds which complicate in depth mechanistic or biochemical understanding. In principle, mass spectrometry can be utilized within strategies of <it>de novo </it>structure elucidation of small molecules, starting with the computation of the elemental composition of an unknown metabolite using accurate masses with errors &lt;5 ppm (parts per million). However even with very high mass accuracy (&lt;1 ppm) many chemically possible formulae are obtained in higher mass regions. In automatic routines an additional orthogonal filter therefore needs to be applied in order to reduce the number of potential elemental compositions. This report demonstrates the necessity of isotope abundance information by mathematical confirmation of the concept.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>High mass accuracy (&lt;1 ppm) alone is not enough to exclude enough candidates with complex elemental compositions (C, H, N, S, O, P, and potentially F, Cl, Br and Si). Use of isotopic abundance patterns as a single further constraint removes >95% of false candidates. This orthogonal filter can condense several thousand candidates down to only a small number of molecular formulas. Example calculations for 10, 5, 3, 1 and 0.1 ppm mass accuracy are given. Corresponding software scripts can be downloaded from <url>http://fiehnlab.ucdavis.edu</url>. A comparison of eight chemical databases revealed that PubChem and the Dictionary of Natural Products can be recommended for automatic queries using molecular formulae.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>More than 1.6 million molecular formulae in the range 0&#8211;500 Da were generated in an exhaustive manner under strict observation of mathematical and chemical rules. Assuming that ion species are fully resolved (either by chromatography or by high resolution mass spectrometry), we conclude that a mass spectrometer capable of 3 ppm mass accuracy and 2% error for isotopic abundance patterns outperforms mass spectrometers with less than 1 ppm mass accuracy or even hypothetical mass spectrometers with 0.1 ppm mass accuracy that do not include isotope information in the calculation of molecular formulae.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Metabolomics seeks to identify and quantify all metabolites in a given biological context <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. In this respect its aim is different from metabolic fingerprinting or metabonomic approaches which utilize high dimensional unannotated variables and multivariate statistics to find biomarkers that may or may not be structurally identified in subsequent steps. Therefore, an important task in metabolomics is to identify or structurally annotate compounds in a high throughput manner. Mass spectrometry is one of the most powerful tools for unbiased analysis of small molecules in life sciences. Hundreds to thousands of metabolites can be detected when suitable sample preparation methods <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> and mass spectrometric techniques are used <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. However, most of the metabolites in complex biological materials like plant tissues are non-annotated, unidentified metabolites <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> due to the lack of experimental databases and the chemical complexity and changing nature of an organism's metabolome. Metabolites cannot be sequenced like proteins or polynucleotides. Instead, each individual compound needs to undergo structural elucidation, starting from the elemental composition. In addition to detection and quantification of metabolites, mass spectra can further be exploited for structural elucidation of compounds <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>In order to reduce the number of <it>de novo </it>elucidations for metabolomic studies, a reasonable strategy could start with tentatively annotating metabolomic mass spectra with a list of compounds that match the elemental composition of small molecules found in publicly available databases. For numerical reasons the list of potential metabolic candidates will vary with the size and the quality of the queried database, but in principle, even structures with uncommon chemical conformations like ladderanes <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> (Figure <figr fid="F1">1</figr>) cannot be excluded <it>a priori</it>. The list of tentative annotations could be further confined in subsequent steps by including additional physicochemical or biological information such as matching predicted versus determined MS/MS fragmentation patterns <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp> or likelihood assessments from exploiting genomic knowledge about an organisms' biochemical pathways. However, without reference standards or complementary structural data (e.g. garnered by 2D nuclear magnetic resonance <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, a certain level of ambiguity will remain in purely mass spectrometric approaches due to the combinatorial explosion. It is important to note that mass spectrometry alone can not distinguish between stereoisomers.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Nature is known to synthesize "fancy" compounds</p>
            </caption>
            <text>
               <p><b>Nature is known to synthesize "fancy" compounds</b>. A natural occurring ladderane produced by the anammox bacterium "Candidatus Brocadia anammoxidans"</p>
            </text>
            <graphic file="1471-2105-7-234-1"/>
         </fig>
         <p>The mass of chemical elements is based on the conventional scale that defines carbon C = 12.000 u. Chemical elements are comprised of a different number of neutrons, protons and electrons, so that the combined mass for each element (other than <sup>12</sup>C) is a rational (non-integer) number: <sup>1</sup>H = 1.007825 u, <sup>14</sup>N = 14.003070 u, <sup>16</sup>O = 15.994910 u <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Consequently, for any given metabolite, the accurate mass deviates from the nominal mass. This feature can be exploited for recursively calculating the elemental composition from an unknown metabolite mass spectrum in the ranges of the measurement error. Mass spectrometers today can measure mass/charge ratios with high (&lt;5 ppm error; <it>parts per million</it>) or very high mass accuracy (&lt;1 ppm) <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> and can be purchased with implemented software algorithms that derive a list of possible elemental compositions from the measured monoisotopic mass. Using the accurate mass one can either solve the diophantine equation <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> or one can use a brute force approach <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> and can calculate all possible elemental combinations in a certain range.</p>
         <p>Another important prerequisite for this approach is not only accurate mass measurement but also a high resolving power of the mass spectrometer. As the output of a mass spectrum is represented as a Gaussian or Lorentzian like peak shape, very near peaks can overlap on devices with low resolving power. Resolving power (<it>m/&#916;m</it>) at a certain m/z value can be calculated at full-width half-height maximum (FWHM) of the peak. Quadrupole mass spectrometers usually can reach 3000 <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, TOF analyzers up to 10,000 and Fourier Transform Ion Cyclotron Resonance (FT-ICR) mass spectrometers can have a resolving power up to 1,000,000 or larger <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Isobaric masses (for example C<sub>37</sub>H<sub>31</sub>N<sub>8</sub>P<sub>7</sub>S<sub>3 </sub>MW = 899.999692 and C<sub>20</sub>H<sub>43</sub>N<sub>2</sub>O<sub>19</sub>P<sub>3</sub>S<sub>6 </sub>MW= 899.999678) can not be resolved by mass spectrometry only. In this case chromatography helps to separate these overlapping components.</p>
         <p>For the case of peptides it was claimed that accurate mass measurements of 1 ppm error would be sufficient to derive a single solution in most cases <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. However this is not applicable for small molecules, because they are not only derived from combinations of certain amino acids. We demonstrate in this report that even a hypothetical instrument capable of accurate mass measurements of 0.1 ppm error would not fulfill this premise when matching against a comprehensive list of chemically possible elemental compositions.</p>
         <p>Additional information is required that can readily be gathered from mass spectra: the abundance of natural isotopes in metabolites which refer to the percentages in which the isotopes of an element are found in natural sources on earth. The isotopic abundance pattern of a metabolite's mass spectrum can serve as a powerful additional constraint for removing wrong elemental composition candidates. Isotope ratio mass spectrometers <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> exactly determine isotope abundances, however, under combustions of the original molecule into CO2 or other gases and therefore irrelevant for the calculation of elemental compositions of unidentified metabolites. In general, the theoretical isotopic abundance pattern of a molecular formula can be calculated using different approaches <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> either solving polynomial equations or using fast Fourier transformations <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. An isotope abundance filter can be used for any mass spectrometer which can provide very low root mean square (RMS) errors for isotopic patterns, especially if the contribution of further metabolites can be ignored by coupling compound separation to mass spectrometric detection using liquid or gas chromatography (LC/MS and GC/MS). Mass spectra may include fragmentations, rearrangements, and adducts <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. For the sake of clarity, mathematical and chemical considerations reported here are constrained to pseudo-molecular ions that are completely resolved from interfering compounds, assuming the utilization of efficient chromatography or high resolution mass spectrometry <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, or a combination of both.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Database queries of elemental compositions</p>
            </st>
            <p>Assuming that a unique elemental composition could be derived from a mass spectrum, this molecular information can be furnished for metabolite annotation in either of two distinct ways: an exhaustive computation of all chemically possible isomeric structures or a query of databases for known (bio)chemical compounds.</p>
            <p>Exhaustive methods (Figure <figr fid="F2">2</figr>) utilize either a deterministic approach <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> or a stochastic molecular isomer generator <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. For a given molecular formula, several hundred to billions of isomers can be constructed, depending on the number and nature of elements given by the chemical composition. The number of molecular formulas for the eleven most common elements at 1000 u is reported to be more than 350 millions <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. For small molecules that are analyzed by electron impact mass spectrometry, a deterministic method called MOLGEN-MS is available <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. For high molecular weight compounds, deterministic methods are quickly challenged by computing power limits due to the combinatorial explosion of isomeric structures which may render stochastic isomer generators more promising for the future <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Metabolite annotation schema based on mass spectrometric calculation of elemental compositions and subsequent database queries</p>
               </caption>
               <text>
                  <p>Metabolite annotation schema based on mass spectrometric calculation of elemental compositions and subsequent database queries.</p>
               </text>
               <graphic file="1471-2105-7-234-2"/>
            </fig>
            <p>For automatic annotation of metabolites in metabolomic screens, it seems today more reasonable to first search against existing chemical structures or even to limit searches for known natural product databases. A randomly chosen molecular formula of C<sub>15</sub>H<sub>12</sub>O<sub>7 </sub>(304.0583 u) was taken as test case for query results, which should comprise structures like the naturally occurring pentahydroxyflavone (Figure <figr fid="F3">3</figr>). Seven repositories were compared for this exemplary case (Table <tblr tid="T1">1</tblr>): the life-science oriented PubChem database of the U.S. National Institutes of Health and its sub-DB ChemIDplus, the Kyoto ligand biochemical pathway database (KEGG), the CRC dictionary of natural products (DNP), a large compendium of organic chemical structures (Beilstein), a list of commercially available chemicals which could be used for confirming any given hit (MDL), a mass spectrum library used in GC/MS (NIST 5.0) and the complement of small molecules that have been described in the chemical and biochemical literature: the Chemical Abstracts Service (CAS) database.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>An example Pentahydroxyflavone (C<sub>15</sub>H<sub>12</sub>O<sub>7</sub>) taken from the KEGG database</p>
               </caption>
               <text>
                  <p>An example Pentahydroxyflavone (C<sub>15</sub>H<sub>12</sub>O<sub>7</sub>) taken from the KEGG database.</p>
               </text>
               <graphic file="1471-2105-7-234-3"/>
            </fig>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Example of a molecular formula search for C<sub>15</sub>H<sub>12</sub>O<sub>7 </sub>in different chemical databases. Search date: July 2007</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Database name</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Compounds found</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>Total database entries</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chemical Abstracts (CAS)</p>
                     </c>
                     <c ca="center">
                        <p>181</p>
                     </c>
                     <c ca="right">
                        <p>24,000,000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Beilstein Database (MDL)</p>
                     </c>
                     <c ca="center">
                        <p>166</p>
                     </c>
                     <c ca="right">
                        <p>8,000,000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Dictionary of Natural Products (DNP)</p>
                     </c>
                     <c ca="center">
                        <p>129</p>
                     </c>
                     <c ca="right">
                        <p>170,000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>PubChem (NIH)</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                     <c ca="right">
                        <p>800,000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Available Chemicals Directory (MDL)</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="right">
                        <p>400,000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ChemIDplus (NIH)</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="right">
                        <p>370,000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>KEGG (Kyoto University)</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="right">
                        <p>13,000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NIST05 (NIST mass spectral database)</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>163,000</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MOLGEN molecular isomer generator (allowing 2 benzene groups; 1 ether group, 1 keto group; 5 hydroxy groups)</p>
                     </c>
                     <c ca="center">
                        <p>788,000</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>A range of conclusions can be derived from this exercise (Table <tblr tid="T1">1</tblr>). Due to its limited size and its focus to consensus biochemical pathways, the KEGG database returned far fewer hits compared to more comprehensive repositories like CAS or DNP. It is important to note that therefore, automatic annotations of mass spectra must not be limited to KEGG searches alone. Conversely, however, any hit retrieved from KEGG queries might receive a higher likelihood of truly representing an identifiable metabolite due to the focus on (conserved) biochemical pathways represented in KEGG. In contrast to the small KEGG (Ligand) DB, the CAS database represents the largest database available for small molecules containing ~ 20 million organic chemicals. However, CAS cannot serve as suitable database for routine metabolite queries. On the one hand, CAS contains many compounds that have been artificially synthesized and reported by chemists, and thus are often unlikely to be present as natural compounds. On the other hand, the CAS SciFinder front end enables only a very limited and slow formula search, allowing queries of one formula at a time but not batches or series of queries. For these two reasons, CAS queries can be excluded from automated annotation efforts of complex metabolomic surveys; however, for identification purposes of selected unknown compounds in biomarker studies, the CAS database still provides the most comprehensive overview. It is interesting to note that DNP with only 170,000 entries retrieves 129 different isomeric structures of C<sub>15</sub>H<sub>12</sub>O<sub>7 </sub>(among them many stereoisomers) whereas the far larger PubChem database resulted in only 19 hits. The PubChem database is a fast growing database. At the time of search it had only 800,000 entries, now it has more than 5 millions. PubChem is a freely accessible database and includes KEGG, ChemIDplus and NCBI and several other databases and should therefore be included in automatic metabolite annotations. An in-depth molecular diversity calculation could reveal any overlap <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. For an automated approach, the DNP database in SD file format (*.sdf) could be used whereas only semi-automatic procedures would be possible for the Beilstein database. Consequently, for identification routines of unknown metabolites starting from elemental compositions, DNP and PubChem search results should be combined.</p>
         </sec>
         <sec>
            <st>
               <p>Calculating elemental compositions: construction of an exhaustive test data set</p>
            </st>
            <p>The input into metabolomic queries are elemental compositions which are calculated from experimental mass spectra. Often, the performance of mass spectrometers and underlying software algorithms to calculate such molecular formulae are presented on test cases. However, molecular formulae are not uniformly distributed across the mass range. In order to exhaustively test the performance and power of algorithms calculating elemental compositions, a data set containing all chemically possible molecular formulae between a molecular mass of 20 &#8211; 500 u (using the most common elements C, H, N, O, P and S) was constructed. It is wrongly assumed by researchers outside the mass spectrometry community that within that mass range, high mass accuracy calculations of &lt;1 ppm would result in unambiguous calculation of unique elemental compositions. We therefore have applied a number of chemical constraints to reduce the number of potential elemental compositions in the exhaustive data set to only those combinations that are allowed by chemical bonding rules. Applying constraints is the most crucial step during the whole process of formula finding and structure elucidation. Consequently, we have used the molecular weight calculator MWTWIN with a variety of restrictions: the "smart H atoms" option was used to avoid the calculation of an unreasonably high number of hydrogen atoms. This excludes species like C2<sub>6</sub>H<sub>2 </sub>which are chemically possibly but not relevant for metabolomics. In extremely seldom cases this can lead to an exclusion of certain formulas with multicenter bonds (C<sub>10</sub>H<sub>25</sub>NO<sub>4</sub>). Secondly, metals have been excluded in our test data set because most metabolites do not contain coordinating metal atoms (although certainly a number of naturally occurring metabolites do, such as chlorophylls). However, in case trimethylsilylation was used for derivatization, search queries in GC/MS metabolite profiling data must obviously include Si which was left aside for this test data set. A third important constraint is the application of valence rules for which LEWIS and SENIOR rules were applied. These rules were found to serve as an important constraint that helped reducing an initial number of 3.5 million combinations of elemental compositions to 1.6 million for the mass range of 20&#8211;500 u (C, H, N, S O and P). Surprisingly, a number of both commercial and non-commercial formula generators are based purely on mathematical rules but do not obey the LEWIS and SENIOR chemical rules. As result, for a mass of 129.034 u species like C<sub>9</sub>H<sub>5</sub>O would be calculated by such formula generators which do not exist as natural compounds (however, which might exist as charged or radical species in the gas phase). Shortly, the LEWIS rule expects each compound to account for an even number of electrons with atoms that all obey the octet rule. SENIOR's theorem <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp> requires three essential conditions for the existence of molecular graphs:</p>
            <p>A) the sum of valences is an even number, or the total number of atoms having odd valences is even;</p>
            <p>B) the sum of valences is greater than or equal to twice the maximum valence;</p>
            <p>C) the sum of valences is greater than or equal to twice the number of atoms minus 1.</p>
            <p>We have written scripts that include these rules in order to reduce the number of generated formulae that are exported from current commercial or non-commercial software products. The second rule was not included because it only proofs the non-existence of very small molecules like CH<sub>2 </sub><abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. The current script only allows atom numbers less than 100. We have not put in a further constraint that would account for the number of and double bonds (RDBE <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>) or double bond equivalent (DBE) because for complex molecules with more than five atom types the calculation gets quite complicated. For example, nitrogen and phosphorous can have 3 or 5 valences, and sulphur atoms may have 2, 4 or 6 valences. For molecules that contain these three atoms in different valance states, no single solution for RDBE can be calculated but an RDBE range would result. An in-dept mathematical discussion of this problem can be found here <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Applying the LEWIS and SENIOR check is thus much more reliable and straightforward. Our current software script obeys standard valences (<it>ground state chemistry </it><abbrgrp><abbr bid="B17">17</abbr></abbrgrp>) in a conservative effort to produce an exhaustive number of formulas for ground state chemistry.</p>
            <p>A plot of all elemental compositions between 200&#8211;300 u is given in Figure <figr fid="F4">4</figr>. It becomes immediately clear that elemental compositions are not uniformly distributed across the mass range but recurring modalities, which are due to the dependence of elemental compositions upon the chemical constraints applied. Hence, there are large areas where not a single elemental composition exists (e.g. at 297.500 u there is no formulae within +/- 0.148 u (497 ppm mass accuracy; MWTWIN smart H option). Conversely, at maximum frequency modalities, several thousand of potential formulae are chemically allowed (e.g. around 2000 elemental compositions are retrieved between 297.74&#8211;298.34 u). Mass ranges without existing molecular formulae will shift and narrow with higher mass ranges, but peak frequencies and the characteristic pattern will remain. Consequently, the performance of mass spectrometers during elemental compositions analysis tests should be shown with masses at &#177; 1&#963; around maximum peak frequencies and not with the low number of compounds that are found at the valley of the composition distributions.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Trend pattern histogram for mathematical possible number of molecular formulae (C, H, N, S, O and P) for the mass range 200 u-300 u</p>
               </caption>
               <text>
                  <p><b>Trend pattern histogram for mathematical possible number of molecular formulae (C, H, N, S, O and P) for the mass range 200 u-300 u</b>. MWTWIN with bounded search was used, LEWIS check was applied. A step size of 0.01 u was taken for counting the number of formulae.</p>
               </text>
               <graphic file="1471-2105-7-234-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Limits for unique molecular formula assignment</p>
            </st>
            <p>The generation of a comprehensive data set of all chemically possible molecular formulae between 20&#8211;500 u enables the prediction of the upper ppm limit for unique molecular formula assignment (see Table <tblr tid="T2">2</tblr>). Querying masses and formulae at peak frequency distributions from Figure <figr fid="F4">4</figr>, we have determined that this mass limit is as low as 126.000 u when the most common elements (C, H, N, S, O and P) are included and a 1 ppm mass accuracy level is assumed. With these restrictions, two chemically possible formulae are generated, C<sub>2</sub>H<sub>8</sub>O<sub>2</sub>P<sub>2 </sub>and C<sub>3</sub>H<sub>2</sub>N<sub>4</sub>S, both of which can be found in the CAS database and have thus indeed been reported to be existent. This level is far lower than conventionally assumed <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> and would likely be found at an even lower mass if elements like F, Cl, Br and Si were included. It is important to note that from this mass on an increasing number of formulas occur, demonstrating that &lt;1 ppm mass accuracy alone is not sufficient for unique elemental composition assignment. Consequently, for an automatic routine, additional constraints are needed to limit the number of unique formulae from a given mass measurement.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Limits for unique formula assignment at certain levels of mass accuracy [ppm]. Above the listed mass ranges multiple formula findings cumulate. The CAS database sometimes reports D instead of H and radicals and ions as substances. Molgen was used with lowest element valence values. Formulas must contain C and H out of elements CHNSOP</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>example</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>example</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>CAS Hits</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>CAS Hits</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MOLGEN</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MOLGEN</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>ppm</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>mass range &lt; [Da]</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>compound 1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>compound 2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>formula 1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>formula 2</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>formula 1</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>formula 2</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0.1</p>
                     </c>
                     <c ca="center">
                        <p>185.9760</p>
                     </c>
                     <c ca="center">
                        <p>CH2N2O9</p>
                     </c>
                     <c ca="center">
                        <p>C4H11PS3</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>7116</p>
                     </c>
                     <c ca="center">
                        <p>1116</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0.5</p>
                     </c>
                     <c ca="center">
                        <p>138.0000</p>
                     </c>
                     <c ca="center">
                        <p>C4H2N4S</p>
                     </c>
                     <c ca="center">
                        <p>C3H8O2P2</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>247932</p>
                     </c>
                     <c ca="center">
                        <p>353</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>126.0000</p>
                     </c>
                     <c ca="center">
                        <p>C2H8O2P2</p>
                     </c>
                     <c ca="center">
                        <p>C3H2N4S</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>2852</p>
                     </c>
                     <c ca="center">
                        <p>24928</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>126.0000</p>
                     </c>
                     <c ca="center">
                        <p>C2H8O2P2</p>
                     </c>
                     <c ca="center">
                        <p>C3H2N4S</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>2852</p>
                     </c>
                     <c ca="center">
                        <p>24928</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>126.0000</p>
                     </c>
                     <c ca="center">
                        <p>C2H8O2P2</p>
                     </c>
                     <c ca="center">
                        <p>C3H2N4S</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>2852</p>
                     </c>
                     <c ca="center">
                        <p>24928</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>95.9881</p>
                     </c>
                     <c ca="center">
                        <p>C3HN2P</p>
                     </c>
                     <c ca="center">
                        <p>CH4O3S</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>522</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>95.9881</p>
                     </c>
                     <c ca="center">
                        <p>C3HN2P</p>
                     </c>
                     <c ca="center">
                        <p>CH4O3S</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>522</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>95.9881</p>
                     </c>
                     <c ca="center">
                        <p>C3HN2P</p>
                     </c>
                     <c ca="center">
                        <p>CH4O3S</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>522</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>95.9881</p>
                     </c>
                     <c ca="center">
                        <p>C3HN2P</p>
                     </c>
                     <c ca="center">
                        <p>CH4O3S</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>522</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>95.9881</p>
                     </c>
                     <c ca="center">
                        <p>C3HN2P</p>
                     </c>
                     <c ca="center">
                        <p>CH4O3S</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>522</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>95.9881</p>
                     </c>
                     <c ca="center">
                        <p>C3HN2P</p>
                     </c>
                     <c ca="center">
                        <p>CH4O3S</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>522</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>93.9911</p>
                     </c>
                     <c ca="center">
                        <p>CH2O5</p>
                     </c>
                     <c ca="center">
                        <p>C2H6S2</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>77.9788</p>
                     </c>
                     <c ca="center">
                        <p>CH2O2S</p>
                     </c>
                     <c ca="center">
                        <p>CH4P2</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Accurate isotope abundance complements accurate mass measurements</p>
            </st>
            <p>Natural compounds on earth (such as metabolites from biological specimen) reflect the natural abundance of stable elemental isotopes, such as <sup>13</sup>C (which is found at approx. 1.11% of the most frequent isotope <sup>12</sup>C), <sup>18</sup>O (0.2% of <sup>16</sup>O), <sup>15</sup>N (0.367 % of <sup>14</sup>N), <sup>2</sup>D (0.015% of <sup>1</sup>H) and <sup>33</sup>S and <sup>34 </sup>S (0.79 and 4.43 % of <sup>32</sup>S). The actual ratios of these stable isotopes slightly differ for each element within narrow ranges <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Consequently, each monoisotopic pseudomolecular ion (M<sub>0</sub>) that is used for accurate mass determinations is always accompanied by additional isotope ions. The abundance of the isotope ions (M+1, M+2, M+3) is dependent on the actual elemental composition and can therefore serve as a powerful filter in calculating unique elemental compositions from mass spectral data. In table <tblr tid="T3">3</tblr> the number of calculated elemental compositions for 150.000 to 900.000 u is given at mass accuracy levels of 10-0.1 ppm without and with additional isotope abundance information. Using conventional calculations, isotope information is not included. It is clearly seen that above approx. 200 u, mass accuracies of 3&#8211;5 ppm (an error level that is usually achieved by time-of-flight mass spectrometers, TOF <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>) lead to multiple chemically possible formulae, and to dozens of elemental compositions at masses above 400 u. It has therefore been argued to utilize the high resolving power and mass accuracy of Fourier transform ion cyclotron resonance mass spectrometers that achieve around 1 ppm average error in daily routines in unattended mode sometimes worse <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>). However, even at 1 ppm error, ambiguity of chemical formulae increases sharply above 400 u, a range in which many secondary metabolites are detected. Use of a hypothetical mass spectrometer with only 0.1 ppm error would still not result in unique solutions above 500 u, which leads to the conclusion that improving mass accuracy is not the solution for automatic assignments of elemental compositions. In contrast, applying isotope pattern recognition greatly reduces the search space for possible elemental compositions. Today, TOF mass spectrometers are available that specify 2% absolute isotope abundance accuracy at 3 ppm mass accuracy level with a resolving power of 10,000 <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Table <tblr tid="T3">3</tblr> demonstrates that such instruments may clearly outperform the 5-fold more expensive ion cyclotron resonance mass spectrometers with respect to calculation of molecular formulae. Up to 400 u, unique solutions are achieved and between 400&#8211;800 u only 2&#8211;13 possible elemental compositions are reported. A direct comparison of the list of retrieved hits at the 3 ppm level with and without exploiting the isotope abundance information confirms that applying such an orthogonal filter above 500 u removes always more than 95% of the potential formulae. It has been argued that the chemical intuition and experience of analytical chemists would sort out unlikely chemical compositions; however, such routines cannot be implemented into query algorithms and are hard to conceive even at the 1 ppm level, when hundreds of possible hits are returned at searches between 700&#8211;900 Da, the mass range of membrane lipids. The principal idea of using a combined analysis of mass spectra and isotopic distributions is known since several decades <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B31">31</abbr><abbr bid="B34">34</abbr></abbrgrp>. There is a further approach called MPPSIRD (mass peak profiling from selected ion recording data) <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> in which molecular formulas with non matching ion abundances are excluded. Another approach was suggested to use isotopic pattern and "virtually" enhance the resolving power of a magnetic sector instrument from 30,000 to 90,000 or that of an FT-MS from 500,000 to 1,500,000 <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. It has also been argued that complementary information may be garnered from mass spectral fragmentation, sometimes including accurate mass data in an intelligent basket method <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. However such an approach is not universally applicable, and even more importantly, the interdependency of accurate mass and accurate isotope analysis for automated calculation of elemental compositions has not yet been demonstrated on a comprehensive data set of chemically possible formulae.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Number of possible molecular formulas at different levels of mass accuracy and the impact of isotopic abundance accuracy. A mass spectrometer capable of 3 ppm but with 2% correct isotopic pattern outperforms even a (non-existing) mass spectrometer with 0.1 ppm mass accuracy! The results are computed for randomly selected targets, so single results vary but the trend remains. LEWIS and SENIOR check was applied. Candidates with unrelated high element counts were already excluded</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="5" ca="center">
                        <p>without isotope abundance information</p>
                     </c>
                     <c ca="center">
                        <p>2% isotopic abundance accuracy</p>
                     </c>
                     <c ca="center">
                        <p>5% isotopic abundance accuracy</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>molecular mass [Da]</p>
                     </c>
                     <c ca="center">
                        <p>10 ppm</p>
                     </c>
                     <c ca="center">
                        <p>5 ppm</p>
                     </c>
                     <c ca="center">
                        <p>3 ppm</p>
                     </c>
                     <c ca="center">
                        <p>1 ppm</p>
                     </c>
                     <c ca="center">
                        <p>0.1 ppm</p>
                     </c>
                     <c ca="center">
                        <p>3 ppm</p>
                     </c>
                     <c ca="center">
                        <p>5 ppm</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>150</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>200</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>300</p>
                     </c>
                     <c ca="center">
                        <p>24</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>400</p>
                     </c>
                     <c ca="center">
                        <p>78</p>
                     </c>
                     <c ca="center">
                        <p>37</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>500</p>
                     </c>
                     <c ca="center">
                        <p>266</p>
                     </c>
                     <c ca="center">
                        <p>115</p>
                     </c>
                     <c ca="center">
                        <p>64</p>
                     </c>
                     <c ca="center">
                        <p>21</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>600</p>
                     </c>
                     <c ca="center">
                        <p>505</p>
                     </c>
                     <c ca="center">
                        <p>257</p>
                     </c>
                     <c ca="center">
                        <p>155</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>36</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>700</p>
                     </c>
                     <c ca="center">
                        <p>1046</p>
                     </c>
                     <c ca="center">
                        <p>538</p>
                     </c>
                     <c ca="center">
                        <p>321</p>
                     </c>
                     <c ca="center">
                        <p>108</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>97</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>800</p>
                     </c>
                     <c ca="center">
                        <p>1964</p>
                     </c>
                     <c ca="center">
                        <p>973</p>
                     </c>
                     <c ca="center">
                        <p>599</p>
                     </c>
                     <c ca="center">
                        <p>200</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>111</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>900</p>
                     </c>
                     <c ca="center">
                        <p>3447</p>
                     </c>
                     <c ca="center">
                        <p>1712</p>
                     </c>
                     <c ca="center">
                        <p>1045</p>
                     </c>
                     <c ca="center">
                        <p>345</p>
                     </c>
                     <c ca="center">
                        <p>32</p>
                     </c>
                     <c ca="center">
                        <p>18</p>
                     </c>
                     <c ca="center">
                        <p>196</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>A further example supports this notion of a high impact of an orthogonal isotopic pattern filter. Actual measurement data were taken from analysis of trimethylsilylated (TMS) sorbitol, which was calculated as a pseudomolecular ion with a mass/charge 615.324 u at 5 ppm error under chemical ionization using a gas chromatography &#8211; time of flight mass spectrometer (GC-TOF, <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>). In Figure <figr fid="F5">5</figr> all 370 possible elemental compositions are plotted that are calculated from this mass including elements C, H, N, O, S, Si and P using MWTWIN with smart hydrogen option, without a restriction on the number of elements. When LEWIS and SENIOR checks were applied together with a 5% isotope abundance error, 12 result possible elemental compositions were obtained. In comparison, at 1 ppm mass accuracy still 56 formulae were calculated without orthogonal filter applied. For trimethylsilylated compounds in GC-TOF analysis, actually further constraints can be applied. After subtraction of elements counting for the trimethylsilyl group, the correct formula of the non-derivatized molecule is obtained.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>The isotopic abundances of the M+1 and M+2 ions can be used to filter molecular formula candidates</p>
               </caption>
               <text>
                  <p><b>The isotopic abundances of the M+1 and M+2 ions can be used to filter molecular formula candidates</b>. This example shows isotopic abundance pattern for silylated sorbitol. The red circle shows a 5% region with the correct target. All other formulae can be excluded if the mass spectrometer has a 5% error (RMS) on isotopic abundances.</p>
               </text>
               <graphic file="1471-2105-7-234-5"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Based on exhaustive generation of 1.6 million molecular formulas it has been shown that high mass accuracy (1 ppm) and high resolving power alone is not sufficient for obtaining a low numbers of molecular formulas for further structure elucidation. This is especially true for molecular masses above 300 Da containing the most common elements C, H, N, S, O and P. Only an orthogonal isotopic abundance pattern filter was able to strongly reduce the number of molecular formula candidates. This of course requires mass spectrometers with a very low error for isotopic abundance distributions (RMS 1&#8211;5%). A mass spectrometer capable of 3 ppm mass accuracy but 2% isotopic pattern accuracy usually removes more than 95% of false candidates and outperforms even a (non-existing) mass spectrometer capable of 0.1 ppm mass accuracy but no isotopic pattern accuracy. Mass spectrometry producers should be enforced to provide the isotopic abundance errors in their documentation. Software producers should be enforced to use such an approach in their formula generation software for mass spectrometers.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Generation of molecular formulas</p>
            </st>
            <p>Exhaustive calculation of formulae from 20&#8211;500 u using C, H, N, O, S and P was performed using the Molecular Weight Calculator MWTWIN <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> on a 1.7 GHz Pentium M with 1 GByte RAM. Calculation time and data cleaning with Textpad <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> was about 24 h. As valence values and molecular masses for each of the elements are constant, the resulting patterns of these calculations are also applicable to higher mass ranges. It is feasible to calculate molecular formulas in much higher range using CHEFOEG <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. LEWIS and SENIOR rules were checked using self-written scripts in Visual Basic which were implemented into Statistica Dataminer v7 <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> and Microsoft Excel 2003. A demo version of Molgen 3.5 <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> was downloaded and used for the calculation of the number of structural isomers of some formulae given as examples.</p>
         </sec>
         <sec>
            <st>
               <p>Isotopic pattern filter</p>
            </st>
            <p>Isotopic pattern were calculated with a modified Mercury6 version <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. This version takes the molecular formula as input and writes the isotopic abundances with the according masses to a log file. It can process 1 million formulas in 3 hours on a Pentium M 1.7 GHz. The resulting formulae and isotopic patterns of a single example were transferred to an MS Excel sheet where a simple matching function was implemented. Isotopic abundances are normalized to 100. The root mean square error (RMS) of the isotopic abundances is given in percent. This Excel function adds the differences between the calculated and target intensities for each of the M+1, M+2 and M+3 peaks and matches the sum of these differences against the target intensities. Furthermore an MS Excel array formula was implemented to report the number of remaining formulae when manually entering the isotope abundance accuracy in percent (according to the mass spectrometer specifications).</p>
            <p>Mass spectrometry always reports charged species. For the correct use of the software, the neutral form of the molecule is required. In this case the charge of molecular ion can be removed and hydrogen is added or subtracted to retrieve the neutral form of the molecule (mass of proton and electron = 1.007825 u). Any other adduct must be removed in the same manner.</p>
            <p>In table <tblr tid="T3">3</tblr>, isotope abundance examples were taken from individual compounds that were randomly selected from 48,000 example formulae in the range of 150&#8211;900 u, each of which had to pass LEWIS and SENIOR checks and an inclusion of C and H out of the list of C, H, N, S, O and P. Accordingly, selection of another compound for each mass example would change the single result given in the 'isotope abundance accuracy' columns, but not the overall conclusions. For all cases, the MWTWIN smart H option was applied, excluding potential formulae with a high combination of elements (e.g. C<sub>26</sub>H<sub>4</sub>) <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> that are inexistent in metabolome compositions. A complete matrix containing all results for 10, 5, 3, 1 and 0.1 ppm and 20, 10, 5, 2 and 1% isotopic abundance accuracy for 150&#8211;900 ppm can be found at <url>http://fiehnlab.ucdavis.edu</url>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>Both authors contributed equally to the work.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Matthew Monroe for making MWTWIN <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> available as freeware. We thank Alan L. Rockwood (ARUP-Lab) and Steve Van Orden (Bruker Daltonics) for providing the Mercury6 source code for accurate isotopic pattern calculation for free. We thank Ernst Schumacher (University of Bern) for providing the chemical formula generator "CHEFOG" to the public domain.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Combining genomics, metabolome analysis, and biochemical modelling to understand metabolic networks</p>
            </title>
            <aug>
               <au>
                  <snm>Fiehn</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Comp Funct Genom</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <fpage>155</fpage>
            <lpage>168</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Process for the integrated extraction, identification and quantification of metabolites, proteins and RNA to reveal their co-regulation in biochemical networks</p>
            </title>
            <aug>
               <au>
                  <snm>Weckwerth</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Wenzel</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Fiehn</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Proteomics</source>
            <pubdate>2004</pubdate>
            <volume>4</volume>
            <fpage>78</fpage>
            <lpage>83</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14730673</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Metabolic networks unravel the effects of silent plant phenotypes</p>
            </title>
            <aug>
               <au>
                  <snm>Weckwerth</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Loureiro</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Wenzel</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Fiehn</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>7809</fpage>
            <lpage>7814</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419688</pubid>
                  <pubid idtype="pmpid" link="fulltext">15136733</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Chasing molecules that were never there: Misassigned natural products and the role of chemical synthesis in modern structure elucidation</p>
            </title>
            <aug>
               <au>
                  <snm>Nicolaou</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm/>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Angew Chem Int Ed</source>
            <pubdate>2005</pubdate>
            <volume>44</volume>
            <fpage>1012</fpage>
            <lpage>1044</lpage>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Construction and application of a mass spectral and retention time index database generated from plant GC/EI-TOF-MS metabolite profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Wagner</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sefkow</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kopka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Phytochemistry</source>
            <pubdate>2003</pubdate>
            <volume>62</volume>
            <issue>6</issue>
            <fpage>887</fpage>
            <lpage>900</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12590116</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Total synthesis of (+/-)-pentacycloanammoxic acid</p>
            </title>
            <aug>
               <au>
                  <snm>Mascitti</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Corey</snm>
                  <fnm>EJ</fnm>
               </au>
            </aug>
            <source>J Am Chem Soc</source>
            <pubdate>2004</pubdate>
            <volume>126</volume>
            <fpage>15664</fpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15571387</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>ACD/MS Fragmenter</p>
            </title>
            <source>Mass spectral fragmentation analysis software</source>
            <url>http://www.acdlabs.com</url>
            <note>cited December 2005</note>
         </bibl>
         <bibl id="B8">
            <title>
               <p>MassFrontier</p>
            </title>
            <source>Mass spectral fragmentation analysis software</source>
            <url>http://www.highchem.com</url>
            <note>cited December 2005</note>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Atomic weights of the elements: Review 2000</p>
            </title>
            <aug>
               <au>
                  <snm>De Laeter</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>B&#246;hlke</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>De Bi&#232;vre</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hidaka</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Peiser</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Rosman</snm>
                  <fnm>KJR</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>PDP</fnm>
               </au>
            </aug>
            <source>Pure Appl Chem</source>
            <pubdate>2003</pubdate>
            <volume>75</volume>
            <issue>6</issue>
            <fpage>683</fpage>
            <lpage>800</lpage>
         </bibl>
         <bibl id="B10">
            <title>
               <p>An MS/MS library on an ion-trap instrument for efficient dereplication of natural products. Different fragmentation patterns for [M + H]+ and [M + Na]+ ions</p>
            </title>
            <aug>
               <au>
                  <snm>Fredenhagen</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Derrien</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gassmann</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>J Nat Prod</source>
            <pubdate>2005</pubdate>
            <volume>68</volume>
            <issue>3</issue>
            <fpage>385</fpage>
            <lpage>91</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15787441</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Debating resolution and mass accuracy</p>
            </title>
            <aug>
               <au>
                  <snm>Balogh</snm>
                  <fnm>MP</fnm>
               </au>
            </aug>
            <source>LC GC NORTH AMERICA</source>
            <pubdate>2004</pubdate>
            <volume>22</volume>
            <issue>2</issue>
            <fpage>118</fpage>
         </bibl>
         <bibl id="B12">
            <title>
               <p>A General approach to calculating isotopic distributions for mass spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Yergey</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Int J Mass Spectrom Ion Phys</source>
            <pubdate>1983</pubdate>
            <volume>52</volume>
            <fpage>337</fpage>
            <lpage>349</lpage>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Ultrahigh-Speed Calculation of Isotope Distributions</p>
            </title>
            <aug>
               <au>
                  <snm>Rockwood</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Van Orden</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1996</pubdate>
            <volume>68</volume>
            <fpage>2027</fpage>
            <lpage>2030</lpage>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Modern isotope ratio mass spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Platzner</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <publisher>John Wiley &amp; Sons</publisher>
            <pubdate>1997</pubdate>
            <note>ISBN 0-471-97416-1</note>
            <xrefbib>
               <pubid idtype="pmpid">9374955</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Diophantine approach to isotopic abundance calculations</p>
            </title>
            <aug>
               <au>
                  <snm>Hsu</snm>
                  <fnm>CS</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1984</pubdate>
            <volume>56</volume>
            <fpage>1356</fpage>
            <lpage>1361</lpage>
         </bibl>
         <bibl id="B16">
            <title>
               <p>How DENDRAL was conceived and born</p>
            </title>
            <aug>
               <au>
                  <snm>Lederberg</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>ACM Conference on the History of Medical Informatics, History of Medical Informatics archive</source>
            <pubdate>1987</pubdate>
            <fpage>5</fpage>
            <lpage>19</lpage>
            <url>http://doi.acm.org/10.1145/41526.41528</url>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Molecules in Silico: The generation of structural formulae and its applications</p>
            </title>
            <aug>
               <au>
                  <snm>Kerber</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Laue</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Meringer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ruecker</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Comput Chem Jpn</source>
            <pubdate>2004</pubdate>
            <volume>3</volume>
            <issue>3</issue>
            <fpage>85</fpage>
            <lpage>96</lpage>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Intercomparison study on accurate mass measurement of small molecules in mass spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Bristow</snm>
                  <fnm>AWT</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>KS</fnm>
               </au>
            </aug>
            <source>J Am Soc Mass Spectrom</source>
            <pubdate>2003</pubdate>
            <volume>14</volume>
            <issue>10</issue>
            <fpage>1086</fpage>
            <lpage>1098</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14530089</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Utility of three types of mass spectrometers for determining elemental compositions of ions formed from chromatographically separated compounds</p>
            </title>
            <aug>
               <au>
                  <snm>Grange</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Genicola</snm>
                  <fnm>FA</fnm>
               </au>
               <au>
                  <snm>Sovocool</snm>
                  <fnm>GW</fnm>
               </au>
            </aug>
            <source>Rapid Commun Mass Spectrom</source>
            <pubdate>2002</pubdate>
            <volume>16</volume>
            <fpage>2356</fpage>
            <xrefbib>
               <pubid idtype="pmpid">12478582</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>MOLGEN-MS: Evaluation of low resolution electron impact mass spectra with MS classification and exhaustive structure generation</p>
            </title>
            <aug>
               <au>
                  <snm>Kerber</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Laue</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Meringer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Varmuza</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Adv Mass Spectrom</source>
            <pubdate>2001</pubdate>
            <volume>15</volume>
            <fpage>939</fpage>
            <lpage>940</lpage>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Recent developments in automated structure elucidation of natural products</p>
            </title>
            <aug>
               <au>
                  <snm>Steinbeck</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nat Prod Rep</source>
            <pubdate>2004</pubdate>
            <volume>4</volume>
            <fpage>512</fpage>
            <lpage>8</lpage>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Comparison of the NCI open database with seven large chemical structural databases</p>
            </title>
            <aug>
               <au>
                  <snm>Voigt</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Bienfait</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nicklaus</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>J Chem Inf Comput Sci</source>
            <pubdate>2001</pubdate>
            <volume>41</volume>
            <fpage>702</fpage>
            <lpage>712</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11410049</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Molecular Weight Calculator</p>
            </title>
            <aug>
               <au>
                  <snm>Monroe</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>MWTWIN v6.35</source>
            <url>http://www.alchemistmatt.com/</url>
            <note>Link checked: December 2005</note>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Reduction of chemical formulas from the isotopic peak distributions of high-resolution mass spectra</p>
            </title>
            <aug>
               <au>
                  <snm>Roussis</snm>
                  <fnm>SG</fnm>
               </au>
               <au>
                  <snm>Proulx</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>2003</pubdate>
            <volume>75</volume>
            <issue>6</issue>
            <fpage>1470</fpage>
            <lpage>1482</lpage>
            <xrefbib>
               <pubid idtype="pmpid">12659212</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Partitions and their representative graphs</p>
            </title>
            <aug>
               <au>
                  <snm>Senior</snm>
                  <fnm>JK</fnm>
               </au>
            </aug>
            <source>Amer J Math</source>
            <pubdate>1951</pubdate>
            <volume>73</volume>
            <issue>3</issue>
            <fpage>663</fpage>
            <lpage>689</lpage>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Analogous odd-even parities in mathematics and chemistry</p>
            </title>
            <aug>
               <au>
                  <snm>Morikawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Newbol</snm>
                  <fnm>BT</fnm>
               </au>
            </aug>
            <source>Chemistry (Bulgarian Journal of Chemical Education)</source>
            <pubdate>2003</pubdate>
            <volume>12</volume>
            <issue>6</issue>
            <fpage>445</fpage>
            <lpage>450</lpage>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Stochastic generator of chemical structure. (1) Application to the structure elucidation of large molecules</p>
            </title>
            <aug>
               <au>
                  <snm>Faulon</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>J Chem Inf Comput Sci</source>
            <pubdate>1994</pubdate>
            <volume>34</volume>
            <fpage>1204</fpage>
            <lpage>1218</lpage>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Multistage accurate mass spectrometry: a "basket in a basket" approach for structure elucidation and its application to a compound from combinatorial synthesis</p>
            </title>
            <aug>
               <au>
                  <snm>Wu</snm>
                  <fnm>Q</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1998</pubdate>
            <volume>70</volume>
            <issue>5</issue>
            <fpage>865</fpage>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Mass spectrometry method for accurate mass determination of unknown ions</p>
            </title>
            <aug>
               <au>
                  <snm>K&#246;ster</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Bruker Daltonik GmbH (DE)</source>
            <pubdate>2001</pubdate>
            <note>US-Patent US6188064</note>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Chemical Formula Generator</p>
            </title>
            <aug>
               <au>
                  <snm>Schumacher</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>CHEFOG v1 1973, revised 1992</source>
            <url>http://www.chemsoft.ch/</url>
            <note>cited December 2005</note>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Isotopic patterns of fragment ions from dissociation of mass-selected ions</p>
            </title>
            <aug>
               <au>
                  <snm>Tou</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1983</pubdate>
            <volume>55</volume>
            <issue>2</issue>
            <fpage>367</fpage>
            <lpage>372</lpage>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Automated compatibility tests of the molecular formulas or structures of organic compounds with their mass spectra</p>
            </title>
            <aug>
               <au>
                  <snm>Seebass</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Pretsch</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>J Chem Inf Comput Sci</source>
            <pubdate>1999</pubdate>
            <volume>39</volume>
            <issue>4</issue>
            <fpage>713</fpage>
            <lpage>717</lpage>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Punched-card catalog of mass spectra useful in qualitative analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Zemany</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1950</pubdate>
            <volume>22</volume>
            <fpage>920</fpage>
            <lpage>2</lpage>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Qualitative analysis from mass spectra</p>
            </title>
            <aug>
               <au>
                  <snm>Rock</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1951</pubdate>
            <volume>23</volume>
            <fpage>261</fpage>
            <lpage>8</lpage>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Milestones in Fourier transform ion cyclotron resonance mass spectrometry technique development</p>
            </title>
            <aug>
               <au>
                  <snm>Marshall</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Int J Mass Spectrom</source>
            <pubdate>2000</pubdate>
            <volume>200</volume>
            <fpage>331</fpage>
            <lpage>356</lpage>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Exact molecular mass determination of polar plant metabolites using GCT with chemical ionization</p>
            </title>
            <aug>
               <au>
                  <snm>Fiehn</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Major</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Waters Application Library Number 720001260EN</source>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Mathematische Modelle f&#252;r die kombinatorische Chemie und die molekulare Strukturaufkl&#228;rung</p>
            </title>
            <aug>
               <au>
                  <snm>Meringer</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Doctoral Thesis University of Bayreuth Germany;</source>
            <pubdate>2004</pubdate>
            <url>http://www.mathe2.uni-bayreuth.de/markus/pdf/pub/dis/MathModKombChemMolStrukt.pdf</url>
            <note>ISBN 3-8325-0673-X; cited December 2005</note>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Accuracy Requirements for Peptide Characterization by Monoisotopic Molecular Mass Measurements</p>
            </title>
            <aug>
               <au>
                  <snm>Zubarev</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1996</pubdate>
            <volume>68</volume>
            <fpage>4060</fpage>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Simple Tools for the Computer-Aided Interpretation of Mass Spectra</p>
            </title>
            <aug>
               <au>
                  <snm>Heuerding</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Clerc</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Chemom Intel Lab Syst</source>
            <pubdate>1993</pubdate>
            <volume>20</volume>
            <fpage>57</fpage>
            <lpage>69</lpage>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Textpad</p>
            </title>
            <url>http://www.textpad.com</url>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Statistica Dataminer v7</p>
            </title>
            <url>http://www.statsoft.com</url>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Molgen 3.5</p>
            </title>
            <url>http://www.mathe2.uni-bayreuth.de/molgen4</url>
         </bibl>
      </refgrp>
   </bm>
</art>
