<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1752-153X-2-S1-P14</ui>
   <ji>1752-153X</ji>
   <fm>
      <dochead>Poster presentation</dochead>
      <bibl>
         <title>
            <p>Comparison of applicability domains of QSAR models: application to the modelling of the environmental toxicity against <it>Tetrahymena pyriformis</it></p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Tetko</snm>
               <mi>V</mi>
               <fnm>Igor</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A2">
               <snm>Tropsha</snm>
               <fnm>Alexander</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A3">
               <snm>Zhu</snm>
               <fnm>H</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A4">
               <snm>Papa</snm>
               <fnm>E</fnm>
               <insr iid="I3"/>
            </au>
            <au id="A5">
               <snm>Gramatica</snm>
               <fnm>P</fnm>
               <insr iid="I3"/>
            </au>
            <au id="A6">
               <snm>&#214;berg</snm>
               <fnm>T</fnm>
               <insr iid="I4"/>
            </au>
            <au id="A7">
               <snm>Fourches</snm>
               <fnm>D</fnm>
               <insr iid="I5"/>
            </au>
            <au id="A8">
               <snm>Varnek</snm>
               <fnm>A</fnm>
               <insr iid="I5"/>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>GSF - National Research Centre for Environment and Health, Institute for Bioinformatics, Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany</p>
            </ins>
            <ins id="I2">
               <p>UNC School of Pharmacy, Medicinal Chemistry and Natural Products, Beard Hall, Room 327A, Chapel Hill, NC 27599-7360, USA</p>
            </ins>
            <ins id="I3">
               <p>Varese, Italy</p>
            </ins>
            <ins id="I4">
               <p>Kalmar, Sweden</p>
            </ins>
            <ins id="I5">
               <p>Universit&#233; Louis Pasteur de Strasbourg, Institut de Chimie, 4, rue B. Pascal, F-67000 Strasbourg, France</p>
            </ins>
         </insg>
         <source>Chemistry Central Journal</source>
         <supplement>
            <title>
               <p>3rd German Conference on Chemoinformatics: 21. CIC-Workshop</p>
            </title>
            <note>Meeting abstracts - A single PDF containing all abstracts in this Supplement is available <a href="http://www.biomedcentral.com/content/files/pdf/1752-153X-2-S1-full.pdf">here</a>.</note>
            <url>http://www.biomedcentral.com/content/pdf/1752-153X-2-S1-info.pdf</url>
         </supplement>
         <conference>
            <title>
               <p>3rd German Conference on Chemoinformatics</p>
            </title>
            <location>Goslar, Germany</location>
            <date-range>11-13 November 2007</date-range>
            <url>http://www.gdch.de/gcc2007</url>
         </conference>
         <issn>1752-153X</issn>
         <pubdate>2008</pubdate>
         <volume>2</volume>
         <issue>Suppl 1</issue>
         <fpage>P14</fpage>
         <url>http://www.journal.chemistrycentral.com/content/2/S1/P14</url>
         <xrefbib>
            <pubid idtype="doi">10.1186/1752-153X-2-S1-P14</pubid>
         </xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>26</day>
               <month>03</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Tetko et al.</collab>
      </cpyrt>
   </fm>
   <bdy>
      <sec>
         <st>
            <p/>
         </st>
         <p>The estimation of the applicability domains and the accuracy of predictions are the critical problems in QSAR modelling. In this presentation we propose a new approach to compare different estimates of applicability domains based on &#8220;distances to models&#8221; as well as their significance. The &#8220;distance to model&#8221; is a measure of similarity between training and test sets compounds, which could be expressed in many different ways (e.g., Tanimoto coefficient, Euclidian distance, leverage, etc.). The main idea of our analysis is that the errors can be described by a mixture of Gaussian distributions rather than a single Gaussian distribution. Here, we analyse 12 QSAR models for aqueous toxicity against <it>Tetrahymena pyriformis</it> obtained with different machine-learning methods and various types of descriptors. The models were obtained and tested for a dataset of 1093 compounds measured in the same laboratory <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. The dataset has been divided into training and test sets using a diversity sampling algorithm. The majority of examined distances to models provided significantly lower statistical scores (bootstrap test, <it>p</it>&lt;0.05) for description of errors when using a mixture of Gaussian distributions. The use of the mixture distributions fitted to the training set affords a significantly better assessment of the test set errors as compared to the assumption of identical distribution of errors in training and test sets.</p>
      </sec>
   </bdy>
   <bm>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Development and evaluation of qsars for ecotoxic endpoints: The benzene response-surface model for Tetrahymena toxicity</p>
            </title>
            <aug>
               <au>
                  <snm>Schultz</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Netzeva</snm>
                  <fnm>TI</fnm>
               </au>
            </aug>
            <source>Modeling environmental fate and toxicity</source>
            <publisher>Boca Raton, FL, USA</publisher>
            <editor>M. T. D. Cronin and D. J. Livingstone</editor>
            <pubdate>2004</pubdate>
            <fpage>265</fpage>
            <lpage>284</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Assessing applicability domains of toxicological QSARs: Definition, confidence in predicted values, and the role of mechanisms of action</p>
            </title>
            <aug>
               <au>
                  <snm>Schultz</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Hewitt</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Netzeva</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Cronin</snm>
                  <fnm>MTD</fnm>
               </au>
            </aug>
            <source>QSAR Comb Sci</source>
            <pubdate>2007</pubdate>
            <volume>26</volume>
            <fpage>238</fpage>
            <lpage>254</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/qsar.200630020</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Evaluation of QSARs for ecotoxicity: A method for assigning quality and confidence</p>
            </title>
            <aug>
               <au>
                  <snm>Schultz</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Netzeva</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Cronin</snm>
                  <fnm>MT</fnm>
               </au>
            </aug>
            <source>SAR QSAR Environ Res</source>
            <pubdate>2004</pubdate>
            <volume>15</volume>
            <fpage>385</fpage>
            <lpage>397</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10629360412331297344</pubid>
                  <pubid idtype="pmpid">15669697</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Chemistry-toxicity relationships for the effects of di- and trihydroxybenzenes to tetrahymena pyriformis</p>
            </title>
            <aug>
               <au>
                  <snm>Aptula</snm>
                  <fnm>AO</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Cronin</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Schultz</snm>
                  <fnm>TW</fnm>
               </au>
            </aug>
            <source>Chem Res Toxicol</source>
            <pubdate>2005</pubdate>
            <volume>18</volume>
            <fpage>844</fpage>
            <lpage>854</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/tx049666n</pubid>
                  <pubid idtype="pmpid" link="fulltext">15892578</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
