<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1752-153X-2-S1-P2</ui>
   <ji>1752-153X</ji>
   <fm>
      <dochead>Poster presentation</dochead>
      <bibl>
         <title>
            <p>Estimating the applicability domain of kernel based QSPR models using classical descriptor vectors</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Fechner</snm>
               <fnm>NH</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A2">
               <snm>Hinselmann</snm>
               <fnm>G</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A3">
               <snm>Schmiedl</snm>
               <fnm>C</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A4">
               <snm>Zell</snm>
               <fnm>Andreas</fnm>
               <insr iid="I1"/>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>University of Tuebingen, Sand 1, 72076 Tuebingen, Germany</p>
            </ins>
         </insg>
         <source>Chemistry Central Journal</source>
         <supplement>
            <title>
               <p>3rd German Conference on Chemoinformatics: 21. CIC-Workshop</p>
            </title>
            <note>Meeting abstracts - A single PDF containing all abstracts in this Supplement is available <a href="http://www.biomedcentral.com/content/files/pdf/1752-153X-2-S1-full.pdf">here</a>.</note>
            <url>http://www.biomedcentral.com/content/pdf/1752-153X-2-S1-info.pdf</url>
         </supplement>
         <conference>
            <title>
               <p>3rd German Conference on Chemoinformatics</p>
            </title>
            <location>Goslar, Germany</location>
            <date-range>11-13 November 2007</date-range>
            <url>http://www.gdch.de/gcc2007</url>
         </conference>
         <issn>1752-153X</issn>
         <pubdate>2008</pubdate>
         <volume>2</volume>
         <issue>Suppl 1</issue>
         <fpage>P2</fpage>
         <url>http://www.journal.chemistrycentral.com/content/2/S1/P2</url>
         <xrefbib>
            <pubid idtype="doi">10.1186/1752-153X-2-S1-P2</pubid>
         </xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>26</day>
               <month>03</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Fechner et al.</collab>
      </cpyrt>
   </fm>
   <bdy>
      <sec>
         <st>
            <p/>
         </st>
         <p>The propagation of machine learning based property prediction methods (e.g. QSAR, QSPR,.&#8230;) has lead to the question of the reliability of the prediction. This leads to the development of methods enabling the estimation of the reliability of a model based prediction.</p>
         <p>There are two principal approaches in dealing with this demand: estimating the expected derivation from the prediction (e.g. <it>gaussian processes</it>) or classifying each compound whether the model is specified for it or not. The last approach has become known as estimating the <it>applicability domain </it><abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp> of a model. One drawback of the different AD estimation methods is that most of them are based on the spatial embedding of the training dataset in the descriptor space. Thus these algorithms are not directly suited in modelling the <it>applicability domain</it> of kernel-based predictors, which are working in a extremely high dimensional implicit feature space.</p>
         <p>In this study we examined to what extent a standard descriptor based AD model can be used to describe the <it>applicability domain</it> of an <it>optimal assignment kernel </it><abbrgrp><abbr bid="B3">3</abbr></abbrgrp> based predictor. We split the popular Huuskonen <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> logS dataset 2:1 in a training and a test set and compared some standard AD methods <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp> (range-based, convex hull, leverage,&#8230;) regarding the correlation of the estimated AD with the test error. The results indicate that it is possible to estimate the <it>applicability domain</it> of a kernel based model using classical descriptor encodings of the molecules. Furthermore the results show that there are significant differences between the different methods. In our application the geometrical convex hull approach was superior.</p>
      </sec>
   </bdy>
   <bm>
      <refgrp>
         <bibl id="B1">
            <aug>
               <au>
                  <snm>Jaworska</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nikolova-Jeliazkova</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Aldenberg</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>ATLA</source>
            <pubdate>2005</pubdate>
            <fpage>445</fpage>
            <lpage>459</lpage>
            <xrefbib>
               <pubid idtype="pmpid">16268757</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <aug>
               <au>
                  <snm>Netzeva</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Worth</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Aldenberg</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>ATLA</source>
            <pubdate>2005</pubdate>
            <fpage>1</fpage>
            <lpage>19</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15804212</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <aug>
               <au>
                  <snm>Fr&#246;hlich</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wegner</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Zell</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proc Int Joint Conf Neur Net (IJCNN)</source>
            <pubdate>2005</pubdate>
            <fpage>913</fpage>
            <lpage>918</lpage>
         </bibl>
         <bibl id="B4">
            <aug>
               <au>
                  <snm>Huuskonen</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Chem Inf Mod</source>
            <pubdate>2000</pubdate>
            <fpage>773</fpage>
            <lpage>777</lpage>
         </bibl>
      </refgrp>
   </bm>
</art>
