<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-7-364</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>Improving the quality of protein structure models by selecting from alignment alternatives</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Sommer</snm>
               <fnm>Ingolf</fnm>
               <insr iid="I1"/>
               <email>sommer@mpi-sb.mpg.de</email>
            </au>
            <au id="A2">
               <snm>Toppo</snm>
               <fnm>Stefano</fnm>
               <insr iid="I2"/>
               <email>stefano@cribi.unipd.it</email>
            </au>
            <au id="A3">
               <snm>Sander</snm>
               <fnm>Oliver</fnm>
               <insr iid="I1"/>
               <email>osander@mpi-sb.mpg.de</email>
            </au>
            <au id="A4">
               <snm>Lengauer</snm>
               <fnm>Thomas</fnm>
               <insr iid="I1"/>
               <email>lengauer@mpi-sb.mpg.de</email>
            </au>
            <au id="A5">
               <snm>Tosatto</snm>
               <mi>CE</mi>
               <fnm>Silvio</fnm>
               <insr iid="I3"/>
               <email>silvio@cribi.unipd.it</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Computational Biology and Applied Algorithmics, Max-Planck-lnstitute for Informatics, Stuhlsatzenhausweg 85, D-66123 Saarbr&#252;cken, Germany</p>
            </ins>
            <ins id="I2">
               <p>Department of Biological Chemistry, University of Padova, via U. Bassi 58/b, 1-35121 Padova, Italy</p>
            </ins>
            <ins id="I3">
               <p>Department of Biology and CRIBI Biotechnology Centre University of Padova, V.le G. Colombo 3, I-35131 Padova, Italy</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>364</fpage>
         <url>http://www.biomedcentral.com/1471-2105/7/364</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16872519</pubid>
               <pubid idtype="doi">10.1186/1471-2105-7-364</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>25</day>
               <month>1</month>
               <year>2006</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>27</day>
               <month>7</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>27</day>
               <month>7</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Sommer et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>In the area of protein structure prediction, recently a lot of effort has gone into the development of Model Quality Assessment Programs (MQAPs). MQAPs distinguish high quality protein structure models from inferior models. Here, we propose a new method to use an MQAP to improve the quality of models. With a given target sequence and template structure, we construct a number of different alignments and corresponding models for the sequence. The quality of these models is scored with an MQAP and used to choose the most promising model. An SVM-based selection scheme is suggested for combining MQAP partial potentials, in order to optimize for improved model selection.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>The approach has been tested on a representative set of proteins. The ability of the method to improve models was validated by comparing the MQAP-selected structures to the native structures with the model quality evaluation program <it>TM</it>-score. Using the SVM-based model selection, a significant increase in model quality is obtained (as shown with a Wilcoxon signed rank test yielding p-values below 10<sup>-15</sup>). The average increase in <it>TM</it>score is 0.016, the maximum observed increase in <it>TM</it>-score is 0.29.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>In template-based protein structure prediction alignment is known to be a bottleneck limiting the overall model quality. Here we show that a combination of systematic alignment variation and modern model scoring functions can significantly improve the quality of alignment-based models.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>1 Background</p>
         </st>
         <p>Protein structure prediction by comparative modeling and/or fold recognition consists of three largely independent steps: (1) Postulating the structural similarity of the target protein sequence with a known template structure on the basis of a significant alignment score between the two protein sequences. (2) This or a different alignment serves as a basis for model construction. In this process residues in the target sequence that are aligned to residues in the template structure are mapped on the corresponding coordinates in the structure. (3) Finally, unmapped regions are filled in, breaks in the backbone are mended, and the overall model is refined.</p>
         <p>Thus the quality of the alignment in the second step has an essential impact on the quality of the resulting model. The continual benchmarks in the biannual CASP assessment of protein structure prediction methods witness that there is significant progress in identifying suitable templates <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, due in part to the introduction of profile-profile alignment methods <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp> and the sophisticated construction of profiles <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. While CASP assessors found little improvement in the predicted models <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, they found steady progress in alignment quality over the years <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
         <p>The optimal alignment resulting from an algorithm with a specific optimized parameter setting is not always the best choice for model creation. Jaroszewski et al. have set up a computational experiment in which they sample a huge conformational space (size up to 10<sup>10</sup>) of alternative alignments by combining an approach of varying parameters (such as gap penalties and substitution matrices) with an iterative approach of penalizing previously visited regions of the sample space <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. The study states that there exist alignments surpassing the original alignments in quality for about 50% of the protein pairs. Contreras-Moreira and coworkers <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> as well as John and Sali <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> propose genetic algorithms for constructing a large number of alternative alignments by recombining an initial set of alignments. A common problem of these approaches is the selection of the alignment allowing for the construction of the final model.</p>
         <p>Recently, a lot of effort has gone into the development of Model Quality Assessment Programs (MQAPs) <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. MQAPs are computer programs that receive as input a 3D model of a protein structure and produce as output a real number representing the quality of the model <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. We will refer to this number as the model score. In contrast to model evaluation programs, like GDT <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, MaxSub <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, or TM-score <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, which assess the quality of the model by comparing it to the native structure, MQAPs do not compare to the native structure. Instead, they estimate the quality of a proposed model without knowledge of the native structure. Unlike scoring functions in sequence-to-structure alignment and to physical energy functions, MQAPs operate on an intermediate level &#8211; they are more flexible than a sequence-to-structure alignment function as the dynamic programming paradigm used in alignment computation imposes the requirement of prefix optimality which is not required in MQAPs. MQAPs aim at scoring the quality of predicted models. Typically, MQAPs use one or more different statistical potentials, representing information coded in protein structures <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. Different MQAPs were recently tested in CAFASP-4 as meta-selectors for pinpointing high quality models from the ensemble of models proposed by different automated servers <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B13">13</abbr><abbr bid="B21">21</abbr></abbrgrp> proving that MQAPs are highly effective selectors.</p>
      </sec>
      <sec>
         <st>
            <p>2 Results</p>
         </st>
         <sec>
            <st>
               <p>2.1 Overview of protocol and evaluation</p>
            </st>
            <p>In this manuscript we propose and validate a protocol for improving alignments in step (2) of comparative modeling or fold recognition. Optimization is achieved by generating alternative alignment-based models for a target sequence and selecting the most promising model using an MQAP.</p>
            <p>Ensembles of alternative alignments are generated with the state-of-the-art profile-profile alignment method Arby <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp> by varying parameters. Apart from the Arby default, we suggest two different procedures for generating alternative alignments: <it>PVS </it>varies the parameters in the profile-profile alignment method slightly, whereas <it>PVH </it>varies the parameters heavily. Each procedure reports an ensemble of distinct alignments. For each alignment a model is constructed (see Methods for details, as well as Table <tblr tid="T1">1</tblr> for an overview of the parameters used in <it>PVS </it>and <it>PVH</it>).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>The parameters used in the different model generating procedures.</p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>Model Generating Procedure</p>
                     </c>
                     <c ca="left">
                        <p>Description</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Arby default</p>
                     </c>
                     <c ca="left">
                        <p>Generates the default Arby models [23], exactly one for each target. Default parameters are: gap insertion 14.7, gap extension 0.37, substitution matrix Blosum62, Henikoff position-specific sequence weighting, relative weight of secondary structure is 0.24.</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PVS</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Generates models resulting from alignments with parameters multiplied by a factor varying from 0.5 to 1.5 with a step width of 0.25. This results in an ensemble of alternative models for each target.</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PVH</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Generates models resulting from alignments with parameters multiplied by a factor from 0.2 to 2.4 with a step width of 0.2. This results in an ensemble of alternative models for each target.</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>The ensembles of alternative models typically contain models with higher quality as well as models with lower quality than the standard Arby model. The FRST <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> MQAP program is applied to scoring the quality of the models. By choosing the model with the best model score according to the FRST potential, we can select a promising model for each target. These selected models are potentially improved with respect to the Arby default model. Additionally, we developed an SVM-based selection mechanism. A support vector machine (SVM) is trained on the model scores and on the FRST partial potentials for recognizing the models with increased quality.</p>
            <p>The performance of the protocol is evaluated by comparing the chosen models to the previously withheld native structures. The comparison is performed with the model evaluation program <it>TM</it>-score <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, its score reflecting the "real" quality of the models. The <it>TM</it>-score always lies in the interval (0,1], where the upper limit stands for a model perfectly superposable with the structure. This allows for comparing the quality of the generated and selected models with the quality of the default Arby models and for assessing the significance of the selection process.</p>
            <p>The protocol was evaluated on a set of 1612 target sequences with known structures (see Methods). For each target <it>t </it>we computed the Arby default model <it>d</it>(<it>t</it>) and exercised the two model generation procedures <it>PVS </it>and <it>PVH </it>resulting in two ensembles of models E<sub><it>PVS</it></sub>(<it>t</it>) and E<sub><it>PVH</it></sub>(<it>t</it>) per target. Summary statistics of the number of models per target are given in Table <tblr tid="T2">2</tblr>.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Summary statistics for the model generation procedures.</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Model Generation Procedure</p>
                     </c>
                     <c cspan="4" ca="center">
                        <p>Number of Models per Target <it>t</it></p>
                     </c>
                     <c ca="center">
                        <p>Overall Number of Models (&#8721;<sub><it>t</it></sub>|<it>E</it>(<it>t</it>)|)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>min</p>
                     </c>
                     <c ca="center">
                        <p>median</p>
                     </c>
                     <c ca="center">
                        <p>mean</p>
                     </c>
                     <c ca="center">
                        <p>max</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Arby default</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>1.0</p>
                     </c>
                     <c ca="right">
                        <p>1.0</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>1612</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PVS</it>
                        </p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>3.0</p>
                     </c>
                     <c ca="right">
                        <p>4.3</p>
                     </c>
                     <c ca="right">
                        <p>28</p>
                     </c>
                     <c ca="right">
                        <p>6980</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PVH</it>
                        </p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>7.0</p>
                     </c>
                     <c ca="right">
                        <p>12.6</p>
                     </c>
                     <c ca="right">
                        <p>136</p>
                     </c>
                     <c ca="right">
                        <p>20234</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>2.2 Evaluation of model generation: quality of generated models</p>
            </st>
            <p>First, we analyze the quality of the model generation procedures. The key ideas are to count per target the number of models with increased quality, and to measure the average difference of model quality with respect to the default model in terms of <it>TM</it>-score.</p>
            <sec>
               <st>
                  <p>2.2.1 Analysis per target</p>
               </st>
               <p>For a target <it>t</it>, we denote the quality of a model <it>ml </it>by <it>TM</it>(<it>ml</it>), where greater <it>TM</it>-score is better. The relative frequency of models per target with a quality measure above the Arby default is defined as</p>
               <p>
                  <m:math name="1471-2105-7-364-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>f</m:mi>
                           <m:mi>p</m:mi>
                           <m:msub>
                              <m:mi>t</m:mi>
                              <m:mrow>
                                 <m:mi>E</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:mo>></m:mo>
                              </m:mrow>
                           </m:msub>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>t</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mn>1</m:mn>
                              <m:mrow>
                                 <m:mo>|</m:mo>
                                 <m:mi>E</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>t</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>|</m:mo>
                              </m:mrow>
                           </m:mfrac>
                           <m:mstyle displaystyle="true">
                              <m:munder>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mi>m</m:mi>
                                    <m:mi>l</m:mi>
                                    <m:mo>&#8712;</m:mo>
                                    <m:mi>E</m:mi>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mi>t</m:mi>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                              </m:munder>
                              <m:mrow>
                                 <m:mo stretchy="false">[</m:mo>
                                 <m:mi>T</m:mi>
                                 <m:mi>M</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>m</m:mi>
                                 <m:mi>l</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>></m:mo>
                                 <m:mi>T</m:mi>
                                 <m:mi>M</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>d</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>t</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo stretchy="false">]</m:mo>
                                 <m:mo>,</m:mo>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGMbGzcqWGWbaCcqWG0baDdaWgaaWcbaGaemyrauKaeiilaWIaeyOpa4dabeaakiabcIcaOiabdsha0jabcMcaPiabg2da9maalaaabaGaeGymaedabaGaeiiFaWNaemyrauKaeiikaGIaemiDaqNaeiykaKIaeiiFaWhaamaaqafabaGaei4waSLaemivaqLaemyta0KaeiikaGIaemyBa0MaemiBaWMaeiykaKIaeyOpa4JaemivaqLaemyta0KaeiikaGIaemizaqMaeiikaGIaemiDaqNaeiykaKIaeiykaKIaeiyxa0LaeiilaWcaleaacqWGTbqBcqWGSbaBcqGHiiIZcqWGfbqrcqGGOaakcqWG0baDcqGGPaqkaeqaniabggHiLdaaaa@5EB8@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>where <it>d</it>(<it>t</it>) is the default Arby model, <it>E</it>(<it>t</it>) is an ensemble of models for the target, and [<it>x</it>] is the Iverson bracket defined for arbitrary propositions <it>x </it>as</p>
               <p>
                  <m:math name="1471-2105-7-364-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mo stretchy="false">[</m:mo>
                           <m:mi>x</m:mi>
                           <m:mo stretchy="false">]</m:mo>
                           <m:mo>=</m:mo>
                           <m:mrow>
                              <m:mo>{</m:mo>
                              <m:mrow>
                                 <m:mtable>
                                    <m:mtr>
                                       <m:mtd>
                                          <m:mn>1</m:mn>
                                       </m:mtd>
                                       <m:mtd>
                                          <m:mrow>
                                             <m:mtext>if&#160;</m:mtext>
                                             <m:mi>x</m:mi>
                                             <m:mtext>&#160;is&#160;true</m:mtext>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr>
                                       <m:mtd>
                                          <m:mn>0</m:mn>
                                       </m:mtd>
                                       <m:mtd>
                                          <m:mrow>
                                             <m:mtext>else</m:mtext>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                                 <m:mo>.</m:mo>
                              </m:mrow>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqGGBbWwcqWG4baEcqGGDbqxcqGH9aqpdaGabaqaauaabmqaciaaaeaacqaIXaqmaeaacqqGPbqAcqqGMbGzcqqGGaaicqWG4baEcqqGGaaicqqGPbqAcqqGZbWCcqqGGaaicqqG0baDcqqGYbGCcqqG1bqDcqqGLbqzaeaacqaIWaamaeaacqqGLbqzcqqGSbaBcqqGZbWCcqqGLbqzaaGaeiOla4cacaGL7baaaaa@49E3@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>Similarly, we consider the relative frequency <it>fpt</it><sub><it>E</it>,&lt;</sub>(<it>t</it>) of models with a quality below that of the Arby default models.</p>
               <p>The average within an ensemble <it>E</it>(<it>t</it>) of quality improvement of a model over the default Arby model is</p>
               <p>
                  <m:math name="1471-2105-7-364-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>q</m:mi>
                           <m:mi>i</m:mi>
                           <m:msub>
                              <m:mi>r</m:mi>
                              <m:mi>E</m:mi>
                           </m:msub>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>t</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mn>1</m:mn>
                              <m:mrow>
                                 <m:mo>|</m:mo>
                                 <m:mi>E</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>t</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>|</m:mo>
                              </m:mrow>
                           </m:mfrac>
                           <m:mstyle displaystyle="true">
                              <m:munder>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mi>m</m:mi>
                                    <m:mi>l</m:mi>
                                    <m:mo>&#8712;</m:mo>
                                    <m:mi>E</m:mi>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mi>t</m:mi>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                              </m:munder>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>T</m:mi>
                                 <m:mi>M</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>m</m:mi>
                                 <m:mi>l</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mi>T</m:mi>
                                 <m:mi>M</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>d</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>t</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                           </m:mstyle>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGXbqCcqWGPbqAcqWGYbGCdaWgaaWcbaGaemyraueabeaakiabcIcaOiabdsha0jabcMcaPiabg2da9maalaaabaGaeGymaedabaGaeiiFaWNaemyrauKaeiikaGIaemiDaqNaeiykaKIaeiiFaWhaamaaqafabaGaeiikaGIaemivaqLaemyta0KaeiikaGIaemyBa0MaemiBaWMaeiykaKIaeyOeI0IaemivaqLaemyta0KaeiikaGIaemizaqMaeiikaGIaemiDaqNaeiykaKIaeiykaKIaeiykaKcaleaacqWGTbqBcqWGSbaBcqGHiiIZcqWGfbqrcqGGOaakcqWG0baDcqGGPaqkaeqaniabggHiLdGccqGGUaGlaaa@5BF9@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>We define an indicator function whether a better model for a target <it>t </it>exists in the ensemble <it>E</it>(<it>t</it>).</p>
               <p><it>fb</it><sub><it>E</it></sub>(<it>t</it>) = [&#8707;<it>ml </it>&#8712; <it>E</it>(<it>t</it>) : <it>TM</it>(<it>ml</it>) > <it>TM</it>(<it>d</it>(<it>t</it>))]</p>
               <p>and compute the quality improvement that is theoretically possible</p>
               <p>
                  <m:math name="1471-2105-7-364-i4" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>q</m:mi>
                           <m:mi>i</m:mi>
                           <m:msub>
                              <m:mi>b</m:mi>
                              <m:mi>E</m:mi>
                           </m:msub>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>t</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:munder>
                                          <m:mrow>
                                             <m:mi>max</m:mi>
                                             <m:mo>&#8289;</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>m</m:mi>
                                             <m:mi>l</m:mi>
                                             <m:mo>&#8712;</m:mo>
                                             <m:mi>E</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>t</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                       </m:munder>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>T</m:mi>
                                       <m:mi>M</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>m</m:mi>
                                       <m:mi>l</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mi>T</m:mi>
                                       <m:mi>M</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>d</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>t</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGXbqCcqWGPbqAcqWGIbGydaWgaaWcbaGaemyraueabeaakiabcIcaOiabdsha0jabcMcaPiabg2da9uaabmqabiaaaeaadaWfqaqaaiGbc2gaTjabcggaHjabcIha4bWcbaGaemyBa0MaemiBaWMaeyicI4SaemyrauKaeiikaGIaemiDaqNaeiykaKcabeaaaOqaaiabdsfaujabd2eanjabcIcaOiabd2gaTjabdYgaSjabcMcaPiabgkHiTiabdsfaujabd2eanjabcIcaOiabdsgaKjabcIcaOiabdsha0jabcMcaPiabcMcaPaaacqGGUaGlaaa@543C@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
            </sec>
            <sec>
               <st>
                  <p>2.2.2 Performance over all targets</p>
               </st>
               <p>The frequency <it>fpt </it>was defined per target and its average over all targets is <m:math name="1471-2105-7-364-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>f</m:mi><m:mi>p</m:mi><m:mi>t</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mi>n</m:mi></m:mfrac><m:mstyle displaystyle="true"><m:mo>&#8721;</m:mo><m:mrow><m:mi>f</m:mi><m:mi>p</m:mi><m:mi>t</m:mi></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdchaWjabdsha0baacqGH9aqpdaWcaaqaaiabigdaXaqaaiabd6gaUbaadaaeabqaaiabdAgaMjabdchaWjabdsha0bWcbeqab0GaeyyeIuoaaaa@3A9A@</m:annotation></m:semantics></m:math>. While <it>fpt </it>describes the frequency of better models per target, <m:math name="1471-2105-7-364-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>f</m:mi><m:mi>b</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mi>n</m:mi></m:mfrac><m:mstyle displaystyle="true"><m:mo>&#8721;</m:mo><m:mrow><m:mi>f</m:mi><m:mi>b</m:mi></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdkgaIbaacqGH9aqpdaWcaaqaaiabigdaXaqaaiabd6gaUbaadaaeabqaaiabdAgaMjabdkgaIbWcbeqab0GaeyyeIuoaaaa@3780@</m:annotation></m:semantics></m:math> reflects the fraction of targets that have a model with a quality above the Arby default within the ensemble of constructed models.</p>
               <p>When selecting models randomly, an average quality improvement of <m:math name="1471-2105-7-364-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>r</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mi>n</m:mi></m:mfrac><m:mstyle displaystyle="true"><m:mo>&#8721;</m:mo><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>r</m:mi></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdghaXjabdMgaPjabdkhaYbaacqGH9aqpdaWcaaqaaiabigdaXaqaaiabd6gaUbaadaaeabqaaiabdghaXjabdMgaPjabdkhaYbWcbeqab0GaeyyeIuoaaaa@3AA2@</m:annotation></m:semantics></m:math> is obtained. When selecting models optimally, an average quality improvement of <m:math name="1471-2105-7-364-i8" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>b</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mi>n</m:mi></m:mfrac><m:mstyle displaystyle="true"><m:mo>&#8721;</m:mo><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>b</m:mi></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdghaXjabdMgaPjabdkgaIbaacqGH9aqpdaWcaaqaaiabigdaXaqaaiabd6gaUbaadaaeabqaaiabdghaXjabdMgaPjabdkgaIbWcbeqab0GaeyyeIuoaaaa@3A62@</m:annotation></m:semantics></m:math> is obtained, imposing a theoretical upper bound to what is feasible with MQAP selection on the alignments generated as proposed. For the two procedures <it>PVS </it>and <it>PVH </it>generating alignment-based models these numbers are listed in Table <tblr tid="T3">3</tblr>.</p>
               <tbl id="T3">
                  <title>
                     <p>Table 3</p>
                  </title>
                  <caption>
                     <p>How good are the generated models? Description of the distributions of the <it>TM</it>-score quality. <m:math name="1471-2105-7-364-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>f</m:mi><m:mi>p</m:mi><m:mi>t</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdchaWjabdsha0baaaaa@30EC@</m:annotation></m:semantics></m:math><sub>&lt;</sub>and <m:math name="1471-2105-7-364-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>f</m:mi><m:mi>p</m:mi><m:mi>t</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdchaWjabdsha0baaaaa@30EC@</m:annotation></m:semantics></m:math><sub>> </sub>are the relative frequencies of models per target with a <it>TM</it>-score below and above Arby default, respectively, <m:math name="1471-2105-7-364-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>r</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdghaXjabdMgaPjabdkhaYbaaaaa@30F0@</m:annotation></m:semantics></m:math> is the improvement in <it>TM</it>-score when choosing models randomly. <m:math name="1471-2105-7-364-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>f</m:mi><m:mi>b</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdkgaIbaaaaa@2F5F@</m:annotation></m:semantics></m:math> is the relative frequency of targets for which a better model exists, <m:math name="1471-2105-7-364-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>b</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdghaXjabdMgaPjabdkgaIbaaaaa@30D0@</m:annotation></m:semantics></m:math> is the best theoretically possible improvement for the given ensemble of models.</p>
                  </caption>
                  <tblbdy cols="6">
                     <r>
                        <c ca="left">
                           <p>Model Generation Procedure</p>
                        </c>
                        <c ca="right">
                           <p>
                              <m:math name="1471-2105-7-364-i9" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>f</m:mi>
                                             <m:mi>p</m:mi>
                                             <m:mi>t</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdchaWjabdsha0baaaaa@30EC@</m:annotation>
                                 </m:semantics>
                              </m:math>
                              <sub>&lt;</sub>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <m:math name="1471-2105-7-364-i9" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>f</m:mi>
                                             <m:mi>p</m:mi>
                                             <m:mi>t</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdchaWjabdsha0baaaaa@30EC@</m:annotation>
                                 </m:semantics>
                              </m:math>
                              <sub>></sub>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <m:math name="1471-2105-7-364-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>q</m:mi>
                                             <m:mi>i</m:mi>
                                             <m:mi>r</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdghaXjabdMgaPjabdkhaYbaaaaa@30F0@</m:annotation>
                                 </m:semantics>
                              </m:math>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <m:math name="1471-2105-7-364-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>f</m:mi>
                                             <m:mi>b</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdkgaIbaaaaa@2F5F@</m:annotation>
                                 </m:semantics>
                              </m:math>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <m:math name="1471-2105-7-364-i12" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>q</m:mi>
                                             <m:mi>i</m:mi>
                                             <m:mi>b</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdghaXjabdMgaPjabdkgaIbaaaaa@30D0@</m:annotation>
                                 </m:semantics>
                              </m:math>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="6">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>PVS</it>
                           </p>
                        </c>
                        <c ca="right">
                           <p>0.36</p>
                        </c>
                        <c ca="right">
                           <p>0.22</p>
                        </c>
                        <c ca="right">
                           <p>-0.013</p>
                        </c>
                        <c ca="right">
                           <p>0.47</p>
                        </c>
                        <c ca="right">
                           <p>0.019</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>PVH</it>
                           </p>
                        </c>
                        <c ca="right">
                           <p>0.51</p>
                        </c>
                        <c ca="right">
                           <p>0.26</p>
                        </c>
                        <c ca="right">
                           <p>-0.031</p>
                        </c>
                        <c ca="right">
                           <p>0.59</p>
                        </c>
                        <c ca="right">
                           <p>0.026</p>
                        </c>
                     </r>
                  </tblbdy>
               </tbl>
               <p>In order to visualize the distributions of model quality, in Figure <figr fid="F1">1</figr> for each target <it>t </it>the <it>TM</it>-score of the default Arby model <it>s </it>plotted versus the <it>TM</it>-score improvements of the models constructed for that target. The scatter plots in Figure <figr fid="F1">1</figr> along with Table <tblr tid="T3">3</tblr> clearly indicate that better models are generated for a large fraction of the targets. Summing up, the above-mentioned procedures generate models with a better quality than the Arby default models, but identification of the improved models among the generated models is a hard task as analyzed in the next section.</p>
               <fig id="F1">
                  <title>
                     <p>Figure 1</p>
                  </title>
                  <caption>
                     <p>Overview of model quality improvement with respect to the model difficulty</p>
                  </caption>
                  <text>
                     <p>Overview of model quality improvement with respect to the model difficulty. Left for PVS, right for PVH analogously. Each dot corresponds to a model where the x-coordinate is the <it>TM</it>-score of the corresponding default Arby model and the y-coordinate is the <it>TM</it>-score improvement with respect to this default model. Smoothed quantile lines are shown for the 10% (lower dashed), 50% (middle), 90% (upper dashed) quantiles of the models within a sliding window of size 0.15. Black lines represent all models, red lines represent the models selected using FRST, green lines represent the models selected using the SVM approach. For the smoothing evaluations are made at 1000 equidistant points and the resulting quantiles are smoothed with a lowess function (local linear scatter plot smoother). <b>Interpretation: </b>The <it>TM</it>-score of the Arby default gives an indication of how difficult it is to find the right template for a target. For the selection methods random, FRST, and SVM, this plot shows the potential improvement with respect to difficulty of the target. For PVH, more models are generated below default. For both PVS and PVH, the SVM selection performs better than FRST selection, and FRST performs better than random.</p>
                  </text>
                  <graphic file="1471-2105-7-364-1"/>
               </fig>
            </sec>
         </sec>
         <sec>
            <st>
               <p>2.3 Evaluation of model selection</p>
            </st>
            <p>In the following, we analyze how well the model selection procedure works on the models generated with procedures <it>PVS </it>and <it>PVH</it>. The key ideas are to count for how many targets an improved model is selected, and to measure the quality improvement with respect to the default model in terms of <it>TM</it>-score. We perform the analysis for the selection based on the FRST potential and then repeat it analogously for the SVM based selection.</p>
            <sec>
               <st>
                  <p>2.3.1 Analysis per target</p>
               </st>
               <p>Identification of the best model per target can be performed based on the FRST MQAP scores. For each target <it>t</it>, we select the model <it>s </it>with the lowest estimated <it>frst </it>energy</p>
               <p>
                  <m:math name="1471-2105-7-364-i13" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>s</m:mi>
                              <m:mrow>
                                 <m:mi>E</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:mi>f</m:mi>
                                 <m:mi>r</m:mi>
                                 <m:mi>s</m:mi>
                                 <m:mi>t</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>t</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:munder>
                                          <m:mrow>
                                             <m:mi>arg</m:mi>
                                             <m:mo>&#8289;</m:mo>
                                             <m:mi>min</m:mi>
                                             <m:mo>&#8289;</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>m</m:mi>
                                             <m:mi>l</m:mi>
                                             <m:mo>&#8712;</m:mo>
                                             <m:mi>E</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>t</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                       </m:munder>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>f</m:mi>
                                       <m:mi>r</m:mi>
                                       <m:mi>s</m:mi>
                                       <m:mi>t</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>m</m:mi>
                                       <m:mi>l</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGZbWCdaWgaaWcbaGaemyrauKaeiilaWIaemOzayMaemOCaiNaem4CamNaemiDaqhabeaakiabcIcaOiabdsha0jabcMcaPiabg2da9uaabmqabiaaaeaadaWfqaqaaiGbcggaHjabckhaYjabcEgaNjGbc2gaTjabcMgaPjabc6gaUbWcbaGaemyBa0MaemiBaWMaeyicI4SaemyrauKaeiikaGIaemiDaqNaeiykaKcabeaaaOqaaiabdAgaMjabdkhaYjabdohaZjabdsha0jabcIcaOiabd2gaTjabdYgaSjabcMcaPaaacqGGSaalaaa@5607@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>since lower <it>frst </it>is better. In the supplementary material (see additional file supplement) we analyze the FRST partial potentials in more detail.</p>
               <p>In order to count the number of occurrences in which this is an improvement of model quality measured in <it>TM</it>-score, we define the indicator functions <it>fim </it>as follows:</p>
               <p><it>fim</it><sub><it>E,frst</it>,></sub>(<it>t</it>) = [<it>TM</it>(<it>s</it><sub><it>E,frst</it></sub>(<it>t</it>)) > <it>TM</it>(<it>d</it>(<it>t</it>))]</p>
               <p><it>fim</it><sub><it>E,frst</it>,=</sub>(<it>t</it>) = [<it>TM</it>(<it>s</it><sub><it>E,frst</it></sub>(<it>t</it>)) = <it>TM</it>(<it>d</it>(<it>t</it>))]</p>
               <p><it>fim</it><sub><it>E,frst</it>,&lt;</sub>(<it>t</it>) = [<it>TM</it>(<it>s</it><sub><it>E,frst</it></sub>(<it>t</it>)) &lt;<it>TM</it>(<it>d</it>(<it>t</it>))]</p>
               <p>These functions indicate whether the model selected by the MQAP is of higher, equal, or lower quality than the Arby default.</p>
               <p>While <it>fim </it>serves to count the number of targets which improve, we use the measure <it>qim </it>to quantify the improvement of model quality with respect to the default Arby model:</p>
               <p><it>qim</it><sub><it>E,frst</it></sub>(<it>t</it>) = <it>TM</it>(<it>s</it><sub><it>E,frst</it></sub>(<it>t</it>)) - <it>TM</it>(<it>d</it>(<it>t</it>)).</p>
            </sec>
            <sec>
               <st>
                  <p>2.3.2 Performance over all targets</p>
               </st>
               <p>Across all targets, <m:math name="1471-2105-7-364-i14" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mrow><m:mover accent="true"><m:mrow><m:mi>f</m:mi><m:mi>i</m:mi><m:mi>m</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:mo>></m:mo></m:msub><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mi>n</m:mi></m:mfrac><m:mstyle displaystyle="true"><m:mo>&#8721;</m:mo><m:mrow><m:mi>f</m:mi><m:mi>i</m:mi><m:msub><m:mi>m</m:mi><m:mo>></m:mo></m:msub></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdMgaPjabd2gaTbaadaWgaaWcbaGaeyOpa4dabeaakiabg2da9maalaaabaGaeGymaedabaGaemOBa4gaamaaqaeabaGaemOzayMaemyAaKMaemyBa02aaSbaaSqaaiabg6da+aqabaaabeqab0GaeyyeIuoaaaa@3CC9@</m:annotation></m:semantics></m:math> is the fraction of targets whose models improve when choosing models using the FRST MQAP. We measure the average improvement in model quality as <m:math name="1471-2105-7-364-i15" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>m</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mi>n</m:mi></m:mfrac><m:mstyle displaystyle="true"><m:mo>&#8721;</m:mo><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>m</m:mi></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdghaXjabdMgaPjabd2gaTbaacqGH9aqpdaWcaaqaaiabigdaXaqaaiabd6gaUbaadaaeabqaaiabdghaXjabdMgaPjabd2gaTbWcbeqab0GaeyyeIuoaaaa@3A8E@</m:annotation></m:semantics></m:math>.</p>
               <p>A summary of the results when selecting models according to the <it>frst </it>potential is given in Table <tblr tid="T4">4</tblr>.</p>
               <tbl id="T4">
                  <title>
                     <p>Table 4</p>
                  </title>
                  <caption>
                     <p>How well does model selection work? Description of distributions when selecting models according to the FRST potentials and the SVM. <m:math name="1471-2105-7-364-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>n</m:mi><m:mi>i</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabd6gaUjabdMgaPbaaaaa@2F7D@</m:annotation></m:semantics></m:math> is the relative frequency of targets for which a selection procedure suggests improved models. <m:math name="1471-2105-7-364-i17" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>f</m:mi><m:mi>i</m:mi><m:mi>m</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdMgaPjabd2gaTbaaaaa@30D0@</m:annotation></m:semantics></m:math><sub>&lt;</sub>, <m:math name="1471-2105-7-364-i17" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>f</m:mi><m:mi>i</m:mi><m:mi>m</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdMgaPjabd2gaTbaaaaa@30D0@</m:annotation></m:semantics></m:math><sub>=</sub>, and <m:math name="1471-2105-7-364-i17" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>f</m:mi><m:mi>i</m:mi><m:mi>m</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdMgaPjabd2gaTbaaaaa@30D0@</m:annotation></m:semantics></m:math><sub>></sub>, are the relative frequencies of selected models with decreased, equal, or increased <it>TM</it>-score quality, respectively. min <it>qim </it>and max <it>qim </it>are the minimal and maximal quality improvements achieved per target. <m:math name="1471-2105-7-364-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>m</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdghaXjabdMgaPjabd2gaTbaaaaa@30E6@</m:annotation></m:semantics></m:math> is the average quality improvement over all targets. <it><ul>qim</ul></it>is the average quality improvement for the targets that the selection procedure suggests improved models for.</p>
                  </caption>
                  <tblbdy cols="9">
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c cspan="7" ca="center">
                           <p><it>Model Generation Procedure PVS</it>, all measures for <it>TM</it></p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>e</p>
                        </c>
                        <c ca="right">
                           <p>
                              <m:math name="1471-2105-7-364-i16" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>n</m:mi>
                                             <m:mi>i</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabd6gaUjabdMgaPbaaaaa@2F7D@</m:annotation>
                                 </m:semantics>
                              </m:math>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <graphic file="1471-2105-7-364-i17.gif"/>
                              <sub>&lt;</sub>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <m:math name="1471-2105-7-364-i17" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>f</m:mi>
                                             <m:mi>i</m:mi>
                                             <m:mi>m</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdMgaPjabd2gaTbaaaaa@30D0@</m:annotation>
                                 </m:semantics>
                              </m:math>
                              <sub>=</sub>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <m:math name="1471-2105-7-364-i17" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>f</m:mi>
                                             <m:mi>i</m:mi>
                                             <m:mi>m</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdMgaPjabd2gaTbaaaaa@30D0@</m:annotation>
                                 </m:semantics>
                              </m:math>
                              <sub>></sub>
                           </p>
                        </c>
                        <c ca="right">
                           <p>min <it>qim</it></p>
                        </c>
                        <c ca="right">
                           <p>max <it>qim</it></p>
                        </c>
                        <c ca="right">
                           <p>
                              <m:math name="1471-2105-7-364-i18" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>q</m:mi>
                                             <m:mi>i</m:mi>
                                             <m:mi>m</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdghaXjabdMgaPjabd2gaTbaaaaa@30E6@</m:annotation>
                                 </m:semantics>
                              </m:math>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <it>
                                 <ul>qim</ul>
                              </it>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="9">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>frst</it>
                           </p>
                        </c>
                        <c ca="right">
                           <p>0.51</p>
                        </c>
                        <c ca="right">
                           <p>0.23</p>
                        </c>
                        <c ca="right">
                           <p>0.50</p>
                        </c>
                        <c ca="right">
                           <p>0.27</p>
                        </c>
                        <c ca="right">
                           <p>-0.51</p>
                        </c>
                        <c ca="right">
                           <p>0.35</p>
                        </c>
                        <c ca="right">
                           <p>0.0016</p>
                        </c>
                        <c ca="right">
                           <p>0.0031</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>SVM</p>
                        </c>
                        <c ca="right">
                           <p>0.40</p>
                        </c>
                        <c ca="right">
                           <p>0.14</p>
                        </c>
                        <c ca="right">
                           <p>0.61</p>
                        </c>
                        <c ca="right">
                           <p>0.25</p>
                        </c>
                        <c ca="right">
                           <p>-0.21</p>
                        </c>
                        <c ca="right">
                           <p>0.29</p>
                        </c>
                        <c ca="right">
                           <p>0.0064</p>
                        </c>
                        <c ca="right">
                           <p>0.0160</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c cspan="7" ca="center">
                           <p><it>Model Generation Procedure PVH</it>, all measures for <it>TM</it></p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>e</p>
                        </c>
                        <c ca="right">
                           <p>
                              <m:math name="1471-2105-7-364-i16" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>n</m:mi>
                                             <m:mi>i</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabd6gaUjabdMgaPbaaaaa@2F7D@</m:annotation>
                                 </m:semantics>
                              </m:math>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <m:math name="1471-2105-7-364-i17" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>f</m:mi>
                                             <m:mi>i</m:mi>
                                             <m:mi>m</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdMgaPjabd2gaTbaaaaa@30D0@</m:annotation>
                                 </m:semantics>
                              </m:math>
                              <sub>&lt;</sub>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <m:math name="1471-2105-7-364-i17" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>f</m:mi>
                                             <m:mi>i</m:mi>
                                             <m:mi>m</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdMgaPjabd2gaTbaaaaa@30D0@</m:annotation>
                                 </m:semantics>
                              </m:math>
                              <sub>=</sub>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <m:math name="1471-2105-7-364-i17" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>f</m:mi>
                                             <m:mi>i</m:mi>
                                             <m:mi>m</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdMgaPjabd2gaTbaaaaa@30D0@</m:annotation>
                                 </m:semantics>
                              </m:math>
                              <sub>></sub>
                           </p>
                        </c>
                        <c ca="right">
                           <p>min <it>qim</it></p>
                        </c>
                        <c ca="right">
                           <p>max <it>qim</it></p>
                        </c>
                        <c ca="right">
                           <p>
                              <m:math name="1471-2105-7-364-i18" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mrow>
                                             <m:mi>q</m:mi>
                                             <m:mi>i</m:mi>
                                             <m:mi>m</m:mi>
                                          </m:mrow>
                                          <m:mo stretchy="true">&#175;</m:mo>
                                       </m:mover>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdghaXjabdMgaPjabd2gaTbaaaaa@30E6@</m:annotation>
                                 </m:semantics>
                              </m:math>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <it>
                                 <ul>qim</ul>
                              </it>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="9">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>frst</it>
                           </p>
                        </c>
                        <c ca="right">
                           <p>0.70</p>
                        </c>
                        <c ca="right">
                           <p>0.35</p>
                        </c>
                        <c ca="right">
                           <p>0.31</p>
                        </c>
                        <c ca="right">
                           <p>0.34</p>
                        </c>
                        <c ca="right">
                           <p>-0.43</p>
                        </c>
                        <c ca="right">
                           <p>0.29</p>
                        </c>
                        <c ca="right">
                           <p>0.00047</p>
                        </c>
                        <c ca="right">
                           <p>0.00068</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>SVM</p>
                        </c>
                        <c ca="right">
                           <p>0.58</p>
                        </c>
                        <c ca="right">
                           <p>0.22</p>
                        </c>
                        <c ca="right">
                           <p>0.43</p>
                        </c>
                        <c ca="right">
                           <p>0.35</p>
                        </c>
                        <c ca="right">
                           <p>-0.27</p>
                        </c>
                        <c ca="right">
                           <p>0.35</p>
                        </c>
                        <c ca="right">
                           <p>0.00774</p>
                        </c>
                        <c ca="right">
                           <p>0.01339</p>
                        </c>
                     </r>
                  </tblbdy>
               </tbl>
            </sec>
            <sec>
               <st>
                  <p>2.3.3 Spotting candidate targets with estimated improvement</p>
               </st>
               <p>Both model generation procedures <it>PVS </it>and <it>PVH </it>include the Arby default model in the ensemble of generated models. Therefore, for any target, model selection will only pick an alternative model, if a model with a score better than the Arby default exists. An indicator for this is</p>
               <p><it>ni</it><sub><it>E,frst</it></sub>(<it>t</it>) = [<it>frst</it>(<it>s</it><sub><it>E,fsrt</it></sub>(<it>t</it>)) &lt;<it>frst</it>(<it>d</it>(<it>t</it>))].</p>
               <p>The set of targets for which model selection proposes candidates with estimated improvement consists of <it>n</it>&#183;<m:math name="1471-2105-7-364-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>n</m:mi><m:mi>i</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabd6gaUjabdMgaPbaaaaa@2F7D@</m:annotation></m:semantics></m:math> = &#8721;<it>ni </it>targets. On this candidate set, we denote the average improvement in model quality as <m:math name="1471-2105-7-364-i19" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:munder accentunder="true"><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>m</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:munder><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mrow><m:mi>n</m:mi><m:mo>&#8901;</m:mo><m:mover accent="true"><m:mrow><m:mi>n</m:mi><m:mi>i</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow></m:mfrac><m:mstyle displaystyle="true"><m:msub><m:mo>&#8721;</m:mo><m:mi>t</m:mi></m:msub><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>m</m:mi><m:mo stretchy="false">(</m:mo><m:mi>t</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaadaaqaaiabdghaXjabdMgaPjabd2gaTbaacqGH9aqpdaWcaaqaaiabigdaXaqaaiabd6gaUjabgwSixpaanaaabaGaemOBa4MaemyAaKgaaaaadaaeqaqaaiabdghaXjabdMgaPjabd2gaTjabcIcaOiabdsha0jabcMcaPaWcbaGaemiDaqhabeqdcqGHris5aaaa@440B@</m:annotation></m:semantics></m:math>.</p>
            </sec>
            <sec>
               <st>
                  <p>2.3.4 Significance and coverage</p>
               </st>
               <p>The <it>fim</it><sub>> </sub>and <it>qim </it>numbers exhibit a noticeable increase in model quality with respect to random selection of models (cf. <m:math name="1471-2105-7-364-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>f</m:mi><m:mi>p</m:mi><m:mi>t</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdchaWjabdsha0baaaaa@30EC@</m:annotation></m:semantics></m:math><sub>&lt;</sub>and <m:math name="1471-2105-7-364-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>r</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdghaXjabdMgaPjabdkhaYbaaaaa@30F0@</m:annotation></m:semantics></m:math> values in Table <tblr tid="T3">3</tblr>).</p>
               <p>More importantly, comparing models resulting from the selection process to the Arby default by applying a paired Wilcoxon signed rank test, we find for model generation procedure <it>PVS </it>that the models selected according to <it>frst </it>are significantly better than the Arby default (with a p-value of 0.002). For the model generation procedure <it>PVH</it>, the models selected with <it>frst </it>alone are neither significantly better nor worse than the default, demonstrating that it is hard to select better models when generating more low-quality models.</p>
               <p>Selection of models constructed with model generation procedure <it>PVS </it>results in an average quality improvement of <m:math name="1471-2105-7-364-i20" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mrow><m:munder accentunder="true"><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>m</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:munder></m:mrow><m:mrow><m:msub><m:mi>E</m:mi><m:mrow><m:mi>P</m:mi><m:mi>V</m:mi><m:mi>S</m:mi><m:mo>,</m:mo><m:mi>f</m:mi><m:mi>r</m:mi><m:mi>s</m:mi><m:mi>t</m:mi></m:mrow></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaadaaqaaiabdghaXjabdMgaPjabd2gaTbaadaWgaaWcbaGaemyrau0aaSbaaWqaaiabdcfaqjabdAfawjabdofatjabcYcaSiabdAgaMjabdkhaYjabdohaZjabdsha0bqabaaaleqaaaaa@3C6B@</m:annotation></m:semantics></m:math> = 0.0031 and works better than selection of models constructed with model generation procedure <it>PVH </it>with an average quality improvement of <m:math name="1471-2105-7-364-i21" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mrow><m:munder accentunder="true"><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>m</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:munder></m:mrow><m:mrow><m:msub><m:mi>E</m:mi><m:mrow><m:mi>P</m:mi><m:mi>V</m:mi><m:mi>H</m:mi><m:mo>,</m:mo><m:mi>f</m:mi><m:mi>r</m:mi><m:mi>s</m:mi><m:mi>t</m:mi></m:mrow></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaadaaqaaiabdghaXjabdMgaPjabd2gaTbaadaWgaaWcbaGaemyrau0aaSbaaWqaaiabdcfaqjabdAfawjabdIeaijabcYcaSiabdAgaMjabdkhaYjabdohaZjabdsha0bqabaaaleqaaaaa@3C55@</m:annotation></m:semantics></m:math> = 0.00068.</p>
               <p>For generating procedure <it>PVS</it>, the selection according to FRST suggests an alternative model for 51% of the targets; 53% of these suggested targets are improved according to <it>TM</it>-score. For generating procedure <it>PVH</it>, alternative models are suggested for 70% of the targets; 48% of these suggested targets are improved according to <it>TM</it>-score.</p>
            </sec>
            <sec>
               <st>
                  <p>2.3.5 Selection of high quality models using an SVM-based selection</p>
               </st>
               <p>Based on the FRST scores, an SVM was trained to choose high quality models as described in the Methods section. The values <it>fim</it><sub><it>svm</it></sub>, <it>qim</it><sub><it>svm</it></sub>, and <it>ni</it><sub><it>svm </it></sub>are calculated analogously to the previously defined <it>fim</it><sub><it>frst</it></sub>, <it>qim</it><sub><it>frst</it></sub>, and <it>ni</it><sub><it>frst</it></sub>, by replacing <it>frst </it>in these formulas with the negative SVM decision values.</p>
            </sec>
            <sec>
               <st>
                  <p>2.3.6 Significance and coverage of the SVM selection</p>
               </st>
               <p>The results produced with selecting models according to the SVM decision values are summarized in Table <tblr tid="T4">4</tblr>. For <it>PVS</it>, an overview is given in Figure <figr fid="F2">2</figr>.</p>
               <fig id="F2">
                  <title>
                     <p>Figure 2</p>
                  </title>
                  <caption>
                     <p>(Left) Average increase in <it>TM</it>-score, for ranges of difficulty</p>
                  </caption>
                  <text>
                     <p>(Left) Average increase in <it>TM</it>-score, for ranges of difficulty. Targets are binned according to the <it>TM</it>-score of the default Arby model. Within each bin the average increase in quality <it><ul>qim</ul></it>is plotted. Bins are enumerated horizontally, the two outer bins were concatenated with their neighbors as each contained less than 100 target samples. Models are selected from <it>PVS </it>using the SVM. For comparison the average increase in quality obtained on this benchmark set by performing loop modeling is 0.003. (Right) Maximum increase in <it>TM</it>-score, for the same ranges of difficulty. The maximum increase in quality max <it>qim </it>within each bin is visualized as a line above the box representing the average increase (which is the same as on the left side, just the scale is different).</p>
                  </text>
                  <graphic file="1471-2105-7-364-2"/>
               </fig>
               <p>Compared to the FRST potentials, the <m:math name="1471-2105-7-364-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>n</m:mi><m:mi>i</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabd6gaUjabdMgaPbaaaaa@2F7D@</m:annotation></m:semantics></m:math><sub><it>sum </it></sub>are smaller (i.e. fewer targets were suggested for alteration, see Table <tblr tid="T4">4</tblr>). The SVM more effectively avoids changing models for the worse. This is visible in Figure <figr fid="F1">1</figr> and also reflected by noticeably smaller average numbers of models with decreased quality (<m:math name="1471-2105-7-364-i17" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>f</m:mi><m:mi>i</m:mi><m:mi>m</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdMgaPjabd2gaTbaaaaa@30D0@</m:annotation></m:semantics></m:math><sub>&lt;</sub>values, see Table <tblr tid="T4">4</tblr>). The overall average improvement in <it>TM</it>-score model quality (<m:math name="1471-2105-7-364-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>q</m:mi><m:mi>i</m:mi><m:mi>m</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdghaXjabdMgaPjabd2gaTbaaaaa@30E6@</m:annotation></m:semantics></m:math>, <ul><it>qim</it></ul>) increases.</p>
               <p>Applying a paired Wilcoxon signed rank test, we find for both generation procedures that the models selected by the SVM are significantly improved with respect to the Arby default (p-values below 10<sup>-15</sup>). The SVM selected models are also significantly improved with respect to the FRST selection process (with p-values below 10<sup>-5</sup>).</p>
               <p>For generating procedure <it>PVS</it>, the SVM-based selection suggests an alternative model for 40% of the targets; 64% of these suggested targets are improved according to <it>TM</it>-score. For generating procedure <it>PVH</it>, alternative models are suggested for 58% of the targets; 61% of these suggested targets are improved according to <it>TM</it>-score.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>3 Discussion</p>
         </st>
         <p>Jaroszewski et al. <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> show their method to produce significantly better alignments in about half of test cases, for a benchmark set of 742 protein pairs and make no statement regarding the likelihood of selecting such a solution from the ensemble of alternative alignments generated. To this end, they generate an average of 733 alignments per target-template pair with improved solutions in 34% of the test cases (average of 49 alignments). Our method is able to generate improved alignments for 59% of the test cases (<it>PVH</it>, Table <tblr tid="T3">3</tblr>) with only 13 alignments on average. The 55-fold decrease in the number of evaluated alignments compared to the method of Jaroszewski et al., while maintaining at least comparable increments in alignment quality, implies that we are exploring regions in the space of alignments that are densely populated with high-quality solutions, making the method practical for improving fully automated fold recognition servers such as Arby <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. This is important when comparing our method to other approaches like ROBETTA <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, or the work of John and Sali <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> or Contreras-Moreira and coworkers <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, where improved model generation requires several orders of magnitude more alignments to be evaluated.</p>
         <p>Improved alignments have to be selected from the ensemble of alternatives with an MQAP program in order to be useful. Neither Jaroszewski <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> nor Chivian <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> make quantitative statements about the selection of improved solutions. The results of John and Sali or <it>in silico </it>recombination are not directly comparable, as they generate and select the solutions iteratively. Our data show that the selection of improved alignments is a difficult task. A random selector would actually deteriorate the overall performance of the method. Generating more models does not necessarily help the selection process. Especially if more models below default quality are generated (as with generating procedure <it>PVH</it>), avoiding to select worse models is more difficult. Thus the error rate can increase and the overall performance can decrease if more low-quality models are generated.</p>
         <p>Here we show that the proposed protocol, including model generation and SVM-based selection, significantly improves model quality (p-values below 10<sup>-15 </sup>using a Wilcoxon signed rank test). With the model generation procedure <it>PVS</it>, using the SVM-based selection the proposed method achieves a close to optimal average <it>TM</it>-score improvement of 0.016 and a maximal observed increase in <it>TM</it>-quality of max <it>qim </it>= 0.29 This has to be related to typical fold recognition targets, where the <it>TM</it>-scores for large portions of the predictions lie in the range of 0.1 to 0.4 for hard targets and from 0.4 to 0.8 for easier targets.</p>
         <p>To emphasize the relevance of the proposed method in practical use, we compare the quality improvement of the proposed protocol with the quality gain obtained by loop modeling alone. The average quality increase in <it>TM</it>-score incurred by using our protocol amounts to 0.016, which is a factor of five above the quality gain obtained by loop modeling alone 0.003. The quality increase for our protocol is computed as the difference between the <it>TM</it>-scores of the MQAP selection from varied models with modeled loops minus the Arby default with loops modeled. The model quality obtained by loop modeling alone is computed as the difference of the <it>TM</it>-score of the Arby default with loops modeled minus that of the Arby default without loops modeled (see additional file supplement).</p>
      </sec>
      <sec>
         <st>
            <p>4 Conclusion</p>
         </st>
         <p>We have presented an approach for improving structure prediction models that goes in a different direction from the one recently proposed by Pettitt et al. <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Whereas they have evaluated the possibility to choose better templates with an MQAP program, we show that it is possible to generate and select better alignments for a fixed template with an MQAP program. The two approaches can be combined and will improve automated servers such as Arby.</p>
         <p>As this seems a promising approach in a competitive field, we will continue to work on the topic in two directions: First, generation of models with a high likelihood of improving the quality and second improving the selection process. For the latter, the numbers on the SVM performance clearly indicate that the current linear combination of the partial potentials in FRST can be improved.</p>
      </sec>
      <sec>
         <st>
            <p>5 Methods</p>
         </st>
         <sec>
            <st>
               <p>5.1 Protocol</p>
            </st>
            <sec>
               <st>
                  <p>5.1.1 Alternative alignments and models</p>
               </st>
               <p>The 3D protein structure model that we construct for a target protein is based on an alignment with a template structure. The method described here is independent of the strategy for template identification. With a given target and template as input, we compute a <it>default </it>alignment using profile-profile alignment with log-average scoring and parameters as tuned for the Arby server <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. Namely these parameters are: substitution matrix Blosum62, gap insertion 14.7, gap extension 0.37, and a relative weight of secondary structure to sequence information of 0.24.</p>
               <p>In addition to the Arby default alignment, we propose two procedures (<it>PVS, PVH </it>)for generating alternative alignments for a target in analogy to the parametric approach of Jaroszewski et al. <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. The alternatives are computed by a global profile-profile alignment method, using parameters multiplied with a factor varied inside the range from a lower to an upper bound. The parameters varied are gap-insertion, gap-extension, and the relative weights of amino-acid and secondary structure profiles. The two procedures differ with respect to the ranges of the factors. Each procedure reports alignments that occur multiply for different parameter settings only one time, resulting in an ensemble of distinct alignments.</p>
               <p>For each alignment a model is built as follows. Loop modeling of insertions and deletions is performed, using the LOBO program <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Conserved (i.e. identical) residues and their side chains are copied from the template structure. The non-conserved residues and their side chains are positioned and optimized by SCWRL3.0 <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>5.1.2 Model scores</p>
               </st>
               <p>The quality of the model is then estimated using the FRST MQAP program <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, which computes four potentials, namely a residue-specific all-atom distance potential <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> (<it>rapdf</it>), a solvation potential (<it>solv</it>), a hydrogen bonding potential (<it>hydb</it>), and a torsion angle potential (<it>tors</it>). These four potentials are linearly combined into the <it>frst </it>energy score (with factors 2.5, 500.0, -50.0, and 350.0, respectively <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>). This leaves us with the <it>frst </it>score as an estimate of the quality of each constructed model.</p>
               <p>We can select the best alignment-based model for each target, by choosing the model with the lowest energy score according to the <it>frst </it>potential. These selected models are potentially improvements over the default model (constructed according to the default alignment). In the supplement we additionally analyze selection according to the partial contributions <it>rapdf, solv, hydb</it>, and <it>tors </it>of the <it>frst </it>potential (see additional file supplement). The FRST MQAP program places a strong emphasis on the torsion angle component <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Since each residue can either increase or decrease the overall score, there is no correlation between the number of gaps in a model and the overall score.</p>
               <p>For 95% of the targets in the benchmark set of this paper, the FRST MQAP can distinguish the native structures from the Arby default models. Similarly, the performance of FRST on selecting the native structure from the models generated with procedures <it>PVS </it>and <it>PVH is </it>95% and 94%, respectively.</p>
            </sec>
            <sec>
               <st>
                  <p>5.1.3 Model quality evaluation</p>
               </st>
               <p>If, additionally, the native structure of the target is known, using the model evaluation programs GDT <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, MaxSub <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, and TM-score <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, we can compute scores (<it>GDT, MS, TM</it>), reflecting the "real" quality of the model in terms of structural similarity between model structure and target structure.</p>
               <p>In general the quality measures <it>GDT</it>, <it>MS</it>, and <it>TM </it>correlate well: The correlation coefficients between quality measures for all models produced are cor<sub><it>GDT,MS </it></sub>= 0.99, cor<sub><it>GDT,TM </it></sub>= 0.93, and cor<sub><it>MS,TM </it></sub>= 0.93 (see supplement, Table <tblr tid="T1">1</tblr>). Overall the analysis yields similar results for all three quality measures. As the <it>TM</it>-score has the advantage of being independent of the size of the protein, we restrict our presentation to the analysis of the <it>TM</it>-score.</p>
               <p>Overall, a moderate negative correlation cor<sub><it>TM,frst </it></sub>= -0.43 of the quality measure <it>TM</it>-score with the <it>frst </it>score can be observed. It has to be pointed out that the correlation of the <it>frst </it>score across all targets is not as relevant as its selection capabilities per target.</p>
            </sec>
            <sec>
               <st>
                  <p>5.1.4 Combining MQAP partial potentials using a support vector machine</p>
               </st>
               <p>We train a Support Vector Machine (SVM) for selecting models with higher <it>TM</it>-score than the <it>TM</it>-score of the default model. The binary labels used for each model are <it>TM</it>-score-increase and <it>TM</it>-score-decrease with respect to the default Arby model. As features we use the <it>frst, rapdf, solv, hydb, tors </it>values of each model and the corresponding default model as well as the differences of these scores between model and default. For each target, the best model is selected based on the SVM decision value <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Models with a negative SVM decision value remain unchanged with respect to the Arby default. As SVM implementation, the R package e1071 <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> based on libsvm <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> is employed. As parameter tuning showed only negligible changes in classification accuracy, standard parameters and a radial basis function kernel are used.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>5.2 Benchmarking</p>
            </st>
            <sec>
               <st>
                  <p>5.2.1 Dataset of targets and templates</p>
               </st>
               <p>For the validation of our approach, the improvement of the proposed models over the default Arby models was evaluated. Target sequences were taken from a representative set of SCOP 1.65 domains <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> with at most 40% sequence identity as provided by the Astral compendium <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. As a basis for the alternative models, in this study, one template was chosen for each target: With log-average scoring and default parameters as listed in Table <tblr tid="T1">1</tblr><abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>, the target was compared against the rest of the domains in the Astral 40% set and the top ranking hit was chosen as template. Our analysis was restricted to targets which have a template with at least 25% sequence identity, evaluating the proposed method for targets from the homologous fold recognition category. These criteria specify 1765 targets, each with one template. For 153 (8.7%) of these 1765 targets, some of the necessary computations failed. We excluded those targets, which leaves us with <it>n </it>= 1612 targets, for which we have all relevant scores available.</p>
            </sec>
            <sec>
               <st>
                  <p>5.2.2 Cross- validation of SVM-based selection</p>
               </st>
               <p>The training and validation of the support vector machine is performed using five-fold cross-validation. In order to ensure that there are no models for the same target in the training as well as in the testing set, during the cross-validation successively models for one fifth of the targets (not: one fifth of the models) are removed from the training set and used for testing.</p>
               <p>As the pairwise sequence identity between targets is below 40% according to selection criteria it is guaranteed that models in the test and training sets are sufficiently distinct.</p>
               <p>In order to assess the effect of the choice of <it>k </it>in <it>k</it>-fold cross-validation, a ten-fold cross-validation was also performed, yielding results identical to one digit precision in Table <tblr tid="T4">4</tblr> (data not shown, the figures in the article refer to <it>k </it>= 5).</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p><it>a</it><sub>0</sub>(<it>t</it>) Arby default model for targets <it>t</it></p>
         <p><it>E</it>(<it>t</it>) Set of models constructed for targets <it>t </it>according to model generation procedure <it>i </it>&#8712; {0,1,2}</p>
         <p><it>TM </it>Model quality evaluation measure as computed with the TM-score program</p>
         <p><it>GDT </it>Model quality evaluation measure as computed with the LGA program</p>
         <p><it>MS </it>Model quality evaluation measure as computed with the MaxSub program</p>
         <p>[<it>x</it>] Iverson bracket</p>
         <p><m:math name="1471-2105-7-364-i22" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mi>x</m:mi><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdIha4baaaaa@2E36@</m:annotation></m:semantics></m:math> Average of <it>x </it>over all targets</p>
         <p><it><ul>x</ul></it>Average of <it>x </it>over targets suggested by the selection procedure</p>
         <p><it>fpt</it><sub><it>E</it>,></sub>(<it>t</it>) Relative <b>f</b>equency of models <b>p</b>er <b>t</b>arget with <it>TM</it>-score above Arby default</p>
         <p><it>qir</it><sub><it>E</it></sub>(<it>t</it>) <b>Q</b>uality <b>i</b>mprovement when choosing <b>r</b>andomly</p>
         <p><it>fb</it><sub><it>E</it></sub>(<it>t</it>) Indicator whether better model exists per target</p>
         <p><m:math name="1471-2105-7-364-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>f</m:mi><m:mi>b</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdkgaIbaaaaa@2F5F@</m:annotation></m:semantics></m:math><sub><it>E</it></sub>(<it>t</it>) <b>R</b>elative <b>f</b>requency of targets for which a <b>b</b>etter model exists</p>
         <p><it>qib</it><sub><it>E</it></sub>(<it>t</it>) <b>Q</b>uality <b>i</b>mprovement which is theoretically the <b>b</b>est possible</p>
         <p><it>s</it><sub><it>E</it>,<it>frst</it></sub>(<it>t</it>) <b>S</b>elected model per target</p>
         <p><it>fim</it><sub><it>E</it>,<it>frst</it>,></sub>(<it>t</it>) Indicator whether selected model is improved in real quality</p>
         <p><m:math name="1471-2105-7-364-i17" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>f</m:mi><m:mi>i</m:mi><m:mi>m</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabdAgaMjabdMgaPjabd2gaTbaaaaa@30D0@</m:annotation></m:semantics></m:math><sub><it>E</it>,<it>frst</it>,></sub>(<it>t</it>) Relative <b>f</b>requency of selected models with <b>i</b>ncreased <it>TM</it>-score <b>m</b>odel quality</p>
         <p><it>qim</it><sub><it>E,frst</it></sub>(<it>t</it>) Measure of <b>q</b>uality <b>i</b>mprovement with respect to default Arby <b>m</b>odel</p>
         <p><m:math name="1471-2105-7-364-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mover accent="true"><m:mrow><m:mi>n</m:mi><m:mi>i</m:mi></m:mrow><m:mo stretchy="true">&#175;</m:mo></m:mover></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqdaaqaaiabd6gaUjabdMgaPbaaaaa@2F7D@</m:annotation></m:semantics></m:math> Relative frequency of targets for which a selection procedure suggests <b>i</b>mproved models</p>
         <p>The indicator functions are constructed to count the relative frequencies. They draw their names from the respective relative frequencies.</p>
      </sec>
      <sec>
         <st>
            <p>7 Authors' contributions</p>
         </st>
         <p>S.C.E.T, S.T., and I.S. conceived the experiment, I.S. and S.T. performed the experiment, I.S., O.S., and T.L. analyzed the results, and I.S., T.L., and S.C.E.T. wrote the final manuscript, which all authors have approved.</p>
         <suppl id="S1">
            <title>
               <p>Additional File 1</p>
            </title>
            <text>
               <p>Additional statistical analysis of partial potentials and of models with and without loop modeling. Some additional material, analysising partial potentials and analyzing the behaviour of the protocol when using models with our without loop modeling performed.</p>
            </text>
            <file name="1471-2105-7-364-S1.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>8 Acknowledgements</p>
            </st>
            <p>I.S. is funded by DFG (grant Le 491/14). S.C.E.T. is funded by a "Rientro dei cervelli" grant from the Italian Ministry for Education, University and Research (MIUR). We thank Giorgio Valle and Alessandro Albiero for insightful discussions as well as Lars Kunert, Francisco S. Domingues, and J&#246;rg Rahnenf&#252;hrer for valuable comments on the manuscript. This research was performed in the context of the BioSapiens Network of Excellence (EU grant no. LSHG-CT-2003-503265).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Critical assessment of methods of protein structure prediction (CASP) &#8211; round VI</p>
            </title>
            <aug>
               <au>
                  <snm>Moult</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fidelis</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tramontano</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rost</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2005</pubdate>
            <volume>61</volume>
            <issue>Suppl 7</issue>
            <fpage>3</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/prot.20716</pubid>
                  <pubid idtype="pmpid" link="fulltext">16187341</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Comparison of sequence profiles. Strategies for structural predictions using sequence information</p>
            </title>
            <aug>
               <au>
                  <snm>Rychlewski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Jaroszewski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Godzik</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Protein Science</source>
            <pubdate>2000</pubdate>
            <volume>9</volume>
            <issue>2</issue>
            <fpage>232</fpage>
            <lpage>241</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10716175</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Improving profile-profile alignment via log average scoring</p>
            </title>
            <aug>
               <au>
                  <snm>von &#214;hsen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Zimmer</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Algorithms in Bioinformatics, First International Workshop, WABI</source>
            <publisher>Springer</publisher>
            <editor>Gascuel O, Moret B</editor>
            <pubdate>2001</pubdate>
            <fpage>11</fpage>
            <lpage>26</lpage>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Within the twilight zone: a sensitive profile-profile comparison tool based on information theory</p>
            </title>
            <aug>
               <au>
                  <snm>Yona</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Levitt</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2002</pubdate>
            <volume>315</volume>
            <issue>5</issue>
            <fpage>1257</fpage>
            <lpage>1275</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2001.5293</pubid>
                  <pubid idtype="pmpid" link="fulltext">11827492</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Scoring profile-to-profile sequence alignments</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Dunbrack</snm>
                  <fnm>RL</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>2004</pubdate>
            <volume>13</volume>
            <issue>6</issue>
            <fpage>1612</fpage>
            <lpage>1626</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1110/ps.03601504</pubid>
                  <pubid idtype="pmpid" link="fulltext">15152092</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments</p>
            </title>
            <aug>
               <au>
                  <snm>Zhou</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2005</pubdate>
            <volume>58</volume>
            <issue>2</issue>
            <fpage>321</fpage>
            <lpage>328</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1408319</pubid>
                  <pubid idtype="pmpid" link="fulltext">15523666</pubid>
                  <pubid idtype="doi">10.1002/prot.20308</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Assessment of predictions submitted for the CASP6 comparative modelling category</p>
            </title>
            <aug>
               <au>
                  <snm>Tress</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ezkurdia</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Gra&#241;a</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>L&#243;pez</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Valencia</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2005</pubdate>
            <issue>Suppl 7</issue>
            <fpage>27</fpage>
            <lpage>45</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/prot.20720</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Progress over the first decade of GASP experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Kryshtafovych</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Venclovas</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Fidelis</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Moult</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2005</pubdate>
            <volume>61</volume>
            <issue>suppl 7</issue>
            <fpage>225</fpage>
            <lpage>236</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/prot.20740</pubid>
                  <pubid idtype="pmpid" link="fulltext">16187365</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>In search for more accurate alignments in the twilight zone</p>
            </title>
            <aug>
               <au>
                  <snm>Jaroszewski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Godzik</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>2002</pubdate>
            <volume>11</volume>
            <issue>7</issue>
            <fpage>1702</fpage>
            <lpage>1713</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1110/ps.4820102</pubid>
                  <pubid idtype="pmpid" link="fulltext">12070323</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>In silico protein recombination: enhancing template and sequence alignment selection for comparative protein modelling</p>
            </title>
            <aug>
               <au>
                  <snm>Contreras-Moreira</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fitzjohn</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Bates</snm>
                  <fnm>PA</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2003</pubdate>
            <volume>328</volume>
            <issue>3</issue>
            <fpage>593</fpage>
            <lpage>608</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0022-2836(03)00309-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">12706719</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Comparative protein structure modeling by iterative alignment, model building and model assessment</p>
            </title>
            <aug>
               <au>
                  <snm>John</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sali</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>14</issue>
            <fpage>3982</fpage>
            <lpage>3992</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165975</pubid>
                  <pubid idtype="pmpid" link="fulltext">12853614</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg460</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Improving sequence-based fold recognition by using 3D model quality assessment</p>
            </title>
            <aug>
               <au>
                  <snm>Pettitt</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>McGuffin</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>DT</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>17</issue>
            <fpage>3509</fpage>
            <lpage>3515</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti540</pubid>
                  <pubid idtype="pmpid" link="fulltext">15955780</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The Victor/FRST Function for Model Quality Estimation</p>
            </title>
            <aug>
               <au>
                  <snm>Tosatto</snm>
                  <fnm>SCE</fnm>
               </au>
            </aug>
            <source>Journal of Computational Biology</source>
            <pubdate>2005</pubdate>
            <volume>12</volume>
            <issue>10</issue>
            <fpage>1316</fpage>
            <lpage>1327</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/cmb.2005.12.1316</pubid>
                  <pubid idtype="pmpid" link="fulltext">16379537</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Large-scale prediction of protein structure and function from sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Tosatto</snm>
                  <fnm>SCE</fnm>
               </au>
               <au>
                  <snm>Toppo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Current Pharmaceutical Design</source>
            <pubdate>2006</pubdate>
            <volume>12</volume>
            <issue>17</issue>
            <fpage>2067</fpage>
            <lpage>2086</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2174/138161206777585238</pubid>
                  <pubid idtype="pmpid" link="fulltext">16796556</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>CAFASP4 MQAP</p>
            </title>
            <aug>
               <au>
                  <snm>Fischer</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <url>Http://www.cs.bgu.ac.il/~dfischer/CAFASP4/mqap.html</url>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Processing and analysis of CASP3 protein structure predictions</p>
            </title>
            <aug>
               <au>
                  <snm>Zemla</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Venclovas</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Moult</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fidelis</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>1999</pubdate>
            <issue>Suppl 3</issue>
            <fpage>22</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/(SICI)1097-0134(1999)37:3+&lt;22::AID-PROT5>3.0.CO;2-W</pubid>
                  <pubid idtype="pmpid">10526349</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>MaxSub: an automated measure for the assessment of protein structure prediction quality</p>
            </title>
            <aug>
               <au>
                  <snm>Siew</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Elofsson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rychlewski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fischer</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>9</issue>
            <fpage>776</fpage>
            <lpage>785</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.9.776</pubid>
                  <pubid idtype="pmpid" link="fulltext">11108700</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Scoring function for automated assessment of protein structure template quality</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Skolnick</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2004</pubdate>
            <volume>57</volume>
            <issue>4</issue>
            <fpage>702</fpage>
            <lpage>710</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/prot.20264</pubid>
                  <pubid idtype="pmpid" link="fulltext">15476259</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Assessment of protein models with three-dimensional profiles</p>
            </title>
            <aug>
               <au>
                  <snm>L&#252;thy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bowie</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Eisenberg</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1992</pubdate>
            <volume>356</volume>
            <issue>6364</issue>
            <fpage>83</fpage>
            <lpage>85</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/356083a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">1538787</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Recognition of errors in three-dimensional structures of proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Sippl</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>1993</pubdate>
            <volume>17</volume>
            <issue>4</issue>
            <fpage>355</fpage>
            <lpage>362</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/prot.340170404</pubid>
                  <pubid idtype="pmpid">8108378</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Servers for protein structure prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Fischer</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>178</fpage>
            <lpage>182</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.sbi.2006.03.004</pubid>
                  <pubid idtype="pmpid" link="fulltext">16546376</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Profile-profile alignment: a powerful tool for protein structure prediction</p>
            </title>
            <aug>
               <au>
                  <snm>von &#214;hsen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sommer</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Zimmer</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Pac Symp Biocomput</source>
            <pubdate>2003</pubdate>
            <volume>8</volume>
            <fpage>252</fpage>
            <lpage>263</lpage>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Arby: automatic protein structure prediction using profile-profile alignment and confidence measures</p>
            </title>
            <aug>
               <au>
                  <snm>von &#214;hsen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sommer</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Zimmer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lengauer</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>14</issue>
            <fpage>2228</fpage>
            <lpage>2235</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth232</pubid>
                  <pubid idtype="pmpid" link="fulltext">15059818</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Prediction of CASP-6 structures using automated Robetta protocols</p>
            </title>
            <aug>
               <au>
                  <snm>Chivian</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Malmstr&#246;m</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Schonbrun</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rohl</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Baker</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2005</pubdate>
            <issue>Suppl 7</issue>
            <fpage>157</fpage>
            <lpage>166</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/prot.20733</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>A divide and conquer approach to fast loop modeling</p>
            </title>
            <aug>
               <au>
                  <snm>Tosatto</snm>
                  <fnm>SCE</fnm>
               </au>
               <au>
                  <snm>Bindewald</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hesser</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>M&#228;nner</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Protein Eng</source>
            <pubdate>2002</pubdate>
            <volume>15</volume>
            <issue>4</issue>
            <fpage>279</fpage>
            <lpage>286</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/protein/15.4.279</pubid>
                  <pubid idtype="pmpid" link="fulltext">11983928</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>A graph-theory algorithm for rapid protein side-chain prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Canutescu</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shelenkov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dunbrack</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>2003</pubdate>
            <volume>12</volume>
            <issue>9</issue>
            <fpage>2001</fpage>
            <lpage>2014</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1110/ps.03154503</pubid>
                  <pubid idtype="pmpid" link="fulltext">12930999</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>An all-atom distance-dependent conditional probability discriminatory function for protein structure prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Samudrala</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Moult</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1998</pubdate>
            <volume>275</volume>
            <issue>5</issue>
            <fpage>895</fpage>
            <lpage>916</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1997.1479</pubid>
                  <pubid idtype="pmpid" link="fulltext">9480776</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>LGA: A method for finding 3D similarities in protein structures</p>
            </title>
            <aug>
               <au>
                  <snm>Zemla</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>13</issue>
            <fpage>3370</fpage>
            <lpage>3374</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">168977</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824330</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg571</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Probabilistic outputs for support vector machines and comparison to regularized likelihood methods</p>
            </title>
            <aug>
               <au>
                  <snm>Platt</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Advances in Large Margin Classifiers</source>
            <publisher>MIT Press</publisher>
            <editor>Smola A, Bartlett P, Schoelkopf B, Schuurmans D</editor>
            <pubdate>1999</pubdate>
            <fpage>61</fpage>
            <lpage>74</lpage>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The e1071 Package</p>
            </title>
            <aug>
               <au>
                  <snm>Dimitriadou</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hornik</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Leisch</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Weingessel</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <pubdate>2005</pubdate>
            <url>Http://cran.r-project.org/src/contrib/Descriptions/e1071.html</url>
         </bibl>
         <bibl id="B31">
            <title>
               <p>LIBSVM: a library for support vector machines</p>
            </title>
            <aug>
               <au>
                  <snm>Chang</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>CJ</fnm>
               </au>
            </aug>
            <pubdate>2001</pubdate>
            <url>Http://www.csie.ntu.edu.tw/~cjlin/libsvm</url>
         </bibl>
         <bibl id="B32">
            <title>
               <p>SCOP database in 2004: refinements integrate structure and sequence family data</p>
            </title>
            <aug>
               <au>
                  <snm>Andreeva</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Howorth</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>TJP</fnm>
               </au>
               <au>
                  <snm>Chothia</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Murzin</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <issue>32 Database</issue>
            <fpage>D226</fpage>
            <lpage>D229</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308773</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681400</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh039</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The ASTRAL compendium for protein structure and sequence analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Brenner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Koehl</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Levitt</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>254</fpage>
            <lpage>256</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102434</pubid>
                  <pubid idtype="pmpid" link="fulltext">10592239</pubid>
                  <pubid idtype="doi">10.1093/nar/28.1.254</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>The ASTRAL Compendium in 2004</p>
            </title>
            <aug>
               <au>
                  <snm>Chandonia</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Hon</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>NS</fnm>
               </au>
               <au>
                  <snm>Conte</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Koehl</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Levitt</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <issue>32 Database</issue>
            <fpage>D189</fpage>
            <lpage>92</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308768</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681391</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh034</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
