<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-8-459</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Predicting peptides binding to MHC class II molecules using multi-objective evolutionary algorithms</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Rajapakse</snm>
               <fnm>Menaka</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>menaka@i2r.a-star.edu.sg</email>
            </au>
            <au id="A2">
               <snm>Schmidt</snm>
               <fnm>Bertil</fnm>
               <insr iid="I2"/>
               <email>bertil.schmidt@computer.org</email>
            </au>
            <au id="A3">
               <snm>Feng</snm>
               <fnm>Lin</fnm>
               <insr iid="I3"/>
               <email>asflin@ntu.edu.sg</email>
            </au>
            <au id="A4">
               <snm>Brusic</snm>
               <fnm>Vladimir</fnm>
               <insr iid="I4"/>
               <email>vladimir_brusic@dfci.harvard.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Institute for Infocomm Research, 21 Heng Mui Keng Terrace, 119613 Singapore</p>
            </ins>
            <ins id="I2">
               <p>NICTA VRL, University of Melbourne, Parkville, 3010 Australia</p>
            </ins>
            <ins id="I3">
               <p>School of Computer Engineering, Nanyang Technological University, Block N4, Nanyang Avenue, 639798 Singapore</p>
            </ins>
            <ins id="I4">
               <p>Cancer Vaccine Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02115 USA</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>459</fpage>
         <url>http://www.biomedcentral.com/1471-2105/8/459</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18031584</pubid>
               <pubid idtype="doi">10.1186/1471-2105-8-459</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>07</day>
               <month>5</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>22</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>22</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Rajapakse et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Peptides binding to Major Histocompatibility Complex (MHC) class II molecules are crucial for initiation and regulation of immune responses. Predicting peptides that bind to a specific MHC molecule plays an important role in determining potential candidates for vaccines. The binding groove in class II MHC is open at both ends, allowing peptides longer than 9-mer to bind. Finding the consensus motif facilitating the binding of peptides to a MHC class II molecule is difficult because of different lengths of binding peptides and varying location of 9-mer binding core. The level of difficulty increases when the molecule is promiscuous and binds to a large number of low affinity peptides.</p>
               <p>In this paper, we propose two approaches using multi-objective evolutionary algorithms (MOEA) for predicting peptides binding to MHC class II molecules. One uses the information from both binders and non-binders for self-discovery of motifs. The other, in addition, uses information from experimentally determined motifs for guided-discovery of motifs.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>The proposed methods are intended for finding peptides binding to MHC class II I-A<sup>g7 </sup>molecule &#8211; a promiscuous binder to a large number of low affinity peptides. Cross-validation results across experiments on two motifs derived for I-A<sup>g7 </sup>datasets demonstrate better generalization abilities and accuracies of the present method over earlier approaches. Further, the proposed method was validated and compared on two publicly available benchmark datasets: (1) an ensemble of qualitative HLA-DRB1*0401 peptide data obtained from five different sources, and (2) quantitative peptide data obtained for sixteen different alleles comprising of three mouse alleles and thirteen HLA alleles. The proposed method outperformed earlier methods on most datasets, indicating that it is well suited for finding peptides binding to MHC class II molecules.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>We present two MOEA-based algorithms for finding motifs, one for self-discovery and the other for guided-discovery by experimentally determined motifs, and thereby predicting binding peptides to I-A<sup>g7 </sup>molecule. Our experiments show that the proposed MOEA-based algorithms are better than earlier methods in predicting binding sites not only on I-A<sup>g7 </sup>but also on most alleles of class II MHC benchmark datasets. This shows that our methods could be applicable to find binding motifs in a wide range of alleles.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Major histocompatibility complex (MHC) molecules play a key role in initiating immune responses. They bind to and expose an antigen (or short peptides) to T cell receptors (TCR) triggering an immune response against the infected cell or foreign agent. MHC molecules make multiple contacts with the side-chains of binding peptides, which define the binding motif and determine the specificity of binding <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Prediction of peptides binding to a MHC class II molecule is difficult due to different types of side chains and because the length of the binding peptides is longer than 9aa (approximately 11 to 22aa) <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. It has been previously observed that a core of 9aa is sufficient for binding peptides to a MHC class II molecules <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, however, the exact location of the binding core (or motif) within the peptide is usually unknown and vary.</p>
         <p>A binding motif is usually represented either by a consensus sequence or as a weight matrix <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. The presence or composition of a motif can be experimentally determined from a large pool of putative binding peptides <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B5">5</abbr></abbrgrp>. However, such wet-lab experiments are costly, time consuming, and cumbersome. Amino acids at specific sites of a motif, contributing significantly to the binding are referred to as <it>primary anchor residues </it>and the corresponding sites as <it>anchor positions</it>. By using such position-specific information, earlier studies have found weight matrix models elaborating the nature and strength of binding motifs <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. These models offer binding strengths of every residue at specific sites in the form of a position specific scoring matrix (PSSM).<abbrgrp><abbr bid="B7">7</abbr></abbrgrp></p>
         <p>In general, MHC class-II prediction methods are categorized into two main classes <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>: (1) quantitative prediction methods that predict inhibitory concentration (IC<sub>50</sub>) values and (2) qualitative prediction methods that determine the binding status (binder or non-binder) based on the predictive score. Recent quantitative prediction approaches include SVRMHC <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, PLS-ISC <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, ARB <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, and SMM-align <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. The ARB approach uses full length of the peptide whereas both SVRMHC and PLS-ISC approaches use a preprocessing step involving alignment of sequences, based on anchor position-specific residues. The underlying assumption of SMM-align is that amino acids occupying the 9-mer binding core motif are sufficient to determine the affinity of peptide-MHC binding. However, in some cases, the predictive performance could be improved by incorporating terminal residues known as peptide flanking residues (PFR) <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
         <p>Qualitative prediction approaches use classifiers such as artificial neural networks <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>, hidden Markov models <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B17">17</abbr></abbrgrp>, support vector machines <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>, and their hybrids <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, or profile analysis such as those using iterative learning <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>, stochastic approaches (MEME) <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>, Gibbs motif sampler <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>, profile motifs (RANKPEP) <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>, DNA microarrays and virtual matrices (TEPITOPE) <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, and evolutionary algorithms (EA) <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. However, given a set of sequences of differing lengths with known binding affinities, the location of the binding core within each sequence must be first identified before classification of sequences. Classical multiple sequence alignment techniques often fail to detect binding cores in MHC class II binding peptides because of weak instances of binding motifs.</p>
         <p>All methods predicting peptides binding to MHC molecules have their pros and cons; most show good performance only for datasets upon which they were developed. Therefore, there is a need for new algorithms that perform well on previously unseen data. We propose to use MOEA to align a set of experimentally determined binding peptides at their binding cores and subsequently derive the consensus motif. The methods are especially useful when molecules are promiscuous and bind to a large number of low affinity peptides. The preliminary results of our work have been presented in <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>.</p>
         <p>I-A<sup>g7 </sup>is the MHC class II molecule of the NOD mouse, critical for the development of insulin-dependent diabetes mellitus (IDDM) and other autoimmune disorders <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. Knowledge of peptides binding to I-A<sup>g7 </sup>is important in understanding the molecular basis of development of IDDM in NOD mice. Experiments have demonstrated that I-A<sup>g7 </sup>binding peptides are 9&#8211;30aa long <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Finding motifs in peptide binding to I-A<sup>g7 </sup>is a non-trivial problem <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp>. Despite numerous attempts, no consensus has been reached on the rules of peptide binding to I-A<sup>g7 </sup>molecule <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>. However, computational analyses on multiple datasets indicate that experimental motifs satisfy only a subset of rules describing the optimal motif.</p>
         <p>To demonstrate the utility in predicting peptides binding to other MHC molecules, our method is tested on two benchmark datasets comprising of peptides of number of different HLA (human MHC) and mouse alleles. The first dataset, referred to as BM-Set1 here onwards, consists of different combinations of peptides of HLA-DRB1*0401 allele, and the second dataset, BM-Set2, consists of datasets from thirteen different HLA alleles and three mouse alleles.</p>
         <sec>
            <st>
               <p>Multi-Objective Evolutionary Algorithms (MOEA)</p>
            </st>
            <p>Evolutionary algorithms (EA) are based on the principles of biological evolution and have often been successful in solving complex search and optimization problems. Majority of bioinformatics applications of EA have been in the discovery of motifs such as transcription factor binding sites <abbrgrp><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp>. Yet, only a few researchers have used EA for the prediction of peptides binding to protein sequences <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>.</p>
            <p>An EA consists of (1) representing input variables as individuals or chromosomes (binary or real valued) in a population, (2) formulating the fitness (objective function) to evaluate individuals, (3) generating a new population by genetic operations (such as reproduction, crossover, and mutation) on the current population, and (4) determining if the population has reached the optimal fitness. The algorithm begins with an initial population and evolves over time. At a particular instance of evolution, every individual is evaluated by its fitness. New populations (offspring) are produced from highly fit individuals (parents) selected, which undergo genetic operations. Each offspring is paired and compared to its parents. Highly fit individuals are retained in the population while less fit individuals are discarded. Search mechanisms such as elitism, constraint-handling, and multi-objective optimization are available for finding a better spread of solutions, depending on the needs of the optimization problem <abbrgrp><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp>.</p>
            <p>Multi-objective evolutionary algorithms (MOEA) are used to solve problems which require simultaneous optimization of a number of competing objective functions <abbrgrp><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>. MOEA maintains a set of solutions ranked by their dominance at a given instant of the evolution. A solution is said to dominate another if it is better or equal with respect to all objectives and strictly better in at least one objective <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. Often, there are more than one non-dominated solutions, representing the best ones, collectively known as the <it>Pareto </it>front. MOEA algorithms result in a <it>Pareto optimal set </it>of solutions.</p>
            <p>Non-dominated Sorting Genetic Algorithm II (NSGA-II) was recently introduced to incorporate several new genetic mechanisms for better convergence, such as non-dominated sorting, elitism, diversity preservation, and constraint handling <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. In NSGA-II, a population is subjected to several rounds of non-dominated sorting. That is, all the non-dominated individuals are identified and assigned the same fitness value until a new set of non-dominated solutions is found. The solutions found in subsequent rounds are assigned fitness values lower than those in the previous rounds. This process continues until the whole population is partitioned into non-dominated fronts with diverse fitness values. The elitism prevents the loss of fit individuals encountered in earlier generations by allowing earlier solutions to survive in the subsequent generations. The diversity of Pareto-optimal solutions is maintained by imposing a measure referred to as <it>crowding distance</it>. A solution that satisfies the constraints defined by the objective functions is called a <it>feasible solution</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Peptide Binding to MHC Class II I-A<sup>g7</sup></p>
            </st>
            <p>In this paper, we attempt to find an optimal motif describing peptide binding to MHC class II molecules, using experimentally determined binding data. There are several factors that impede the derivation of such a consensus motif. The first is the strong resemblance among the peptides isolated in a single experiment and the second is the diversity among different datasets. A motif derived from a dataset lacking diversity indicates a bias towards the dataset used in deriving the motif. Such motifs are difficult to generalize on other experimental or previously unseen datasets. The MOEA based motif detection algorithm is designed to find a consensus motif on I-A<sup>g7 </sup>datasets, which alleviates the influences arising from biased datasets and thereby predicts binding peptides more accurately in new datasets.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Predicting Peptides Binding to MHC Class II</p>
            </st>
            <p>We use our approach to find a consensus motif on seven experimental datasets of peptides binding to I-A<sup>g7 </sup>molecules, obtained from literature <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B62">62</abbr><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp>. The motif is validated using an independent testing set generated from the Stratmann dataset <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. The overall quality of prediction was measured using area under curve (AUC) of the receiver operating characteristics (ROC) curve <abbrgrp><abbr bid="B65">65</abbr><abbr bid="B66">66</abbr><abbr bid="B67">67</abbr></abbrgrp>. AUC values of all feasible solutions in the final population of EA were evaluated and the solution with the highest AUC was chosen as the consensus motif (see Additional file <supplr sid="S1">1</supplr>).</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>MOEA derived matrices on I-A<sup>g7 </sup>dataset. The two PSSM derived by using MOEA self-discovery and guided-discovery approaches are given in the Additional file <supplr sid="S1">1</supplr>.</p>
               </text>
               <file name="1471-2105-8-459-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>Table <tblr tid="T1">1</tblr> shows the information of the datasets extracted from literature, which were used in the training. A blank '-'indicates the unavailability of a particular information. As an example, the details of the experimental motif of Reizis <it>et al </it>are given in Table <tblr tid="T2">2</tblr>. Table <tblr tid="T3">3</tblr> shows the performance when an experimental motif is used to predict peptide binders in other datasets. As seen, a motif of a particular experiment does not characterize peptide binding of I-A<sup>g7 </sup>molecules in other datasets. Table <tblr tid="T4">4</tblr> shows the cross-validation performance of two motifs (by self-discovery and guided-discovery) derived using MOEA; in a particular cross-validation run, one experimental dataset was excluded and the motif was derived using the information of the remaining datasets. The motif was tested for predicting binders and non-binders of the left-out dataset. The self-discovery approach uses only the binding information whereas the guided-discovery uses both binding information as well as information associated with experimental motifs. As seen in Table <tblr tid="T4">4</tblr>, by achieving AUC values greater than 0.7 for all cross-validation runs, MOEA derived motifs demonstrate better generalization capabilities compared to experimentally determined motifs. The binding motifs derived from self-discovery and guided-discovery are illustrated as sequence logo plots <abbrgrp><abbr bid="B68">68</abbr></abbrgrp> in the Additional file <supplr sid="S2">2</supplr>.</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>Motif logos obtained for I-A<sup>g7 </sup>from MOEA derived matrices. Figure 1 and Figure 2 illustrate motif logos derived from the alignments obtained from the MOEA guided-discovery and self-discovery approaches. The web server <abbrgrp><abbr bid="B79">79</abbr></abbrgrp> was used to generate the motif logos as described in <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>.</p>
               </text>
               <file name="1471-2105-8-459-S2.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>I-A<sup>g7 </sup>datasets and experimental motifs</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Dataset</p>
                     </c>
                     <c ca="left">
                        <p>Experimental Motif</p>
                     </c>
                     <c ca="left">
                        <p>Non-binders</p>
                     </c>
                     <c ca="left">
                        <p>Binders</p>
                     </c>
                     <c ca="left">
                        <p>Reference</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Reizis</p>
                     </c>
                     <c ca="left">
                        <p><it>m</it>(Reizis)</p>
                     </c>
                     <c ca="left">
                        <p>21</p>
                     </c>
                     <c ca="left">
                        <p>33</p>
                     </c>
                     <c ca="left">
                        <p>[40]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Harrison</p>
                     </c>
                     <c ca="left">
                        <p><it>m</it>(Harrison)</p>
                     </c>
                     <c ca="left">
                        <p>19</p>
                     </c>
                     <c ca="left">
                        <p>157</p>
                     </c>
                     <c ca="left">
                        <p>[41]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Gregori</p>
                     </c>
                     <c ca="left">
                        <p><it>m</it>(Gregori)</p>
                     </c>
                     <c ca="left">
                        <p>31</p>
                     </c>
                     <c ca="left">
                        <p>109</p>
                     </c>
                     <c ca="left">
                        <p>[43]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Latek</p>
                     </c>
                     <c ca="left">
                        <p><it>m</it>(Latek)</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>37</p>
                     </c>
                     <c ca="left">
                        <p>[42]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p><it>m</it>(Rammensee)</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>[44]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p><it>m</it>(Reich)</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>[38]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p><it>m</it>(Amor)</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>[39]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Corper</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>35</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>[62]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MHCPEP</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>176</p>
                     </c>
                     <c ca="left">
                        <p>[63]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Yu</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>[64]</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Brusic</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>37</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>[unpublished]</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Information on I-A<sup>g7 </sup>related peptide binding datasets and motifs. Unavailable information is indicated by "-".</p>
               </tblfn>
            </tbl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Representation of an experimentally derived I-A<sup>g7 </sup>motif</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center">
                        <p>Position</p>
                     </c>
                     <c ca="center">
                        <p>Well-Tolerated</p>
                     </c>
                     <c ca="center">
                        <p>Weakly-Tolerated</p>
                     </c>
                     <c ca="center">
                        <p>Non-Tolerated</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>P1</p>
                     </c>
                     <c ca="center">
                        <p>VEQMHLPD</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>R</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>P2</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>P3</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>P4</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>ILPV</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>HY</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>QEK</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>P5</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>P6</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>ATSNV</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>-</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>LYQK</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>P7</p>
                     </c>
                     <c ca="center">
                        <p>QVYLHINRF</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>P8</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>P9</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>ED</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>SM</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>LYTQK</b>
                        </p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The description of experimentally determined I-A<sup>g7 </sup>9-mer peptide binding motif by Reizis: each position accommodates a well-tolerated, weakly-tolerated, or non-tolerated amino acid. The positions P4, P6 and P9 are the primary anchor positions where binding is highly likely to occur.</p>
               </tblfn>
            </tbl>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Validation of I-A<sup>g7 </sup>experimental motifs</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c ca="left">
                        <p>Experimental Motif</p>
                     </c>
                     <c cspan="7" ca="center">
                        <p>AUC value</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="7" ca="center">
                        <p>Datasets</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Reizis</p>
                     </c>
                     <c ca="center">
                        <p>Harrison</p>
                     </c>
                     <c ca="center">
                        <p>Gregori</p>
                     </c>
                     <c ca="center">
                        <p>Latek</p>
                     </c>
                     <c ca="center">
                        <p>Corper</p>
                     </c>
                     <c ca="center">
                        <p>MHCPEP</p>
                     </c>
                     <c ca="center">
                        <p>Yu</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>m</it>(Reizis)</p>
                     </c>
                     <c ca="center">
                        <p>0.95</p>
                     </c>
                     <c ca="center">
                        <p>0.68</p>
                     </c>
                     <c ca="center">
                        <p>0.74</p>
                     </c>
                     <c ca="center">
                        <p>0.95</p>
                     </c>
                     <c ca="center">
                        <p>0.50</p>
                     </c>
                     <c ca="center">
                        <p>0.59</p>
                     </c>
                     <c ca="center">
                        <p>0.48</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>m</it>(Harrison)</p>
                     </c>
                     <c ca="center">
                        <p>0.75</p>
                     </c>
                     <c ca="center">
                        <p>0.88</p>
                     </c>
                     <c ca="center">
                        <p>0.69</p>
                     </c>
                     <c ca="center">
                        <p>0.64</p>
                     </c>
                     <c ca="center">
                        <p>0.53</p>
                     </c>
                     <c ca="center">
                        <p>0.72</p>
                     </c>
                     <c ca="center">
                        <p>0.33</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>m</it>(Gregori)</p>
                     </c>
                     <c ca="center">
                        <p>0.64</p>
                     </c>
                     <c ca="center">
                        <p>0.68</p>
                     </c>
                     <c ca="center">
                        <p>0.71</p>
                     </c>
                     <c ca="center">
                        <p>0.73</p>
                     </c>
                     <c ca="center">
                        <p>0.40</p>
                     </c>
                     <c ca="center">
                        <p>0.64</p>
                     </c>
                     <c ca="center">
                        <p>0.61</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>m</it>(Latek)</p>
                     </c>
                     <c ca="center">
                        <p>0.66</p>
                     </c>
                     <c ca="center">
                        <p>0.72</p>
                     </c>
                     <c ca="center">
                        <p>0.80</p>
                     </c>
                     <c ca="center">
                        <p>0.95</p>
                     </c>
                     <c ca="center">
                        <p>0.64</p>
                     </c>
                     <c ca="center">
                        <p>0.52</p>
                     </c>
                     <c ca="center">
                        <p>0.75</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>m</it>(Rammensee</p>
                     </c>
                     <c ca="center">
                        <p>0.49</p>
                     </c>
                     <c ca="center">
                        <p>0.64</p>
                     </c>
                     <c ca="center">
                        <p>0.76</p>
                     </c>
                     <c ca="center">
                        <p>0.82</p>
                     </c>
                     <c ca="center">
                        <p>0.60</p>
                     </c>
                     <c ca="center">
                        <p>0.48</p>
                     </c>
                     <c ca="center">
                        <p>0.43</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>m</it>(Reich)</p>
                     </c>
                     <c ca="center">
                        <p>0.55</p>
                     </c>
                     <c ca="center">
                        <p>0.64</p>
                     </c>
                     <c ca="center">
                        <p>0.69</p>
                     </c>
                     <c ca="center">
                        <p>0.58</p>
                     </c>
                     <c ca="center">
                        <p>0.56</p>
                     </c>
                     <c ca="center">
                        <p>0.47</p>
                     </c>
                     <c ca="center">
                        <p>0.50</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>m</it>(Amor)</p>
                     </c>
                     <c ca="center">
                        <p>0.69</p>
                     </c>
                     <c ca="center">
                        <p>0.54</p>
                     </c>
                     <c ca="center">
                        <p>0.66</p>
                     </c>
                     <c ca="center">
                        <p>0.70</p>
                     </c>
                     <c ca="center">
                        <p>0.56</p>
                     </c>
                     <c ca="center">
                        <p>0.66</p>
                     </c>
                     <c ca="center">
                        <p>0.40</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Performance measured by AUC of experimentally determined I-A<sup>g7 </sup>motifs on their own datasets and other experimental datasets.</p>
               </tblfn>
            </tbl>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Performance of I-A<sup>g7 </sup>MOEA derived motifs</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="7" ca="center">
                        <p>AUC value</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MOEA-derived Motifs</p>
                     </c>
                     <c cspan="7" ca="center">
                        <p>Datasets</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Reizis</p>
                     </c>
                     <c ca="center">
                        <p>Harrison</p>
                     </c>
                     <c ca="center">
                        <p>Gregori</p>
                     </c>
                     <c ca="center">
                        <p>Latek</p>
                     </c>
                     <c ca="center">
                        <p>Corper</p>
                     </c>
                     <c ca="center">
                        <p>MHCPEP</p>
                     </c>
                     <c ca="center">
                        <p>Yu</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>self-discovery</p>
                     </c>
                     <c ca="center">
                        <p>0.75</p>
                     </c>
                     <c ca="center">
                        <p>0.75</p>
                     </c>
                     <c ca="center">
                        <p>0.77</p>
                     </c>
                     <c ca="center">
                        <p>0.93</p>
                     </c>
                     <c ca="center">
                        <p>0.70</p>
                     </c>
                     <c ca="center">
                        <p>0.75</p>
                     </c>
                     <c ca="center">
                        <p>0.75</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>guided-discovery</p>
                     </c>
                     <c ca="center">
                        <p>0.77</p>
                     </c>
                     <c ca="center">
                        <p>0.74</p>
                     </c>
                     <c ca="center">
                        <p>0.81</p>
                     </c>
                     <c ca="center">
                        <p>0.83</p>
                     </c>
                     <c ca="center">
                        <p>0.72</p>
                     </c>
                     <c ca="center">
                        <p>0.77</p>
                     </c>
                     <c ca="center">
                        <p>0.71</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Seven-fold cross-validation accuracies of MOEA derived motifs on training dataset.</p>
               </tblfn>
            </tbl>
            <p>To compare the performance of our method with earlier methods, a training dataset was created by combining all the experimental datasets given in Table <tblr tid="T1">1</tblr>. Motifs derived on the training dataset were tested on an independent test dataset &#8211; a balanced set generated from Stratmann dataset. The Stratmann dataset was balanced by adding randomly generated non-binders. Twenty five such balanced test datasets were assembled by generating random samples starting from different seeds and adding them to the Stratmann dataset. The results reported are based on the average AUC values over all balanced test sets. Figure <figr fid="F1">1</figr> shows comparison of performances of motifs derived by MOEA and by earlier motif prediction approaches such as MEME and RANKPEP. An increase of 4&#8211;10% in predictive performance is observed with MOEA over the other approaches.</p>
            <p>Comparison of performances of MOEA derived motifs for BM-Set1 (see Table <tblr tid="T5">5</tblr>) with enhanced Gibbs sampler <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>, TEPITOPE <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, SVRMHC <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> and ARB <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, is given in Table <tblr tid="T6">6</tblr>. As seen, MOEA shows comparable or superior performance with Gibbs sampler on all datasets except for the Southwood dataset. Out of the ten non-redundant (NR) datasets, the MOEA outperformed Gibbs sampler, TEPITOPE, SVRMHC and ARB by seven, nine, eight and ten datasets, respectively.</p>
            <p>The performance of MOEA on BM-Set2 (see Table <tblr tid="T7">7</tblr>) was compared with Gibbs sampler <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>, TEPITOPE <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, SVRMHC <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, ARB <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> and NetMHCII <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Each allele dataset was subjected to five-fold cross-validation and the results are given in Table <tblr tid="T8">8</tblr>. The present method shows comparable or superior performance on majority of allele datasets compared to Gibbs sampler, SVRMHC, TEPITOPE, and NetMHCII. A fair comparison of ARB method cannot be drawn because the method has been trained on quantitative data obtained from IEDB <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
            <tbl id="T5">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Description of peptides in BM-Set1</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>BM-Set1</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Original</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>NR</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DRB1*0401</p>
                     </c>
                     <c ca="center">
                        <p>Binders</p>
                     </c>
                     <c ca="center">
                        <p>Non-binders</p>
                     </c>
                     <c ca="center">
                        <p>Binders</p>
                     </c>
                     <c ca="center">
                        <p>Non-binders</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Set1</p>
                     </c>
                     <c ca="center">
                        <p>694</p>
                     </c>
                     <c ca="center">
                        <p>323</p>
                     </c>
                     <c ca="center">
                        <p>248</p>
                     </c>
                     <c ca="center">
                        <p>283</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Set2</p>
                     </c>
                     <c ca="center">
                        <p>381</p>
                     </c>
                     <c ca="center">
                        <p>292</p>
                     </c>
                     <c ca="center">
                        <p>161</p>
                     </c>
                     <c ca="center">
                        <p>255</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Set3a</p>
                     </c>
                     <c ca="center">
                        <p>373</p>
                     </c>
                     <c ca="center">
                        <p>217</p>
                     </c>
                     <c ca="center">
                        <p>151</p>
                     </c>
                     <c ca="center">
                        <p>204</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Set3b</p>
                     </c>
                     <c ca="center">
                        <p>279</p>
                     </c>
                     <c ca="center">
                        <p>216</p>
                     </c>
                     <c ca="center">
                        <p>128</p>
                     </c>
                     <c ca="center">
                        <p>197</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Set4a</p>
                     </c>
                     <c ca="center">
                        <p>323</p>
                     </c>
                     <c ca="center">
                        <p>323</p>
                     </c>
                     <c ca="center">
                        <p>120</p>
                     </c>
                     <c ca="center">
                        <p>283</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Set4b</p>
                     </c>
                     <c ca="center">
                        <p>292</p>
                     </c>
                     <c ca="center">
                        <p>292</p>
                     </c>
                     <c ca="center">
                        <p>120</p>
                     </c>
                     <c ca="center">
                        <p>255</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Set5a</p>
                     </c>
                     <c ca="center">
                        <p>70</p>
                     </c>
                     <c ca="center">
                        <p>47</p>
                     </c>
                     <c ca="center">
                        <p>65</p>
                     </c>
                     <c ca="center">
                        <p>45</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Set5b</p>
                     </c>
                     <c ca="center">
                        <p>48</p>
                     </c>
                     <c ca="center">
                        <p>37</p>
                     </c>
                     <c ca="center">
                        <p>47</p>
                     </c>
                     <c ca="center">
                        <p>37</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Southwood</p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Geluk</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="center">
                        <p>83</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                     <c ca="center">
                        <p>80</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The number of binders and non-binders in the original and non-redundant (NR) datasets in BM-Set1.</p>
               </tblfn>
            </tbl>
            <tbl id="T6">
               <title>
                  <p>Table 6</p>
               </title>
               <caption>
                  <p>Comparison of performance on BM-Set1</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c cspan="2" ca="center">
                        <p>Dataset</p>
                     </c>
                     <c cspan="5" ca="center">
                        <p>AUC</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>&#8224;SVRMHC</p>
                     </c>
                     <c ca="center">
                        <p>Gibbs</p>
                     </c>
                     <c ca="center">
                        <p>ARB</p>
                     </c>
                     <c ca="center">
                        <p>TEPITOPE</p>
                     </c>
                     <c ca="center">
                        <p>MOEA</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Original</p>
                     </c>
                     <c ca="left">
                        <p>set1</p>
                     </c>
                     <c ca="center">
                        <p>0.711</p>
                     </c>
                     <c ca="center">
                        <p>0.799</p>
                     </c>
                     <c ca="center">
                        <p>0.666</p>
                     </c>
                     <c ca="center">
                        <p>0.760</p>
                     </c>
                     <c ca="center">
                        <p>0.760</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>set2</p>
                     </c>
                     <c ca="center">
                        <p>0.652</p>
                     </c>
                     <c ca="center">
                        <p>0.766</p>
                     </c>
                     <c ca="center">
                        <p>0.653</p>
                     </c>
                     <c ca="center">
                        <p>0.736</p>
                     </c>
                     <c ca="center">
                        <p>0.765</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>set3a</p>
                     </c>
                     <c ca="center">
                        <p>0.626</p>
                     </c>
                     <c ca="center">
                        <p>0.740</p>
                     </c>
                     <c ca="center">
                        <p>0.652</p>
                     </c>
                     <c ca="center">
                        <p>0.730</p>
                     </c>
                     <c ca="center">
                        <p>0.733</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>set3b</p>
                     </c>
                     <c ca="center">
                        <p>0.618</p>
                     </c>
                     <c ca="center">
                        <p>0.751</p>
                     </c>
                     <c ca="center">
                        <p>0.666</p>
                     </c>
                     <c ca="center">
                        <p>0.750</p>
                     </c>
                     <c ca="center">
                        <p>0.752</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>set4a</p>
                     </c>
                     <c ca="center">
                        <p>0.706</p>
                     </c>
                     <c ca="center">
                        <p>0.788</p>
                     </c>
                     <c ca="center">
                        <p>0.668</p>
                     </c>
                     <c ca="center">
                        <p>0.748</p>
                     </c>
                     <c ca="center">
                        <p>0.748</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>set4b</p>
                     </c>
                     <c ca="center">
                        <p>0.664</p>
                     </c>
                     <c ca="center">
                        <p>0.770</p>
                     </c>
                     <c ca="center">
                        <p>0.661</p>
                     </c>
                     <c ca="center">
                        <p>0.748</p>
                     </c>
                     <c ca="center">
                        <p>0.770</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>set5a</p>
                     </c>
                     <c ca="center">
                        <p>0.553</p>
                     </c>
                     <c ca="center">
                        <p>0.604</p>
                     </c>
                     <c ca="center">
                        <p>0.539</p>
                     </c>
                     <c ca="center">
                        <p>0.653</p>
                     </c>
                     <c ca="center">
                        <p>0.777</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>set5b</p>
                     </c>
                     <c ca="center">
                        <p>0.606</p>
                     </c>
                     <c ca="center">
                        <p>0.621</p>
                     </c>
                     <c ca="center">
                        <p>0.579</p>
                     </c>
                     <c ca="center">
                        <p>0.679</p>
                     </c>
                     <c ca="center">
                        <p>0.748</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Southwood</p>
                     </c>
                     <c ca="center">
                        <p>0.912</p>
                     </c>
                     <c ca="center">
                        <p>0.862</p>
                     </c>
                     <c ca="center">
                        <p>0.514</p>
                     </c>
                     <c ca="center">
                        <p>0.490</p>
                     </c>
                     <c ca="center">
                        <p>0.784</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Geluk</p>
                     </c>
                     <c ca="center">
                        <p>0.697</p>
                     </c>
                     <c ca="center">
                        <p>0.723</p>
                     </c>
                     <c ca="center">
                        <p>0.682</p>
                     </c>
                     <c ca="center">
                        <p>0.710</p>
                     </c>
                     <c ca="center">
                        <p>0.786</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>NR</p>
                     </c>
                     <c ca="left">
                        <p>set1</p>
                     </c>
                     <c ca="center">
                        <p>0.619</p>
                     </c>
                     <c ca="center">
                        <p>0.673</p>
                     </c>
                     <c ca="center">
                        <p>0.572</p>
                     </c>
                     <c ca="center">
                        <p>0.594</p>
                     </c>
                     <c ca="center">
                        <p>0.587</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>set2</p>
                     </c>
                     <c ca="center">
                        <p>0.581</p>
                     </c>
                     <c ca="center">
                        <p>0.665</p>
                     </c>
                     <c ca="center">
                        <p>0.640</p>
                     </c>
                     <c ca="center">
                        <p>0.653</p>
                     </c>
                     <c ca="center">
                        <p>0.685</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>set3a</p>
                     </c>
                     <c ca="center">
                        <p>0.578</p>
                     </c>
                     <c ca="center">
                        <p>0.598</p>
                     </c>
                     <c ca="center">
                        <p>0.600</p>
                     </c>
                     <c ca="center">
                        <p>0.598</p>
                     </c>
                     <c ca="center">
                        <p>0.660</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>set3b</p>
                     </c>
                     <c ca="center">
                        <p>0.577</p>
                     </c>
                     <c ca="center">
                        <p>0.692</p>
                     </c>
                     <c ca="center">
                        <p>0.669</p>
                     </c>
                     <c ca="center">
                        <p>0.699</p>
                     </c>
                     <c ca="center">
                        <p>0.713</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>set4a</p>
                     </c>
                     <c ca="center">
                        <p>0.597</p>
                     </c>
                     <c ca="center">
                        <p>0.671</p>
                     </c>
                     <c ca="center">
                        <p>0.575</p>
                     </c>
                     <c ca="center">
                        <p>0.573</p>
                     </c>
                     <c ca="center">
                        <p>0.599</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>set4b</p>
                     </c>
                     <c ca="center">
                        <p>0.577</p>
                     </c>
                     <c ca="center">
                        <p>0.669</p>
                     </c>
                     <c ca="center">
                        <p>0.651</p>
                     </c>
                     <c ca="center">
                        <p>0.655</p>
                     </c>
                     <c ca="center">
                        <p>0.690</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>set5a</p>
                     </c>
                     <c ca="center">
                        <p>0.544</p>
                     </c>
                     <c ca="center">
                        <p>0.601</p>
                     </c>
                     <c ca="center">
                        <p>0.536</p>
                     </c>
                     <c ca="center">
                        <p>0.646</p>
                     </c>
                     <c ca="center">
                        <p>0.790</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>set5b</p>
                     </c>
                     <c ca="center">
                        <p>0.593</p>
                     </c>
                     <c ca="center">
                        <p>0.610</p>
                     </c>
                     <c ca="center">
                        <p>0.572</p>
                     </c>
                     <c ca="center">
                        <p>0.671</p>
                     </c>
                     <c ca="center">
                        <p>0.743</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Southwood</p>
                     </c>
                     <c ca="center">
                        <p>0.917</p>
                     </c>
                     <c ca="center">
                        <p>0.850</p>
                     </c>
                     <c ca="center">
                        <p>0.671</p>
                     </c>
                     <c ca="center">
                        <p>0.505</p>
                     </c>
                     <c ca="center">
                        <p>0.770</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Geluk</p>
                     </c>
                     <c ca="center">
                        <p>0.655</p>
                     </c>
                     <c ca="center">
                        <p>0.697</p>
                     </c>
                     <c ca="center">
                        <p>0.510</p>
                     </c>
                     <c ca="center">
                        <p>0.670</p>
                     </c>
                     <c ca="center">
                        <p>0.768</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Comparison of AUC values of the BM-Set1 (DRB1*0401). &#8224;These values are based on smaller dataset sizes as SVRMHC didn't predict values for some of the peptides. The values from the Gibbs sampler were estimated from the matrix provided by the authors in [32].</p>
               </tblfn>
            </tbl>
            <tbl id="T7">
               <title>
                  <p>Table 7</p>
               </title>
               <caption>
                  <p>Description of peptides in BM-Set2</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center">
                        <p>Type</p>
                     </c>
                     <c ca="center">
                        <p>Allele</p>
                     </c>
                     <c ca="center">
                        <p>Binders</p>
                     </c>
                     <c ca="center">
                        <p>Non-binders</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Mouse</p>
                     </c>
                     <c ca="center">
                        <p>I-Ab</p>
                     </c>
                     <c ca="center">
                        <p>43</p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>I-Ad</p>
                     </c>
                     <c ca="center">
                        <p>56</p>
                     </c>
                     <c ca="center">
                        <p>286</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>I-As</p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                     <c ca="center">
                        <p>91</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>HLA</p>
                     </c>
                     <c ca="center">
                        <p>DRB1-0101</p>
                     </c>
                     <c ca="center">
                        <p>920</p>
                     </c>
                     <c ca="center">
                        <p>283</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-0301</p>
                     </c>
                     <c ca="center">
                        <p>65</p>
                     </c>
                     <c ca="center">
                        <p>409</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-0401</p>
                     </c>
                     <c ca="center">
                        <p>209</p>
                     </c>
                     <c ca="center">
                        <p>248</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-0404</p>
                     </c>
                     <c ca="center">
                        <p>74</p>
                     </c>
                     <c ca="center">
                        <p>94</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-0405</p>
                     </c>
                     <c ca="center">
                        <p>88</p>
                     </c>
                     <c ca="center">
                        <p>83</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-0701</p>
                     </c>
                     <c ca="center">
                        <p>125</p>
                     </c>
                     <c ca="center">
                        <p>185</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-0802</p>
                     </c>
                     <c ca="center">
                        <p>58</p>
                     </c>
                     <c ca="center">
                        <p>116</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-0901</p>
                     </c>
                     <c ca="center">
                        <p>47</p>
                     </c>
                     <c ca="center">
                        <p>70</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-1101</p>
                     </c>
                     <c ca="center">
                        <p>95</p>
                     </c>
                     <c ca="center">
                        <p>264</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-1302</p>
                     </c>
                     <c ca="center">
                        <p>101</p>
                     </c>
                     <c ca="center">
                        <p>78</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-1501</p>
                     </c>
                     <c ca="center">
                        <p>188</p>
                     </c>
                     <c ca="center">
                        <p>177</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB4-0101</p>
                     </c>
                     <c ca="center">
                        <p>74</p>
                     </c>
                     <c ca="center">
                        <p>107</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB5-0101</p>
                     </c>
                     <c ca="center">
                        <p>112</p>
                     </c>
                     <c ca="center">
                        <p>231</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The number of binders and non-binders in each of the dataset in BM-Set2. The datasets in BM-Set2 were obtained from [77]. The DRB3-0101 allele dataset was excluded from the performance comparison due to significant imbalance in the dataset (3 binders and 99 non-binders).</p>
               </tblfn>
            </tbl>
            <tbl id="T8">
               <title>
                  <p>Table 8</p>
               </title>
               <caption>
                  <p>Comparison of Performance on BM-Set2</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c ca="center">
                        <p>Type</p>
                     </c>
                     <c ca="center">
                        <p>Allele</p>
                     </c>
                     <c cspan="6" ca="center">
                        <p>AUC</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>SVRMHC</p>
                     </c>
                     <c ca="center">
                        <p>Gibbs</p>
                     </c>
                     <c ca="center">
                        <p>ARB</p>
                     </c>
                     <c ca="center">
                        <p>TEPITOPE</p>
                     </c>
                     <c ca="center">
                        <p>NetMHCII</p>
                     </c>
                     <c ca="center">
                        <p>MOEA</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Mouse</p>
                     </c>
                     <c ca="center">
                        <p>I-A<sup>b</sup></p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>0.662</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>0.908</p>
                     </c>
                     <c ca="center">
                        <p>0.919</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>I-A<sup>d</sup></p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>0.819</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>0.818</p>
                     </c>
                     <c ca="center">
                        <p>0.855</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>I-A<sup>s</sup></p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>-</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.898</p>
                     </c>
                     <c ca="center">
                        <p>0.889</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>HLA</p>
                     </c>
                     <c ca="center">
                        <p>DRB1-0101</p>
                     </c>
                     <c ca="center">
                        <p>0.623</p>
                     </c>
                     <c ca="center">
                        <p>0.676</p>
                     </c>
                     <c ca="center">
                        <p>0.666</p>
                     </c>
                     <c ca="center">
                        <p>0.647</p>
                     </c>
                     <c ca="center">
                        <p>0.716</p>
                     </c>
                     <c ca="center">
                        <p>0.651</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-0301</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>0.722</p>
                     </c>
                     <c ca="center">
                        <p>0.799</p>
                     </c>
                     <c ca="center">
                        <p>0.734</p>
                     </c>
                     <c ca="center">
                        <p>0.765</p>
                     </c>
                     <c ca="center">
                        <p>0.778</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-0401</p>
                     </c>
                     <c ca="center">
                        <p>0.739</p>
                     </c>
                     <c ca="center">
                        <p>0.759</p>
                     </c>
                     <c ca="center">
                        <p>0.737</p>
                     </c>
                     <c ca="center">
                        <p>0.754</p>
                     </c>
                     <c ca="center">
                        <p>0.758</p>
                     </c>
                     <c ca="center">
                        <p>0.725</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-0404</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>0.743</p>
                     </c>
                     <c ca="center">
                        <p>0.788</p>
                     </c>
                     <c ca="center">
                        <p>0.829</p>
                     </c>
                     <c ca="center">
                        <p>0.785</p>
                     </c>
                     <c ca="center">
                        <p>0.786</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-0405</p>
                     </c>
                     <c ca="center">
                        <p>0.701</p>
                     </c>
                     <c ca="center">
                        <p>0.724</p>
                     </c>
                     <c ca="center">
                        <p>0.724</p>
                     </c>
                     <c ca="center">
                        <p>0.790</p>
                     </c>
                     <c ca="center">
                        <p>0.735</p>
                     </c>
                     <c ca="center">
                        <p>0.756</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-0701</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>0.695</p>
                     </c>
                     <c ca="center">
                        <p>0.749</p>
                     </c>
                     <c ca="center">
                        <p>0.768</p>
                     </c>
                     <c ca="center">
                        <p>0.787</p>
                     </c>
                     <c ca="center">
                        <p>0.735</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-0802</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>0.721</p>
                     </c>
                     <c ca="center">
                        <p>0.803</p>
                     </c>
                     <c ca="center">
                        <p>0.769</p>
                     </c>
                     <c ca="center">
                        <p>0.756</p>
                     </c>
                     <c ca="center">
                        <p>0.773</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-0901</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>0.734</p>
                     </c>
                     <c ca="center">
                        <p>0.711</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>0.775</p>
                     </c>
                     <c ca="center">
                        <p>0.712</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-1101</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>0.715</p>
                     </c>
                     <c ca="center">
                        <p>0.727</p>
                     </c>
                     <c ca="center">
                        <p>0.710</p>
                     </c>
                     <c ca="center">
                        <p>0.734</p>
                     </c>
                     <c ca="center">
                        <p>0.759</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-1302</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>0.716</p>
                     </c>
                     <c ca="center">
                        <p>0.917</p>
                     </c>
                     <c ca="center">
                        <p>0.720</p>
                     </c>
                     <c ca="center">
                        <p>0.818</p>
                     </c>
                     <c ca="center">
                        <p>0.820</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB1-1501</p>
                     </c>
                     <c ca="center">
                        <p>0.730</p>
                     </c>
                     <c ca="center">
                        <p>0.672</p>
                     </c>
                     <c ca="center">
                        <p>0.792</p>
                     </c>
                     <c ca="center">
                        <p>0.726</p>
                     </c>
                     <c ca="center">
                        <p>0.736</p>
                     </c>
                     <c ca="center">
                        <p>0.743</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB4-0101</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>0.742</p>
                     </c>
                     <c ca="center">
                        <p>0.800</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>0.736</p>
                     </c>
                     <c ca="center">
                        <p>0.759</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>DRB5-0101</p>
                     </c>
                     <c ca="center">
                        <p>0.649</p>
                     </c>
                     <c ca="center">
                        <p>0.618</p>
                     </c>
                     <c ca="center">
                        <p>0.677</p>
                     </c>
                     <c ca="center">
                        <p>0.653</p>
                     </c>
                     <c ca="center">
                        <p>0.664</p>
                     </c>
                     <c ca="center">
                        <p>0.660</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Comparison of AUC values from five-fold cross-validation of allele datasets given in BM-Set2. "-" indicates that the allele is unavailable for testing with the respective prediction method.</p>
               </tblfn>
            </tbl>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Comparison of Performances</p>
               </caption>
               <text>
                  <p><b>Comparison of Performances</b>. Comparison of performance of MOEA based algorithms &#8211; self-discovery and guided-discovery &#8211; against MEME, RANKPEP, and experimental motifs on the balanced I-A<sup>g7 </sup>test datasets (the performance was averaged over 25 test datasets)</p>
               </text>
               <graphic file="1471-2105-8-459-1"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>We proposed two approaches using MOEA for deriving motifs (1) when the information of only the binders and non-binders are known (i.e., self-discovery) and (2) when, in addition, the information of experimentally (wet-lab) determined motifs are available (i.e., guided-discovery).</p>
         <p>Since I-A<sup>g7 </sup>molecule is known to bind to a large number of peptides of low affinity and appears to be a promiscuous binder, the prediction of peptides binding to I-A<sup>g7 </sup>molecule has been nontrivial. This has lead to the definition of a number of suboptimal consensus motifs specific to the datasets. MOEA derived motifs had superior generalization capabilities to those derived with MEME and RANKPEP techniques as well as to the experimentally determined motifs on other datasets. The performances evaluated on two benchmark datasets indicate that the present MOEA based algorithm is applicable in deriving motifs on other class II MHC alleles as well.</p>
         <p>The likelihood of finding an optimal motif by MOEA is higher than by a local or greedy search because of the stochastic nature of EA. The proposed approach learns from the characteristics of both binders and non-binders in the training set whereas other methods use information only from binders to determine motifs <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B32">32</abbr></abbrgrp>. Moreover, ranges of the parameters involved in MOEA are known, so the parameters of the fitness functions are quickly estimated in a few cross-validation runs. Furthermore, unlike the earlier methods, the present method does not rely on any prior information such as anchor positions to obtain an alignment, prior distributions, etc., <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. Given sufficient data samples representing both binders and non-binders, the method could be applicable to find motifs in other types of molecules. A future direction of this research would be to integrate additional information such as peptide length <abbrgrp><abbr bid="B69">69</abbr></abbrgrp> and PFR <abbrgrp><abbr bid="B70">70</abbr></abbrgrp> as such information has been shown to have the potential to enhance motif detection <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B69">69</abbr></abbrgrp>. This would lead to further improvement of the performance of the present algorithm.</p>
         <p>Even though EAs are generally known to be computationally intensive, training for derivation of scoring matrices can be performed off-line and the prediction engines can be provided through web services. As seen in Tables <tblr tid="T6">6</tblr> and <tblr tid="T8">8</tblr>, a single method does not always perform well on all types of allele datasets. Nevertheless, the present method showed higher accuracy in detecting motifs on majority of MHC alleles in the benchmark datasets. Therefore, we believe that MOEA-based methods could provide a general framework for efficiently determining motifs in a wide range of MHC molecules.</p>
         <p>In immunology, accuracy and speed in predicting binding peptides is of paramount importance. Computationally predicted binders do subsequently need to be validated with wet-lab experiments. By using computational predictions as an initial step, high cost involved in initial screening and time-consuming clinical testing can be significantly reduced. Towards this end, the proposed MOEA methods present a promising way to predict peptides that bind to MHC class II alleles including promiscuous and low affinity peptide binders.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We present two MOEA-based algorithms for finding motifs, one for self-discovery and the other for guided-discovery by experimentally determined motifs, and thereby predicting binding peptides to I-A<sup>g7 </sup>molecule. Our experiments show that the proposed MOEA-based algorithms are better than earlier methods in predicting binding sites not only on I-A<sup>g7 </sup>but also on most alleles of class II MHC benchmark datasets. This demonstrates the applicability of our methods to find binding motifs in a wide range of MHC alleles.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Datasets</p>
            </st>
            <p>Several I-A<sup>g7 </sup>datasets were extracted from literature <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B62">62</abbr><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp> and from Brusic, V.(unpublished data). The numbers of binders and non-binders in each dataset are given in Table <tblr tid="T1">1</tblr>. The datasets consist of short peptides ranging from 9&#8211;30aa in length. Their binding affinities had been experimentally determined by independent studies and classified as binders or non-binders based on IC<sub>50 </sub>values according to the following scheme <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>: good binder (IC<sub>50 </sub>= 100 nM); weak binder (IC<sub>50 </sub>= 2000 nM); non-binder (IC<sub>50 </sub>= 50000 nM). The datasets in <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B62">62</abbr><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp> were combined into a single training dataset and curated by removing duplicates and redundancy as follows: if a binder is a subsequence of another binder sequence, the longer binder sequence is discarded; if a non-binder is a subsequence of another non-binder, the shorter subsequence is discarded. Let the curated whole dataset be referred to as <it>training </it>dataset here onwards and it be denoted by <it>D </it>= {(<it>x</it><sub><it>i</it></sub>, <it>v</it><sub><it>i</it></sub>): <it>i </it>= 1, 2,.... <it>N</it>} where <it>N </it>is the number of total peptide sequences and <it>x</it><sub><it>i </it></sub>is the <it>i</it>-th peptide sequence with the label <it>v</it><sub><it>i </it></sub>&#949; {b, nb} indicating whether the sequence <it>x</it><sub><it>i </it></sub>is a binder (b) or a non-binder (nb). The number of peptides in the training set <it>N </it>= 438 in which the number of binders <it>N</it><sub>b </sub>= 304 and the number of non-binders <it>N</it><sub>nb </sub>= 134.</p>
            <p>The set of experimentally validated I-A<sup>g7</sup>motifs <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp> derived largely from uncorrelated datasets <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp> was extracted and is illustrated in Table <tblr tid="T1">1</tblr> with the distribution of binders and non-binders in each dataset. Table <tblr tid="T2">2</tblr> illustrates an experimentally validated motif of I-A<sup>g7 </sup>reported by Reizis <it>et al </it><abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Experimental motifs are described by the anchor positions and binding affinities of amino acids of the motif. The residues which contribute significantly to the peptide binding are called primary anchor residues and positions they reside are called anchor positions. An amino acid occupying a specific position within a motif is characterized as well tolerated, weakly tolerated, or non-tolerated based on its involvement in the binding process.</p>
            <p>An independent dataset was generated from binders of Stratmann dataset <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, consisting of a diverse set of I-A<sup>g7 </sup>binding peptides with their binding affinities, to find the test accuracies in predicting binders and non-binders. The Stratmann dataset was balanced with randomly generated 9-mer non-binders so that for testing dataset, <it>N</it><sub>b </sub>= <it>N</it><sub>nb </sub>= 112.</p>
         </sec>
         <sec>
            <st>
               <p>Binding Score Matrix</p>
            </st>
            <p>A <it>k</it>-mer motif of amino acids is characterized by a PSSM <it>Q </it>= {<it>q</it><sub><it>ia</it></sub>}<sub><it>k </it>&#215; 20 </sub>where <it>q</it><sub><it>ia </it></sub>denotes the binding strength of the site <it>i </it>when it is occupied by amino acid <it>a</it>. The binding score of a putative motif is computed by adding the binding scores assigned to each amino acid at the respective positions. The binding score indicates the likelihood of the motif binding to the molecule. The binding score <it>s</it><sub><it>i </it></sub>of sequence <it>x</it><sub><it>i </it></sub>= (<it>x</it><sub><it>i</it>,1</sub>, <it>x</it><sub><it>i</it>,2</sub>,...<it>x</it><sub><it>i</it>, <it>n</it></sub>) of length <it>n </it>is determined by the maximum value of binding scores computed for all <it>k</it>-mer subsequences in <it>x</it><sub><it>i</it></sub>:</p>
            <p>
               <display-formula id="M1">
                  <m:math name="1471-2105-8-459-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>s</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:munder>
                              <m:mrow>
                                 <m:mi>max</m:mi>
                                 <m:mo>&#8289;</m:mo>
                              </m:mrow>
                              <m:mi>j</m:mi>
                           </m:munder>
                           <m:mo>{</m:mo>
                           <m:msub>
                              <m:mi>s</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mi>j</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>:</m:mo>
                           <m:mi>j</m:mi>
                           <m:mo>=</m:mo>
                           <m:mn>1</m:mn>
                           <m:mo>,</m:mo>
                           <m:mn>2</m:mn>
                           <m:mo>,</m:mo>
                           <m:mo>&#8943;</m:mo>
                           <m:mi>n</m:mi>
                           <m:mo>&#8722;</m:mo>
                           <m:mi>k</m:mi>
                           <m:mo>+</m:mo>
                           <m:mn>1</m:mn>
                           <m:mo>}</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4Cam3aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpdaWfqaqaaiGbc2gaTjabcggaHjabcIha4bWcbaGaemOAaOgabeaakiabcUha7jabdohaZnaaBaaaleaacqWGPbqAcqWGQbGAaeqaaOGaeiOoaOJaemOAaOMaeyypa0JaeGymaeJaeiilaWIaeGOmaiJaeiilaWIaeS47IWKaemOBa4MaeyOeI0Iaem4AaSMaey4kaSIaeGymaeJaeiyFa0haaa@4BBC@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>s</it><sub><it>ij </it></sub>denotes the binding score of the subsequence beginning at location <it>j </it>of the sequence <it>i</it>, which is given by</p>
            <p>
               <display-formula id="M2">
                  <m:math name="1471-2105-8-459-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>s</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mi>j</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munder>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mi>l</m:mi>
                                    <m:mo>=</m:mo>
                                    <m:mn>1</m:mn>
                                    <m:mo>,</m:mo>
                                    <m:mn>2</m:mn>
                                    <m:mo>,</m:mo>
                                    <m:mo>&#8943;</m:mo>
                                    <m:mi>k</m:mi>
                                 </m:mrow>
                              </m:munder>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>q</m:mi>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>j</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:mi>l</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>,</m:mo>
                                 <m:msub>
                                    <m:mi>x</m:mi>
                                    <m:mrow>
                                       <m:mi>i</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>j</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:mi>l</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:msub>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4Cam3aaSbaaSqaaiabdMgaPjabdQgaQbqabaGccqGH9aqpdaaeqbqaaiabdghaXnaaBaaaleaacqGGOaakcqWGQbGAcqGHRaWkcqWGSbaBcqGGPaqkaeqaaOGaeiilaWIaemiEaG3aaSbaaSqaaiabdMgaPjabcIcaOiabdQgaQjabgUcaRiabdYgaSjabcMcaPaqabaaabaGaemiBaWMaeyypa0JaeGymaeJaeiilaWIaeGOmaiJaeiilaWIaeS47IWKaem4AaSgabeqdcqGHris5aaaa@4D17@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>and assuming that only one motif instance exists in every sequence, the location <it>j</it>* of the motif is given by</p>
            <p>
               <display-formula id="M3">
                  <m:math name="1471-2105-8-459-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msup>
                              <m:mi>j</m:mi>
                              <m:mo>&#8727;</m:mo>
                           </m:msup>
                           <m:mo>=</m:mo>
                           <m:mi>arg</m:mi>
                           <m:mo>&#8289;</m:mo>
                           <m:munder>
                              <m:mrow>
                                 <m:mi>max</m:mi>
                                 <m:mo>&#8289;</m:mo>
                              </m:mrow>
                              <m:mi>j</m:mi>
                           </m:munder>
                           <m:mo>{</m:mo>
                           <m:msub>
                              <m:mi>s</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mi>j</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>:</m:mo>
                           <m:mi>j</m:mi>
                           <m:mo>=</m:mo>
                           <m:mn>1</m:mn>
                           <m:mo>,</m:mo>
                           <m:mn>2</m:mn>
                           <m:mo>,</m:mo>
                           <m:mo>&#8943;</m:mo>
                           <m:mi>n</m:mi>
                           <m:mo>&#8722;</m:mo>
                           <m:mi>k</m:mi>
                           <m:mo>+</m:mo>
                           <m:mn>1</m:mn>
                           <m:mo>}</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOAaO2aaWbaaSqabeaacqGHxiIkaaGccqGH9aqpcyGGHbqycqGGYbGCcqGGNbWzdaWfqaqaaiGbc2gaTjabcggaHjabcIha4bWcbaGaemOAaOgabeaakiabcUha7jabdohaZnaaBaaaleaacqWGPbqAcqWGQbGAaeqaaOGaeiOoaOJaemOAaOMaeyypa0JaeGymaeJaeiilaWIaeGOmaiJaeiilaWIaeS47IWKaemOBa4MaeyOeI0Iaem4AaSMaey4kaSIaeGymaeJaeiyFa0haaa@4F4D@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>That is, the most likely motif instance of sequence <it>x</it><sub><it>i</it></sub>, say <it>m</it><sub><it>i</it></sub>, is given by the sequence <it>m</it><sub><it>i </it></sub>= (<it>x</it><sub><it>ij</it>*</sub>&#183;<it>x</it><sub><it>ij</it>* + 1</sub>,... <it>x</it><sub><it>ij</it>* + <it>k</it>-1</sub>).</p>
         </sec>
         <sec>
            <st>
               <p>Self-discovery of Motif</p>
            </st>
            <p>We derive a consensus motif from the training dataset which consists of peptides from several experiments and of varying lengths. The positions of binding cores within the peptides are unknown. The elements of the PSSM are represented as 20<it>k</it>-tuples (<it>q</it><sub><it>ia</it></sub>, : <it>i </it>= 1,... <it>k</it>; <it>a &#949; </it>&#937;) where &#937; represents the amino acid alphabet. Each element in the <it>k</it>-tuple is converted to a real number representation using a binary word of size <it>&#952; </it>so that <it>q</it><sub><it>ia </it></sub>&#8712; [0, 2<sup><it>&#952;</it></sup>-1]. The <it>k</it>-mer motif is therefore represented by an individual of 20<it>k&#952; </it>long string in the EA. Let the population at <it>t</it>-th iteration of the evolution is denoted by <it>q</it>(<it>t</it>) = {<it>q</it><sub>1</sub>(<it>t</it>), <it>q</it><sub>2</sub>(<it>t</it>),..... <it>q</it><sub><it>M </it></sub>(<it>t</it>)} where <it>q</it><sub><it>j</it></sub>(<it>t</it>) represents an individual in a population of size <it>M</it>.</p>
            <p>The fitness function is designed to arrive at an optimal consensus of the motif, by using the training dataset. A solution is evaluated based on its ability to maximize the accuracies in identifying true binders (TP) and true non-binders (TN) as well as to widen the gap between the total score for binders and non-binders. This is achieved by two fitness functions: <it>f</it><sub>1 </sub>to minimize the sum of false positives (FP) and false negatives (FN), and <it>f</it><sub>2 </sub>to minimize the ratio between the average cumulative scores of non-binders and binders:</p>
            <p>
               <display-formula id="M4"><it>f</it><sub>1 </sub>= FN + <it>&#954;</it><sub>1 </sub>FP</display-formula>
            </p>
            <p>
               <display-formula id="M5">
                  <m:math name="1471-2105-8-459-i4" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>f</m:mi>
                              <m:mn>2</m:mn>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>N</m:mi>
                                    <m:mtext>b</m:mtext>
                                 </m:msub>
                              </m:mrow>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>N</m:mi>
                                    <m:mrow>
                                       <m:mtext>nb</m:mtext>
                                    </m:mrow>
                                 </m:msub>
                              </m:mrow>
                           </m:mfrac>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munderover>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                       <m:mi>N</m:mi>
                                    </m:munderover>
                                    <m:mrow>
                                       <m:mi>s</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>m</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mi>&#948;</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>v</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mtext>nb</m:mtext>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munderover>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                       <m:mi>N</m:mi>
                                    </m:munderover>
                                    <m:mrow>
                                       <m:mi>s</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>m</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mi>&#948;</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>v</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mtext>b</m:mtext>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOzay2aaSbaaSqaaiabikdaYaqabaGccqGH9aqpjuaGdaWcaaqaaiabd6eaonaaBaaabaGaeeOyaigabeaaaeaacqWGobGtdaWgaaqaaiabb6gaUjabbkgaIbqabaaaamaalaaabaWaaabCaeaacqWGZbWCcqGGOaakcqWGTbqBdaWgaaqaaiabdMgaPbqabaGaeiykaKccciGae8hTdqMaeiikaGIaemODay3aaSbaaeaacqWGPbqAaeqaaiabg2da9iabb6gaUjabbkgaIjabcMcaPaqaaiabdMgaPjabg2da9iabigdaXaqaaiabd6eaobGaeyyeIuoaaeaadaaeWbqaaiabdohaZjabcIcaOiabd2gaTnaaBaaabaGaemyAaKgabeaacqGGPaqkcqWF0oazcqGGOaakcqWG2bGDdaWgaaqaaiabdMgaPbqabaGaeyypa0JaeeOyaiMaeiykaKcabaGaemyAaKMaeyypa0JaeGymaedabaGaemOta4eacqGHris5aaaaaaa@62AE@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Eqs. (4) and (5) are minimized and subjected to following two constraints:</p>
            <p>
               <display-formula id="M6">
                  <m:math name="1471-2105-8-459-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mtext>FP</m:mtext>
                              </m:mrow>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>N</m:mi>
                                    <m:mrow>
                                       <m:mtext>nb</m:mtext>
                                    </m:mrow>
                                 </m:msub>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>&#8804;</m:mo>
                           <m:mfrac>
                              <m:mn>1</m:mn>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#945;</m:mi>
                                    <m:mn>1</m:mn>
                                 </m:msub>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqcfa4aaSaaaeaacqqGgbGrcqqGqbauaeaacqWGobGtdaWgaaqaaiabb6gaUjabbkgaIbqabaaaaOGaeyizImAcfa4aaSaaaeaacqaIXaqmaeaaiiGacqWFXoqydaWgaaqaaiabigdaXaqabaaaaaaa@38F1@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>
               <display-formula id="M7">
                  <m:math name="1471-2105-8-459-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mtext>FN</m:mtext>
                              </m:mrow>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>N</m:mi>
                                    <m:mtext>b</m:mtext>
                                 </m:msub>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>&#8804;</m:mo>
                           <m:mfrac>
                              <m:mn>1</m:mn>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#945;</m:mi>
                                    <m:mn>2</m:mn>
                                 </m:msub>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqcfa4aaSaaaeaacqqGgbGrcqqGobGtaeaacqWGobGtdaWgaaqaaiabbkgaIbqabaaaaOGaeyizImAcfa4aaSaaaeaacqaIXaqmaeaaiiGacqWFXoqydaWgaaqaaiabikdaYaqabaaaaaaa@378C@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>s</it>(<it>m</it><sub><it>i</it></sub>) denotes the score computed for the most likely motif instance <it>m</it><sub><it>i </it></sub>of sequence <it>x</it><sub><it>i </it></sub>of the training dataset, and Kronecker <it>&#948; </it>is one when the argument is satisfied and otherwise is zero. <it>N</it><sub>b </sub>and <it>N</it><sub>nb </sub>are the total counts of binders and non-binders in the dataset. The constant <it>&#954;</it><sub>1 </sub>(><it>N</it><sub>b</sub>/<it>N</it><sub>nb </sub>for <it>N</it><sub>b </sub>> <it>N</it><sub>nb</sub>, or vice versa) was empirically determined to minimize the number of false positives. The two parameters <it>&#945;</it><sub>1 </sub>(&lt;&lt;<it>N</it><sub>nb</sub>) and <it>&#945;</it><sub>2 </sub>(&lt;&lt;<it>N</it><sub>b</sub>) are set to minimize FP and FN rates, respectively. If none of the individuals satisfies the above constraints, MOEA reports no feasible solution. Given the training set, a few trial runs with different initializations are necessary to determine the best values of <it>&#945;</it><sub>1 </sub>and <it>&#945;</it><sub>2</sub>.</p>
         </sec>
         <sec>
            <st>
               <p>Scoring of Experimental Motifs</p>
            </st>
            <p>The description of an experimental <it>k</it>-mer motif conveys three kinds of information at each site: (1) the amino acid occupied, (2) the tolerance level of the amino acid, and (3) the strength of binding. Let us denote a <it>k</it>-mer motif validated in experiment "e" by <it>m</it>(e) and the tolerance level of the residue at site <it>j </it>by <it>&#961;</it><sub><it>j </it></sub>where <it>&#961;</it><sub><it>j </it></sub>&#8712; {well, weak, unknown, non &#8211; tolerated}. The binding strength of site <it>j </it>is expressed by <it>&#963;</it><sub><it>j </it></sub>&#8712; {primary &#8211; anchor, secondary &#8211; anchor, other}. Then, the binding score for a <it>k</it>-mer experimental motif is given by</p>
            <p>
               <display-formula id="M8">
                  <m:math name="1471-2105-8-459-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>s</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>m</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mtext>e</m:mtext>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munderover>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mi>j</m:mi>
                                    <m:mo>=</m:mo>
                                    <m:mn>1</m:mn>
                                 </m:mrow>
                                 <m:mi>k</m:mi>
                              </m:munderover>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#961;</m:mi>
                                    <m:mi>j</m:mi>
                                 </m:msub>
                                 <m:mo>&#8901;</m:mo>
                                 <m:msub>
                                    <m:mi>&#963;</m:mi>
                                    <m:mi>j</m:mi>
                                 </m:msub>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4CamNaeiikaGIaemyBa0MaeiikaGIaeeyzauMaeiykaKIaeiykaKIaeyypa0ZaaabCaeaaiiGacqWFbpGCdaWgaaWcbaGaemOAaOgabeaakiabgwSixlab=n8aZnaaBaaaleaacqWGQbGAaeqaaaqaaiabdQgaQjabg2da9iabigdaXaqaaiabdUgaRbqdcqGHris5aaaa@4482@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
         </sec>
         <sec>
            <st>
               <p>Guided-discovery of Motif</p>
            </st>
            <p>In this algorithm, we assume that experimentally determined motifs are available along with the experimental datasets. An MOEA is proposed to determine a motif closer to experimental motifs. An objective function <it>f</it><sub>3 </sub>is proposed to best represent the characteristics of the motif that is close to the knowledge embedded in the experimental motifs:</p>
            <p>
               <display-formula id="M9">
                  <m:math name="1471-2105-8-459-i8" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>f</m:mi>
                              <m:mn>3</m:mn>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munder>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mtext>e</m:mtext>
                              </m:munder>
                              <m:mrow>
                                 <m:mrow>
                                    <m:mo>|</m:mo>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mi>Q</m:mi>
                                          <m:mo>^</m:mo>
                                       </m:mover>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mi>Q</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>m</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mtext>e</m:mtext>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                    <m:mo>|</m:mo>
                                 </m:mrow>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOzay2aaSbaaSqaaiabiodaZaqabaGccqGH9aqpdaaeqbqaamaaemaabaGafmyuaeLbaKaacqGHsislcqWGrbqucqGGOaakcqWGTbqBcqGGOaakcqqGLbqzcqGGPaqkcqGGPaqkaiaawEa7caGLiWoaaSqaaiabbwgaLbqab0GaeyyeIuoaaaa@3FA7@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <inline-formula><m:math name="1471-2105-8-459-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>Q</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmyuaeLbaKaaaaa@2D0E@</m:annotation></m:semantics></m:math></inline-formula> denotes the estimated PSSM of the motif. We use the same objective function in Eq. (4) to accurately predict binders of the training dataset. The MOEA minimizes the objective functions given in Eqs. (4) and (9), subjected to the two constraints given in Eqs. (6) and (7). The summation in Eq. (9) is taken over all the experimental motifs and |<inline-formula><m:math name="1471-2105-8-459-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>Q</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmyuaeLbaKaaaaa@2D0E@</m:annotation></m:semantics></m:math></inline-formula> - <it>Q</it>(<it>m</it>(e))| is the sum of squares of differences between individual elements of weight matrices <inline-formula><m:math name="1471-2105-8-459-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>Q</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmyuaeLbaKaaaaa@2D0E@</m:annotation></m:semantics></m:math></inline-formula> and <it>Q</it>(<it>m</it>(e)). The knowledge of the experimental motif is incorporated to the consensus motif adaptively with the distance function used in <it>f</it><sub>3</sub>. Further, the fitness <it>f</it><sub>1 </sub>optimizes the specificity and sensitivity of the prediction of binders.</p>
            <p>The elements in the PSSM of experimental motifs are set to values within the same range [0, 2<sup><it>&#952;</it></sup>-1] as before. The following procedure is adopted to determine the elements of <it>Q</it>(<it>m</it>(e)): a well tolerated amino acid at an anchor position of the motif receives the highest possible score of 2<sup><it>&#952;</it></sup>-1; the lowest score of zero is assigned to a non-tolerated residue; weakly tolerated residues and residues at secondary anchor positions receive of (2<sup><it>&#952;</it></sup>-1)/2; and all the other unknown positions receives a score of (2<sup><it>&#952;</it></sup>-1)/3.</p>
         </sec>
         <sec>
            <st>
               <p>Performance Comparison</p>
            </st>
            <p>The binding scores of I-A<sup>g7 </sup>experimental motifs were computed using Eq. (8) by assigning the following values for binding strengths: primary = 4, secondary = 2, and others = 1, and for anchor positions: well = 4, weak = 2, non-tolerated = -4, and unknown = 0. The experimentally determined motifs were used with peptide data in the guided-discovery of motifs.</p>
            <p>We used AUC to compare performance of the proposed methods with earlier approaches <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B34">34</abbr></abbrgrp> and experimental motifs <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>. Whether a peptide is a binder or a non-binder is determined by a threshold of the binding score. By varying this threshold, the ROC curve was plotted, from which AUC value was obtained. A comparison of performances of the methods is given in Figure <figr fid="F1">1</figr>.</p>
            <p>In order to compare to the MEME method, only binders in the I-A<sup>g7 </sup>training set were submitted to MEME motif discovery tool at the prediction server <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>. The motif of 9-mer length was obtained with the following options: zero or one motif per sequence, minimum and maximum width = 9. The performance accuracy of RANKPEP approach on the testing dataset was carried out by uploading the dataset to the online prediction server at <abbrgrp><abbr bid="B72">72</abbr></abbrgrp> with a 4% binding threshold <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Benchmark Datasets</p>
            </st>
            <p>The proposed self-discovery approach was tested on BM-Set1, i.e., HLA-DRB1*0401, which consists of one training set and 10 testing datasets and had been earlier used to benchmark a number of motif finding algorithms <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B32">32</abbr><abbr bid="B73">73</abbr></abbrgrp>. The performance of MOEA was compared with earlier methods <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B10">10</abbr><abbr bid="B32">32</abbr><abbr bid="B35">35</abbr></abbrgrp>.</p>
            <p>The training set consisting of binders and non-binders was assembled as follows: an ensemble of 532 unique binding peptides were extracted from SYFPEITHI <abbrgrp><abbr bid="B44">44</abbr></abbrgrp> and MHCPEP <abbrgrp><abbr bid="B63">63</abbr></abbrgrp> databases and a set of 177 unique non-binders were extracted from the MHCBN database <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. The datasets were pre-processed by removing peptides that did not allow a hydrophobic residue at P1 position of all putative 9-mer binding cores and unnatural peptides containing more than 75% alanine <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. The preprocessed binder set has 456 unique peptides with a length distribution ranging from 9 to 30 amino acid residues.</p>
            <p>Of the 10 testing datasets, 8 datasets were taken from the MHC-bench as described in <abbrgrp><abbr bid="B74">74</abbr></abbrgrp>. The other 2 datasets were extracted from experiments described by Southwood <abbrgrp><abbr bid="B75">75</abbr></abbrgrp> and Geluk <abbrgrp><abbr bid="B76">76</abbr></abbrgrp>. An affinity of (IC<sub>50 </sub>= 1000 nM) was taken as the threshold for peptide binding as described in <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. Homology reduction had been carried out on all datasets in order to reduce the chances of over-fitting due to the redundancy of datasets. The peptides in the non-redundant (NR) datasets had sequence similarities less than 90%. The number of binders and non-binders in the original and NR datasets are given in Table <tblr tid="T5">5</tblr>.</p>
            <p>We tested our method on BM-Set2 comprising of 3 mouse alleles and 13 HLA alleles made available at <abbrgrp><abbr bid="B77">77</abbr></abbrgrp>. These quantitative peptide datasets had been extracted from the IEDB at <abbrgrp><abbr bid="B78">78</abbr></abbrgrp>. The number of binders and non-binders in each dataset is given in Table <tblr tid="T7">7</tblr>. The DRB3-0101 allele dataset was excluded from the benchmark dataset because of the significant imbalance between binders and non-binders (3 binders and 99 non-binders). With this dataset, we compared our method with <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B32">32</abbr><abbr bid="B35">35</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Parameters of MOEA</p>
            </st>
            <p>The range of positional scores was set with <it>&#952; </it>= 7. For each run of MOEA, the population size <it>M </it>= 500, crossover probability <it>p</it><sub><it>c </it></sub>= 0.9, and mutation probability <it>p</it><sub><it>m </it></sub>= 0.005 were used. The process was terminated after 300 generations as no significant improvement in the convergence was observed during the experimental trial sessions. The parameters of the fitness functions were empirically determined for optimum performance within the following ranges: <it>&#954;</it><sub>1 </sub>= 1~2.5, <it>&#945;</it><sub>1 </sub>= 5.0&#8211;6.0, and <it>&#945;</it><sub>2 </sub>= 1.0&#8211;2.0. The parameters <it>&#954;</it><sub>1 </sub>= 2.5, <it>&#945;</it><sub>1 </sub>= 6.0, and <it>&#945;</it><sub>2 </sub>= 2.0 were found to work well empirically for both datasets.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>MR and VB conceived the study; MR designed experiments and performed computational analysis; MR, BS, VB and LF wrote the manuscript. All authors read and corrected the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors would like to thank Dr. Tim Oliver for proof reading the manuscript. We are also grateful to the anonymous reviewers whose comments significantly improved the paper.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Antigenic peptide binding by class I and class II histocompatibility proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Stern</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Wiley</snm>
                  <fnm>DC</fnm>
               </au>
            </aug>
            <source>Behring Inst Mitt</source>
            <pubdate>1994</pubdate>
            <issue>94</issue>
            <fpage>1</fpage>
            <lpage>10</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7998902</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Precise prediction of major histocompatibility complex class II-peptide interaction based on peptide side chain scanning</p>
            </title>
            <aug>
               <au>
                  <snm>Hammer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bono</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Gallazzi</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Belunis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Nagy</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Sinigaglia</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>J Exp Med</source>
            <pubdate>1994</pubdate>
            <volume>180</volume>
            <issue>6</issue>
            <fpage>2353</fpage>
            <lpage>2358</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7964508</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>MHC ligands and peptide motifs: first listing</p>
            </title>
            <aug>
               <au>
                  <snm>Rammensee</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>Friede</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Stevanoviic</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Immunogenetics</source>
            <pubdate>1995</pubdate>
            <volume>41</volume>
            <issue>4</issue>
            <fpage>178</fpage>
            <lpage>228</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7890324</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Predicting peptides that bind to MHC molecules using supervised learning of hidden Markov models</p>
            </title>
            <aug>
               <au>
                  <snm>Mamitsuka</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>1998</pubdate>
            <volume>33</volume>
            <issue>4</issue>
            <fpage>460</fpage>
            <lpage>474</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9849933</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Allele-specific motifs revealed by sequencing of self-peptides eluted from MHC molecules</p>
            </title>
            <aug>
               <au>
                  <snm>Falk</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Rotzschke</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Stevanovic</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jung</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Rammensee</snm>
                  <fnm>HG</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1991</pubdate>
            <volume>351</volume>
            <issue>6324</issue>
            <fpage>290</fpage>
            <lpage>296</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">1709722</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Prominent role of secondary anchor residues in peptide binding to HLA-A2.1 molecules</p>
            </title>
            <aug>
               <au>
                  <snm>Ruppert</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sidney</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Celis</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Kubo</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Grey</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Sette</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1993</pubdate>
            <volume>74</volume>
            <issue>5</issue>
            <fpage>929</fpage>
            <lpage>937</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8104103</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Importance of peptide amino and carboxyl termini to the stability of MHC class I molecules</p>
            </title>
            <aug>
               <au>
                  <snm>Bouvier</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wiley</snm>
                  <fnm>DC</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1994</pubdate>
            <volume>265</volume>
            <issue>5170</issue>
            <fpage>398</fpage>
            <lpage>402</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8023162</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>SVRMHC prediction server for MHC-binding peptides</p>
            </title>
            <aug>
               <au>
                  <snm>Wan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Y</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Flower</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>463</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1626489</pubid>
                  <pubid idtype="pmpid" link="fulltext">17059589</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Towards the insilico identification of class II restricted T-cell epitopes: a partial least squares iterative self-consistent algorithm for affinity prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Doytchinova</snm>
                  <fnm>IA</fnm>
               </au>
               <au>
                  <snm>Flower</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>17</issue>
            <fpage>2263</fpage>
            <lpage>2270</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14630655</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Automated generation and evaluation of specific MHC binding predictive tools: ARB matrix applications</p>
            </title>
            <aug>
               <au>
                  <snm>Bui</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sidney</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Peters</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sathiamurthy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sinichi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Purton</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Moth&#233;</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Chisari</snm>
                  <fnm>FV</fnm>
               </au>
               <au>
                  <snm>Watkins</snm>
                  <fnm>DI</fnm>
               </au>
               <au>
                  <snm>Sette</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Immunogenetics</source>
            <pubdate>2005</pubdate>
            <volume>57</volume>
            <issue>5</issue>
            <fpage>304</fpage>
            <lpage>314</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15868141</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method</p>
            </title>
            <aug>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lundegaard</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lund</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <issue>238</issue>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Using a neural network to identify potential HLA-DR1 binding sites within proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Bisset</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fierz</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>J Mol Recognition</source>
            <pubdate>1994</pubdate>
            <volume>6</volume>
            <fpage>41</fpage>
            <lpage>48</lpage>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Prediction of MHC binding peptides using artificial neural networks</p>
            </title>
            <aug>
               <au>
                  <snm>Brusic</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Rudy</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Harrison</snm>
                  <fnm>LC</fnm>
               </au>
            </aug>
            <source>Complex Systems: Mechanism of Adaptation</source>
            <publisher>Amsterdam: IOS Press</publisher>
            <editor>Stonier R, Yu XS</editor>
            <pubdate>1994</pubdate>
            <fpage>253</fpage>
            <lpage>260</lpage>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Prediction of binding to MHC class I molecules</p>
            </title>
            <aug>
               <au>
                  <snm>Adams</snm>
                  <fnm>HP</fnm>
               </au>
               <au>
                  <snm>Koziol</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>J Immunol Methods</source>
            <pubdate>1995</pubdate>
            <volume>185</volume>
            <issue>2</issue>
            <fpage>181</fpage>
            <lpage>190</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7561128</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Two complementary methods for predicting peptide binding major histocompatibility complex molecules</p>
            </title>
            <aug>
               <au>
                  <snm>Gulukota</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sidney</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sette</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>DeLisi</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1997</pubdate>
            <volume>267</volume>
            <fpage>1258</fpage>
            <lpage>1267</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9150410</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Predictive Bayesian neural network models of MHC class II peptide binding</p>
            </title>
            <aug>
               <au>
                  <snm>Burden</snm>
                  <fnm>FR</fnm>
               </au>
               <au>
                  <snm>Winkler</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>J Mol Graph Model</source>
            <pubdate>2005</pubdate>
            <volume>23</volume>
            <issue>6</issue>
            <fpage>481</fpage>
            <lpage>489</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15878832</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Hidden Markov model-based prediction of antigenic peptides that interact with MHC class II molecules</p>
            </title>
            <aug>
               <au>
                  <snm>Noguchi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kato</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hanai</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Matsubara</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Honda</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Brusic</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Kobayashi</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>J Biosci Bioeng</source>
            <pubdate>2002</pubdate>
            <volume>94</volume>
            <issue>3</issue>
            <fpage>264</fpage>
            <lpage>270</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16233301</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Prediction of MHC class I binding peptides, using SVMHC</p>
            </title>
            <aug>
               <au>
                  <snm>Donnes</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Elofsson</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>25</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">129981</pubid>
                  <pubid idtype="pmpid" link="fulltext">12225620</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Application of support vector machines for T-cell epitopes prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Zhao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Pinilla</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Valmori</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>15</issue>
            <fpage>1978</fpage>
            <lpage>1984</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14555632</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>MHCBN: A comprehensive database of MHC binding and non-binding peptides</p>
            </title>
            <aug>
               <au>
                  <snm>Bhasin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Raghava</snm>
                  <fnm>GPS</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <fpage>665</fpage>
            <lpage>666</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12651731</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Predicting Class II MHC-Peptide binding: a kernel based approach using similarity scores</p>
            </title>
            <aug>
               <au>
                  <snm>Salomon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Flower</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>551</fpage>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Prediction of peptide binding to major histocompatibility complex class II molecules through use of bossted fuzzy classifier with SWEEP operator method</p>
            </title>
            <aug>
               <au>
                  <snm>Takahashi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Honda</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Bioscience and Bioengineering</source>
            <pubdate>2006</pubdate>
            <volume>101</volume>
            <issue>2</issue>
            <fpage>137</fpage>
            <lpage>141</lpage>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Class II MHC quantitative binding motifs derived from a large molecular database with a versatile iterative stepwise discriminant analysis meta-algorithm</p>
            </title>
            <aug>
               <au>
                  <snm>Mallios</snm>
                  <fnm>RR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <issue>6</issue>
            <fpage>432</fpage>
            <lpage>439</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10383468</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Predicting class II MHC/peptide multi-level binding with an iterative stepwise discriminant analysis meta-algorithm</p>
            </title>
            <aug>
               <au>
                  <snm>Mallios</snm>
                  <fnm>RR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <issue>10</issue>
            <fpage>942</fpage>
            <lpage>948</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11673239</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Prediction of MHC class II binding peptides based on an iterative learning model</p>
            </title>
            <aug>
               <au>
                  <snm>Murugan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Immunome Res</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <issue>6</issue>
            <fpage>10</fpage>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Prediction of MHC class II binders using the ant colony search strategy</p>
            </title>
            <aug>
               <au>
                  <snm>Karpenko</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Shi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Artif Intell Medicine</source>
            <pubdate>2005</pubdate>
            <volume>35</volume>
            <issue>1&#8211;2</issue>
            <fpage>47</fpage>
            <lpage>56</lpage>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Unsupervised learning of multiple motifs in biopolymers using expectation maximization</p>
            </title>
            <aug>
               <au>
                  <snm>Bailey</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Elkan</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Machine Learning</source>
            <pubdate>1995</pubdate>
            <volume>21</volume>
            <fpage>51</fpage>
            <lpage>80</lpage>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Fitting a mixture model by expectation maximization to discover motifs in biopolymers</p>
            </title>
            <aug>
               <au>
                  <snm>Bailey</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Charles</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Second International Conference on Intelligent Systems for Molecular Biology</source>
            <publisher>AAAI Press, Menlo Park, California</publisher>
            <pubdate>1994</pubdate>
            <fpage>28</fpage>
            <lpage>36</lpage>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Gibbs motif sampling: detection of bacterial outer membrane protein repeats</p>
            </title>
            <aug>
               <au>
                  <snm>Neuwald</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>CE</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>1995</pubdate>
            <volume>4</volume>
            <issue>8</issue>
            <fpage>1618</fpage>
            <lpage>1632</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8520488</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>A Gibbs sampling method to detect overrepresented motifs in the upstream regions of coexpressed genes</p>
            </title>
            <aug>
               <au>
                  <snm>Thijs</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Marchal</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lescot</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rombauts</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>De Moor</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Rouze</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Moreau</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2002</pubdate>
            <volume>9</volume>
            <issue>2</issue>
            <fpage>447</fpage>
            <lpage>464</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12015892</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Boguski</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Neuwald</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Wootton</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1993</pubdate>
            <volume>262</volume>
            <issue>5131</issue>
            <fpage>208</fpage>
            <lpage>214</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8211139</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Improved prediction of MHC class I and class II epitopes using a novel Gibbs sampling approach</p>
            </title>
            <aug>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lundegaard</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Worning</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hvid</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Lamberth</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Buus</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lund</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>9</issue>
            <fpage>1388</fpage>
            <lpage>1397</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14962912</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Prediction of MHC class I binding peptides using profile motifs</p>
            </title>
            <aug>
               <au>
                  <snm>Reche</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Glutting</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Reinherz</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>Hum Immunol</source>
            <pubdate>2002</pubdate>
            <volume>63</volume>
            <issue>9</issue>
            <fpage>701</fpage>
            <lpage>709</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12175724</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Enhancement to the RANKPEP resource for the prediction of peptide binding to MHC molecules using profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Reche</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Glutting</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Reinherz</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>Immunogenetics</source>
            <pubdate>2004</pubdate>
            <volume>56</volume>
            <issue>6</issue>
            <fpage>405</fpage>
            <lpage>419</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15349703</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices</p>
            </title>
            <aug>
               <au>
                  <snm>Sturniolo</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bono</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Jiayi</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Raddrizzani</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tuereci</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Sahin</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Braxenthaler</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gallazzi</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Protti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sinigaglia</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hammer</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nature Biotech</source>
            <pubdate>1999</pubdate>
            <volume>17</volume>
            <issue>6</issue>
            <fpage>555</fpage>
            <lpage>561</lpage>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Application of genetic search in derivation of matrix models of peptide binding to MHC molecules</p>
            </title>
            <aug>
               <au>
                  <snm>Brusic</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Schonbach</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Takiguchi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ciesielski</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Harrison</snm>
                  <fnm>LC</fnm>
               </au>
            </aug>
            <source>Proc Int Conf Intell Syst Mol Biol</source>
            <pubdate>1997</pubdate>
            <volume>5</volume>
            <fpage>75</fpage>
            <lpage>83</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9322018</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Multi-Objecitve Evolutionary Algorithm for Discovering Peptide Binding Motifs</p>
            </title>
            <aug>
               <au>
                  <snm>Rajapakse</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Brusic</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Applications of Evolutionary Computing</source>
            <publisher>Lecture Notes in Computer Science, Springer</publisher>
            <pubdate>2006</pubdate>
            <volume>3907</volume>
            <fpage>149</fpage>
            <lpage>158</lpage>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Self peptides isolated from MHC glycoproteins of non-obese diabetic mice</p>
            </title>
            <aug>
               <au>
                  <snm>Reich</snm>
                  <fnm>EP</fnm>
               </au>
               <au>
                  <snm>von Grafenstein</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Barlow</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Swenson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Janeway</snm>
                  <fnm>CA</fnm>
                  <suf>Jr</suf>
               </au>
            </aug>
            <source>J Immunol</source>
            <pubdate>1994</pubdate>
            <volume>152</volume>
            <issue>5</issue>
            <fpage>2279</fpage>
            <lpage>2288</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8133041</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Encephalitogenic epitopes of myelin basic protein, proteolipid protein, myelin oligodendrocyte glycoprotein for experimental allergic encephalomyelitis induction in Biozzi ABH (H-2Ag7) mice share an amino acid motif</p>
            </title>
            <aug>
               <au>
                  <snm>Amor</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>O'Neill</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Wraith</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Groome</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Travers</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Baker</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J Immunol</source>
            <pubdate>1996</pubdate>
            <volume>156</volume>
            <issue>8</issue>
            <fpage>3000</fpage>
            <lpage>3008</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8609422</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Molecular characterization of the diabetes-associated mouse MHC class II protein, I-Ag7</p>
            </title>
            <aug>
               <au>
                  <snm>Reizis</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Eisenstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bockova</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Konen-Waisman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mor</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Elias</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cohen</snm>
                  <fnm>IR</fnm>
               </au>
            </aug>
            <source>Int Immunol</source>
            <pubdate>1997</pubdate>
            <volume>9</volume>
            <issue>1</issue>
            <fpage>43</fpage>
            <lpage>51</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9043946</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>A peptide-binding motif for I-A(g7), the class II major histocompatibility complex (MHC) molecule of NOD and Biozzi AB/H mice</p>
            </title>
            <aug>
               <au>
                  <snm>Harrison</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>Honeyman</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Trembleau</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gregori</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gallazzi</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Augstein</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Brusic</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Hammer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Adorini</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>J Exp Med</source>
            <pubdate>1997</pubdate>
            <volume>185</volume>
            <issue>6</issue>
            <fpage>1013</fpage>
            <lpage>1021</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9091575</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Structural basis of peptide binding and presentation by the type I diabetes-associated MHC class II molecule of NOD mice</p>
            </title>
            <aug>
               <au>
                  <snm>Latek</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Suri</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Petzold</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Kanagawa</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Unanue</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>Fremont</snm>
                  <fnm>DH</fnm>
               </au>
            </aug>
            <source>Immunity</source>
            <pubdate>2000</pubdate>
            <volume>12</volume>
            <issue>6</issue>
            <fpage>699</fpage>
            <lpage>710</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10894169</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>The motif for peptide binding to the insulin-dependent diabetes mellitus-associated class II MHC molecule I-Ag7 validated by phage display library</p>
            </title>
            <aug>
               <au>
                  <snm>Gregori</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bono</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Gallazzi</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hammer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Harrison</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>Adorini</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Int Immunol</source>
            <pubdate>2000</pubdate>
            <volume>12</volume>
            <issue>4</issue>
            <fpage>493</fpage>
            <lpage>503</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10744651</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>SYFPEITHI: database for MHC ligands and peptide motifs</p>
            </title>
            <aug>
               <au>
                  <snm>Rammensee</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bachmann</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Emmerich</snm>
                  <fnm>NP</fnm>
               </au>
               <au>
                  <snm>Bachor</snm>
                  <fnm>OA</fnm>
               </au>
               <au>
                  <snm>Stevanovic</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Immunogenetics</source>
            <pubdate>1999</pubdate>
            <volume>50</volume>
            <issue>3&#8211;4</issue>
            <fpage>213</fpage>
            <lpage>219</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10602881</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>The lack of consensus for I-A(g7)-peptide binding motifs: is there a requirement for anchor amino acid side chains?</p>
            </title>
            <aug>
               <au>
                  <snm>Carrasco-Marin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Kanagawa</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Unanue</snm>
                  <fnm>ER</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <issue>15</issue>
            <fpage>8621</fpage>
            <lpage>8626</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">17566</pubid>
                  <pubid idtype="pmpid" link="fulltext">10411925</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>The I-Ag7 MHC class II molecule linked to murine diabetes is a promiscuous peptide binder</p>
            </title>
            <aug>
               <au>
                  <snm>Stratmann</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Apostolopoulos</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Mallet-Designe</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Corper</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>IA</fnm>
               </au>
               <au>
                  <snm>Kang</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Teyton</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>J Immunology</source>
            <pubdate>2000</pubdate>
            <volume>165</volume>
            <issue>6</issue>
            <fpage>3214</fpage>
            <lpage>3225</lpage>
         </bibl>
         <bibl id="B47">
            <title>
               <p>The class II MHC I-Ag7 molecules from non-obese diabetic mice are poor peptide binders</p>
            </title>
            <aug>
               <au>
                  <snm>Carrasco-Marin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Shimizu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kanagawa</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Unanue</snm>
                  <fnm>ER</fnm>
               </au>
            </aug>
            <source>J Immunol</source>
            <pubdate>1996</pubdate>
            <volume>156</volume>
            <issue>2</issue>
            <fpage>450</fpage>
            <lpage>458</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8543793</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>In APCs, the autologous peptides selected by the diabetogenic I-Ag7 molecule are unique and determined by the amino acid changes in the P9 pocket</p>
            </title>
            <aug>
               <au>
                  <snm>Suri</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Vidavsky</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>van der Drift</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kanagawa</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Gross</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Unanue</snm>
                  <fnm>ER</fnm>
               </au>
            </aug>
            <source>J Immunol</source>
            <pubdate>2002</pubdate>
            <volume>168</volume>
            <issue>3</issue>
            <fpage>1235</fpage>
            <lpage>1243</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11801660</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>GANN: genetic algorithm neural networks for the detection of conserved combinations of features in DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Beiko</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Charlebois</snm>
                  <fnm>RL</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>1</issue>
            <fpage>36</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">553964</pubid>
                  <pubid idtype="pmpid" link="fulltext">15725347</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Discovery of sequence motifs related to coexpression of genes using evolutionary computation</p>
            </title>
            <aug>
               <au>
                  <snm>Fogel</snm>
                  <fnm>GB</fnm>
               </au>
               <au>
                  <snm>Weekes</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Varga</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Dow</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>Harlow</snm>
                  <fnm>HB</fnm>
               </au>
               <au>
                  <snm>Onyia</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>13</issue>
            <fpage>3826</fpage>
            <lpage>3835</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">506801</pubid>
                  <pubid idtype="pmpid" link="fulltext">15266008</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>FMGA: Finding Motifs by Genetic Algorithm</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Tsai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shih</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>IEEE BIBE</source>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Human promoter prediction based on sorted consensus sequence patterns by genetic algorithms</p>
            </title>
            <aug>
               <au>
                  <snm>Lo</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Changchien</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Intl congr on Biological and Medical Engineering</source>
            <pubdate>2002</pubdate>
            <fpage>111</fpage>
            <lpage>112</lpage>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Evolving Core Promoter Signal Motifs</p>
            </title>
            <aug>
               <au>
                  <snm>Corne</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Meade</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sibly</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>IEEE Congress on Evolutionary Computation</source>
            <pubdate>2001</pubdate>
            <fpage>1162</fpage>
            <lpage>1169</lpage>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Evolutionary Computation in Bioinformatics</p>
            </title>
            <aug>
               <au>
                  <snm>Fogel</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Corne</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <publisher>Morgan Kaufman publishers</publisher>
            <pubdate>2003</pubdate>
         </bibl>
         <bibl id="B55">
            <title>
               <p>An Introduction to Genetic Algorithms</p>
            </title>
            <aug>
               <au>
                  <snm>Mitchell</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <publisher>MIT press</publisher>
            <pubdate>1999</pubdate>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Multi-Objective Optimization Using Evolutionary Algorithms</p>
            </title>
            <aug>
               <au>
                  <snm>Deb</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <publisher>Wiley publishers</publisher>
            <pubdate>2001</pubdate>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Adaptation in Natural and Artificial Systems</p>
            </title>
            <aug>
               <au>
                  <snm>Holland</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <publisher>Ann Arbor, MI: University of Michigan Press</publisher>
            <pubdate>1975</pubdate>
         </bibl>
         <bibl id="B58">
            <title>
               <p>A Fast and Elitist Multiobjective Genetic Algorithm:NSGA-II</p>
            </title>
            <aug>
               <au>
                  <snm>Deb</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Pratap</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Agrawal</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Meyarivan</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>IEEE Trans on Evolutionary Computation</source>
            <pubdate>2002</pubdate>
            <volume>6</volume>
            <issue>2</issue>
            <fpage>182</fpage>
            <lpage>197</lpage>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Multiobjective Evolutionary Algorithms: A Comparative Case Study and the Strength of Pareto Approach</p>
            </title>
            <aug>
               <au>
                  <snm>Zitzler</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Thiele</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>IEEE Trans on Evolutionary Computation</source>
            <pubdate>1999</pubdate>
            <volume>3</volume>
            <fpage>257</fpage>
            <lpage>271</lpage>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Approximating the Nondominant front using the Pareto Archived evolution strategy</p>
            </title>
            <aug>
               <au>
                  <snm>Knowles</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Evolutionary Computation</source>
            <publisher>MIT Press</publisher>
            <pubdate>2000</pubdate>
            <volume>8</volume>
            <issue>Summer</issue>
            <fpage>49</fpage>
            <lpage>172</lpage>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Genetic Algorithms for Multiobjective Optimization: Formulation, discussion and generalization</p>
            </title>
            <aug>
               <au>
                  <snm>Fonseca</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Fleming</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>the fifth Intl conference on Genetic Algorithms</source>
            <publisher>San Mateo, CA: Morgan Kauffman</publisher>
            <pubdate>1993</pubdate>
            <fpage>416</fpage>
            <lpage>423</lpage>
         </bibl>
         <bibl id="B62">
            <title>
               <p>A Structural Framework for Deciphering the Link Between I-Ag7 and Autoimmune Diabetes</p>
            </title>
            <aug>
               <au>
                  <snm>Corper</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Stratmann</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Apostolopoulos</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Garcia</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Kang</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>IA</fnm>
               </au>
               <au>
                  <snm>Teyton</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Science</source>
            <volume>288</volume>
            <issue>5465</issue>
            <fpage>505</fpage>
            <lpage>511</lpage>
            <note>21 April 2000</note>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10775108</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>MHCPEP, a database of MHC-binding peptides: update 1997</p>
            </title>
            <aug>
               <au>
                  <snm>Brusic</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Rudy</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Harrison</snm>
                  <fnm>LC</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1998</pubdate>
            <volume>26</volume>
            <issue>1</issue>
            <fpage>368</fpage>
            <lpage>371</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">147255</pubid>
                  <pubid idtype="pmpid" link="fulltext">9399876</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Binding of conserved islet peptides by human and murine MHC class II molecules associated with susceptibility to type I diabetes</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gauthier</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hausmann</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Wucherpfennig</snm>
                  <fnm>KW</fnm>
               </au>
            </aug>
            <source>Eur J Immunol</source>
            <pubdate>2000</pubdate>
            <volume>30</volume>
            <issue>9</issue>
            <fpage>2497</fpage>
            <lpage>2506</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11009082</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <aug>
               <au>
                  <snm>Webb</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Statistical Pattern Recognition</source>
            <publisher>John Wiley &amp; Sons</publisher>
            <edition>2</edition>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Measuring the accuracy of diagnostic systems</p>
            </title>
            <aug>
               <au>
                  <snm>Swets</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1988</pubdate>
            <volume>240</volume>
            <issue>4857</issue>
            <fpage>1285</fpage>
            <lpage>1293</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3287615</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Structure-based prediction of binding peptides to MHC class I molecules: Application to a broad range of MHC alleles</p>
            </title>
            <aug>
               <au>
                  <snm>Schueler-Furman</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Altuvia</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Sette</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Margalit</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>2000</pubdate>
            <volume>9</volume>
            <fpage>1838</fpage>
            <lpage>1846</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11045629</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Sequence logos: a new way to display consensus sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Schneider</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Stephens</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>1990</pubdate>
            <volume>18</volume>
            <issue>20</issue>
            <fpage>6097</fpage>
            <lpage>6100</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">332411</pubid>
                  <pubid idtype="pmpid" link="fulltext">2172928</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>Peptide length-based prediction of peptide-MHC class II binding</p>
            </title>
            <aug>
               <au>
                  <snm>Chang</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kirschner</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Linderman</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>22</issue>
            <fpage>2761</fpage>
            <lpage>2767</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17000752</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Naturally Processed HLA Class II Peptides Reveal Highly Conserved Immunogenic Flanking Region Sequence Preferences That Reflect Antigen Processing Rather Than Peptide-MHC Interactions</p>
            </title>
            <aug>
               <au>
                  <snm>Godkin</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Willis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tejada-Simon</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Elliott</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>AVS</fnm>
               </au>
            </aug>
            <source>Immunology</source>
            <pubdate>2001</pubdate>
            <volume>166</volume>
            <issue>11</issue>
            <fpage>6720</fpage>
            <lpage>6727</lpage>
         </bibl>
         <bibl id="B71">
            <title>
               <p>MEME</p>
            </title>
            <url>http://meme.sdsc.edu/meme/</url>
         </bibl>
         <bibl id="B72">
            <title>
               <p>RANKPEP</p>
            </title>
            <url>http://bio.dfci.harvard.edu/Tools/rankpep.html</url>
         </bibl>
         <bibl id="B73">
            <title>
               <p>SVM based method for predicting HLA-DRB1*0401 binding peptides in an antigen sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Bhasin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Raghava</snm>
                  <fnm>GP</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>3</issue>
            <fpage>421</fpage>
            <lpage>423</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14960470</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>MHCBench</p>
            </title>
            <url>http://www.imtech.res.in/raghava/mhcbench</url>
         </bibl>
         <bibl id="B75">
            <title>
               <p>Several common HLA-DR types share largely overlapping peptide binding repertoires</p>
            </title>
            <aug>
               <au>
                  <snm>Southwood</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sidney</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kondo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>del Guercio</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Appella</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hoffman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kubo</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Chestnut</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Grey</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Sette</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Immunology</source>
            <pubdate>1998</pubdate>
            <volume>160</volume>
            <fpage>3363</fpage>
            <lpage>3373</lpage>
         </bibl>
         <bibl id="B76">
            <title>
               <p>HLA-DR binding analysis of peptides from islet antigens in IDDM</p>
            </title>
            <aug>
               <au>
                  <snm>Geluk</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>van Meijgaarden</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Schloot</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Drijfhout</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ottenhoff</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Roep</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Diabetes</source>
            <pubdate>1998</pubdate>
            <volume>47</volume>
            <issue>1584&#8211;1600</issue>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9753297</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <title>
               <p>NetMHCII</p>
            </title>
            <url>http://www.cbs.dtu.dk/services/NetMHCII</url>
         </bibl>
         <bibl id="B78">
            <title>
               <p>IEDB</p>
            </title>
            <url>http://www.immuneepitope.org</url>
         </bibl>
         <bibl id="B79">
            <title>
               <p>Weblogo</p>
            </title>
            <url>http://weblogo.berkeley.edu/</url>
         </bibl>
      </refgrp>
   </bm>
</art>
